<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Paper Jammed &#187; Indexing</title>
	<atom:link href="http://paperjammed.com/tag/indexing/feed/" rel="self" type="application/rss+xml" />
	<link>http://paperjammed.com</link>
	<description>Has paper taken over your life?</description>
	<lastBuildDate>Tue, 17 Aug 2010 00:01:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Another useful addition to my PDF document library—a home circuit map</title>
		<link>http://paperjammed.com/2010/06/25/another-useful-addition-to-my-pdf-document-library%e2%80%94a-home-circuit-map/</link>
		<comments>http://paperjammed.com/2010/06/25/another-useful-addition-to-my-pdf-document-library%e2%80%94a-home-circuit-map/#comments</comments>
		<pubDate>Sat, 26 Jun 2010 00:22:15 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Paperless Life]]></category>
		<category><![CDATA[Searching and Indexing]]></category>
		<category><![CDATA[Files and Folders]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Knowledge Management]]></category>
		<category><![CDATA[Organization]]></category>
		<category><![CDATA[Tips]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=1012</guid>
		<description><![CDATA[If you live in a slightly older home, such as mine, you occasionally might want to know which circuit breaker or fuse controls a particular outlet. Besides making it more convenient to disable the power for repairs, some of us have to deal with easily overloaded circuits that weren&#8217;t meant for all of the modern [...]]]></description>
			<content:encoded><![CDATA[<p><img class="size-full wp-image-1013 alignright" src="http://paperjammed.com/wp-content/uploads/2010/06/20100625-124149_4457-small.jpg" alt="" width="300" height="225" /></p>
<p>If you live in a slightly older home, such as mine, you occasionally might want to know which circuit breaker or fuse controls a particular outlet. Besides making it more convenient to disable the power for repairs, some of us have to deal with easily overloaded circuits that weren&#8217;t meant for all of the modern gadgetry we depend on.</p>
<p>Every homeowner can benefit from having a good map to their home outlets and circuit breakers, and a PDF scan of this map can make it extremely convenient to find two years later when you forgot you ever made it.<span id="more-1012"></span></p>
<p><strong>My Map</strong></p>
<p>Last week my wife was asking if she could run her <a href="http://www.jiffysteamer.com/">Jiffy Steamer</a> in the bathroom, or if it would trip the breaker. I remembered making my cheat sheet, so I simply brought up Spotlight on my Mac and typed in &#8220;home circuit&#8221; and was rewarded with this document:</p>
<p><img class="alignnone size-full wp-image-1014" src="http://paperjammed.com/wp-content/uploads/2010/06/20100625-circuit-breaker-list.jpg" alt="" width="529" height="502" /></p>
<p>It&#8217;s nothing fancy, but it gets the job done. I quickly scrolled to the bathroom and identified the circuit that she was using. A quick scroll through the other rooms showed that she would be safe as long as she turned off the air conditioner in the bedroom.</p>
<p><strong>Making a Circuit Map</strong></p>
<p>There&#8217;s a little bit of process involved here, and it helps if you have someone else to help you.</p>
<p>Get a notepad and scrawl a rough sketch of each room in your house that has electrical outlets, switches, and lights. Don&#8217;t forget the basement, garage, and attic. Draw a rough sketch of each electrical outlet/switch on the maps. You can see in the image above that I simply drew a little box for each outlet and a box with bumps on it for a set of switches.</p>
<p>Then, shut off a single breaker and go around the house to see everything that lost power. Take a small desk lamp with you or, better yet, a proper <a href="http://en.wikipedia.org/wiki/Test_light">line voltage test light</a>, and test every single outlet until you identify the ones that are off.</p>
<p>Every time you find an outlet or switch that is off, write the breaker number on its spot on your map. You can see that breaker 19a and breaker 7 both control the living room in my house.</p>
<p>Now turn that breaker on and then turn off the next breaker and repeat the whole process.</p>
<p>It may take ten or fifteen minutes to make the first round-trip, but with each new breaker you have fewer things to test. You really only need to test outlets or switches that have not been identified yet.</p>
<p>When you are done, scan in everything and give the file a nice long descriptive name. Throw in some keywords if you are indexing your files in some application.</p>
<p><strong>Using the Circuit Map</strong></p>
<p>If you are concerned about the load on a given circuit, you can go through the whole document and look for every matching number (such as the &#8220;19a&#8221; from my living room) to see how many devices are on that circuit.</p>
<p>If you need to shut down power to a switch or outlet for any reason, find its breaker on your map, shut off the breaker, and then <em>test the outlet with your line voltage tester before you do anything else</em>. Even though you know the right breaker, you must always double-check that the circuit is dead before performing work.</p>
<p>By the way, my wife has had that Jiffy Steamer for years, and she absolutely <em>loves</em> it—it probably ranks right next to her iPad as all-time coolest and most useful products.</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2010/06/25/another-useful-addition-to-my-pdf-document-library%e2%80%94a-home-circuit-map/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Sort out those disorganized thoughts with a Mind Map</title>
		<link>http://paperjammed.com/2010/05/27/sort-out-those-disorganized-thoughts-with-a-mind-map/</link>
		<comments>http://paperjammed.com/2010/05/27/sort-out-those-disorganized-thoughts-with-a-mind-map/#comments</comments>
		<pubDate>Fri, 28 May 2010 01:35:37 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Knowledge Management]]></category>
		<category><![CDATA[Macintosh]]></category>
		<category><![CDATA[Tips]]></category>
		<category><![CDATA[Windows]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=997</guid>
		<description><![CDATA[You know the feeling: you are involved in some intractable problem that has all kinds of weird angles and you just can’t get your head around it—perhaps you feel like you are inspecting an elephant, one square inch at a time, or maybe you simply feel like you are herding cats. There are plenty of [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-medium wp-image-999" src="http://paperjammed.com/wp-content/uploads/2010/05/iStock_000008990728XSmall-300x199.jpg" alt="iStockphoto" width="300" height="199" />You know the feeling: you are involved in some intractable problem that has all kinds of weird angles and you just can’t get your head around it—perhaps you feel like you are inspecting an elephant, one square inch at a time, or maybe you simply feel like you are herding cats.</p>
<p>There are plenty of different ways to catalog loosely associated knowledge of varying complexity—a few months back I discussed using a wiki for this—but some problems just don’t need that level of complexity and depth.</p>
<p>Some problems are more suited to random scribblings on a whiteboard, and that is where mind mapping software comes in.<span id="more-997"></span></p>
<p><strong>What&#8217;s a Mind Map?</strong></p>
<p>Imagine you and your family are going to fly to Rio de Janeiro this summer to visit distant relatives and you realize that there are about a thousand things to do  in preparation but you just can&#8217;t sort it all out.</p>
<p>You know that you need passports and you need to verify that everyone&#8217;s visa is still valid. There is the monumental task of deciding what to pack. You might want to make a checklist of places you want to visit. And you want to go hang gliding down to the beach, but there&#8217;s something nagging at you about whether or not your health insurance would cover a broken leg in a foreign land.</p>
<p>The problem is that it is difficult to keep the whole thing in your mind—if you concentrate on the luggage, you forget about the international driver&#8217;s license.</p>
<p>Mind maps allow you to visualize the whole thing at once, and you can slide stuff around and get it looking nice and pretty.</p>
<p><img class="alignnone size-full wp-image-1004" src="http://paperjammed.com/wp-content/uploads/2010/05/20100527-mind-map-11.png" alt="" width="550" height="305" /></p>
<p>This is a relatively simple start at a mind map that represents the vacation. These maps are typically read from the top right going clockwise, though many have no specific sequence. In fact, you can do pretty much what you like with a mind map as long as it works for you.</p>
<p><strong>Just Another Outliner?</strong></p>
<p>At first glance these tools look like glorified outlining applications such as OmniOutliner, and they do serve admirably in this respect, but they are so much more. An outline gives you a very easy way to organize topics and thoughts, adding annotations and such along the way, but it isn&#8217;t nearly as easy to visualize and nonlinear concepts do not map well to an outline.</p>
<p>Consider the Rio Trip example above—you could put all of that information into an outline, but it would not be nearly as easy to process mentally.</p>
<p>And these mind maps look especially cool in presentations.</p>
<p><strong>Slick Document Generation</strong></p>
<p>Some of the commercial mind mapping products provide pretty good integration with Microsoft Office products.</p>
<p>Some time back, I needed to write up a set of style standards for Oracle&#8217;s PL/SQL programming language for our offshore team. Rather than just dive into Word and hope for the best, I used Mind Manager Pro from Mindjet to make the following map:</p>
<p><img class="alignnone size-full wp-image-1005" src="http://paperjammed.com/wp-content/uploads/2010/05/20100527-mind-map-21.png" alt="" width="550" height="331" /></p>
<p>This tool supported attaching rich text to nodes in the map, so I used these notes to handle the actual code examples.</p>
<p>I was then able to put together a nice Word template that matched our corporate documents and I clicked the <strong>Export to MS Word </strong>button and had an instant document:</p>
<p><img class="alignnone size-full wp-image-1006" src="http://paperjammed.com/wp-content/uploads/2010/05/20100527-mind-map-31.png" alt="" width="550" height="483" /></p>
<p>I have also used mind maps to auto-generate PowerPoint slide decks as well. These days I use Mind Manager on a daily basis at the office; it makes a great difference when I am trying to grasp complex topics with lots of strange dangly bits hanging off of the edges.</p>
<p><strong>Nothing New</strong></p>
<p>Mind maps have been around for a long time. A quick search of the &#8216;Net will show you that mind mapping software is quite plentiful and mature. There are good free products for PC and Mac available and there are many commercial products that take mind mapping a step further, often integrating with Microsoft Office.</p>
<p>Take a look at <a href="http://en.wikipedia.org/wiki/Mind_map">what Wikipedia has to say</a> about mind mapping. <a href="http://www.43folders.com/2006/09/17/mac-mind-mapping">Here&#8217;s a post</a> from 43 Folders on the same topic. Peter Russell <a href="http://www.peterrussell.com/MindMaps/Uses.php">has useful information</a> as well about them.</p>
<p>I learned about mind mapping from a friend at work who had been using them for years. After seeing him make some quick notes during a meeting, I was sold.</p>
<p><strong>Closing Thoughts</strong></p>
<p>I made the Rio de Janeiro example using XMind, the free &#8220;lite&#8221; version of <a href="http://www.xmind.net/">XMindPro</a>. This application is pretty full featured for a free basic version—the Pro version adds enterprise features such as import/export and collaboration.</p>
<p>There are <a href="http://www.xmind.net/downloads/">XMind versions for Mac, PC, and Linux</a>.</p>
<p>If you are looking for more, all of the commercial products offer a trial period. I use MindJet&#8217;s <a href="http://www.mindjet.com/products/mindmanager-8-win/overview">Mind Manager</a>. Be warned, these tools are expensive, just like buying MS Office, but you might just find that they more than make up for their cost with your newly found productivity.</p>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px;"><!--[if gte mso 9]><xml> <w:WordDocument> <w:View>Normal</w:View> <w:Zoom>0</w:Zoom> <w:PunctuationKerning /> <w:ValidateAgainstSchemas /> <w:SaveIfXMLInvalid>false</w:SaveIfXMLInvalid> <w:IgnoreMixedContent>false</w:IgnoreMixedContent> <w:AlwaysShowPlaceholderText>false</w:AlwaysShowPlaceholderText> <w:Compatibility> <w:BreakWrappedTables /> <w:SnapToGridInCell /> <w:WrapTextWithPunct /> <w:UseAsianBreakRules /> <w:DontGrowAutofit /> </w:Compatibility> <w:BrowserLevel>MicrosoftInternetExplorer4</w:BrowserLevel> </w:WordDocument> </xml><![endif]--><!--[if gte mso 9]><xml> <w:LatentStyles DefLockedState="false" LatentStyleCount="156"> </w:LatentStyles> </xml><![endif]--><!--  /* Style Definitions */  p.MsoNormal, li.MsoNormal, div.MsoNormal 	{mso-style-parent:""; 	margin:0in; 	margin-bottom:.0001pt; 	mso-pagination:widow-orphan; 	font-size:12.0pt; 	font-family:"Times New Roman"; 	mso-fareast-font-family:"Times New Roman";} @page Section1 	{size:8.5in 11.0in; 	margin:1.0in 1.25in 1.0in 1.25in; 	mso-header-margin:.5in; 	mso-footer-margin:.5in; 	mso-paper-source:0;} div.Section1 	{page:Section1;} --><!--[if gte mso 10]> <mce:style><!   /* Style Definitions */  table.MsoNormalTable 	{mso-style-name:"Table Normal"; 	mso-tstyle-rowband-size:0; 	mso-tstyle-colband-size:0; 	mso-style-noshow:yes; 	mso-style-parent:""; 	mso-padding-alt:0in 5.4pt 0in 5.4pt; 	mso-para-margin:0in; 	mso-para-margin-bottom:.0001pt; 	mso-pagination:widow-orphan; 	font-size:10.0pt; 	font-family:"Times New Roman"; 	mso-ansi-language:#0400; 	mso-fareast-language:#0400; 	mso-bidi-language:#0400;} --> <!--[endif]--></p>
<p class="MsoNormal">You know the feeling: you are involved in some intractable problem that has all kinds of weird angles and you just can’t get your head around it—perhaps you feel like you are inspecting an elephant, one square inch at a time, or maybe you simply feel like you are herding cats.</p>
<p class="MsoNormal">
<p class="MsoNormal">There are plenty of different ways to catalog loosely associated knowledge of varying complexity—a few months back I discussed using a wiki for this—but some problems just don’t need that level of complexity and depth.</p>
<p class="MsoNormal">
<p class="MsoNormal">Some problems are more suited to random scribblings on a whiteboard, and that is where mind mapping software comes in.</p>
<p class="MsoNormal">
<p class="MsoNormal">A short time back I was asked to serve on a project team that is involved in managing a stream of data going to dozens of downstream systems. Not only is the project massive, but each of these downstream products has its own project team and politics to deal with.</p>
<p class="MsoNormal">How do you even begin to learn about forty different computer systems? What do you do with all of the odd little tidbits of information that keep flowing in from all sides?</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2010/05/27/sort-out-those-disorganized-thoughts-with-a-mind-map/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Dodged the corrupt-document bullet this time, just barely&#8230;</title>
		<link>http://paperjammed.com/2009/10/27/dodged-the-corrupt-document-bullet-this-time-just-barely/</link>
		<comments>http://paperjammed.com/2009/10/27/dodged-the-corrupt-document-bullet-this-time-just-barely/#comments</comments>
		<pubDate>Tue, 27 Oct 2009 21:52:30 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Searching and Indexing]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Data Loss]]></category>
		<category><![CDATA[Geeky]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Knowledge Management]]></category>
		<category><![CDATA[PDF]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=750</guid>
		<description><![CDATA[A couple of weeks ago, a co-worker sent me a PDF document to look at. He said that he was having trouble copying and pasting from the document and was scratching his head about why this particular PDF would have such issues. As it would turn out, there were several thousand other documents on a [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-medium wp-image-751" title="gibberish document in a file folder" src="http://paperjammed.com/wp-content/uploads/2009/10/iStock_000006486654XSmall-300x199.jpg" alt="gibberish document in a file folder" width="300" height="199" />A couple of weeks ago, a co-worker sent me a PDF document to look at. He said that he was having trouble copying and pasting from the document and was scratching his head about why this particular PDF would have such issues.</p>
<p>As it would turn out, there were several thousand other documents on a file server that shared the same funny behavior. By the time we were done struggling with this problem I had gained new respect for PDF corruption issues and their prevention.<span id="more-750"></span></p>
<p><strong>The Problem</strong></p>
<p>We were looking to load a few thousand of these scientific reports into a fancy-schmancy new database, with linguistics searching and other bells and whistles. Much to our chagrin, these documents just weren&#8217;t loading, and we couldn&#8217;t understand why. They were text documents, with some embedded images, but mostly straightforward text.</p>
<p>Here is an excerpt:</p>
<p><img class="alignnone size-full wp-image-755" title="20091027-plaintext" src="http://paperjammed.com/wp-content/uploads/2009/10/20091027-plaintext.gif" alt="20091027-plaintext" width="521" height="93" /></p>
<p>And you can tell that it is right and proper text because when I blow it up all the way, the fonts are nice and smooth—this isn&#8217;t just an image of text.</p>
<p><img class="alignnone size-full wp-image-756" title="20091027-smooth-letter" src="http://paperjammed.com/wp-content/uploads/2009/10/20091027-smooth-letter.gif" alt="20091027-smooth-letter" width="258" height="295" /></p>
<p>But if I copy and paste that particular paragraph into any handy editor (Notepad, in this case), this is what I see:</p>
<p><img class="alignnone size-full wp-image-757" title="20091027-notepad" src="http://paperjammed.com/wp-content/uploads/2009/10/20091027-notepad.gif" alt="20091027-notepad" width="496" height="155" /></p>
<p>And as far as I know, at this point the actual text is beyond the reach of average folks like me. We tried, believe me we tried.</p>
<p><strong>What went wrong?</strong></p>
<p>A quick Google of the subject led us to understand that many PDF generation tools embed subsets of fonts, with nonstandard mappings from the text to the font.</p>
<p>This fellow explains it nicely:</p>
<p>&#8220;The PDF file does not contain all the information to extract the text. The problem is that a character in a PDF file may not contain information what &#8220;real&#8221; character it relates to. Some PDF generators do a pretty bad job when they embed fonts into PDF files. They use a proprietary encoding mechanism (e.g. 1 is A, 2 is B, 3 is C, &#8230;) in both the embedded font and when they place glyphs on the page. Without a table that implements the reverse (e.g. character code 1 is &#8216;A&#8217;) you cannot extract text from such a file.</p>
<p>There is nothing you can do (besides to complain to whoever created the PDF file, and the author of the software that created this file).&#8221;<br />
— from <a href="http://www.experts-exchange.com/Web_Development/Document_Imaging/Adobe_Acrobat/Q_21426533.html">khkremer on experts-exchange.com</a></p>
<p>As it would turn out, many of the reports had been generated by printing to Adobe Distiller from Microsoft Word. It would seem that the default settings used for Distiller included the &#8220;totally hose my document content&#8221; switch.</p>
<p><strong>The Solution</strong></p>
<p>We fretted over this quite a bit. These are important scientific reports, and there is no way to easily ungarble them. We finally ended up contacting the <a href="http://finereader.abbyy.com/">Abbyy Finereader</a> folks and trying out their OCR toolkit for Linux: not only did this product make fast work of running optical character recognition on the sample document, but once we had a script running, we managed to blow through the 10,000 pages the trial license gave us, in a day or two.</p>
<p><strong>Imperfect, at best</strong></p>
<p>I am happy that we were able to salvage the bulk of the electronic knowledge found within those thousands of files, but our work barely scratched the surface.</p>
<p>For example, most of these documents have rich bookmarking of sections and keywording, such as this (content tastefully blurred on purpose).</p>
<p><img class="alignnone size-full wp-image-760" title="20091027-doc-with-contents" src="http://paperjammed.com/wp-content/uploads/2009/10/20091027-doc-with-contents.gif" alt="20091027-doc-with-contents" width="500" height="348" /></p>
<p>In addition, scientific documents typically have loads of tables full of numbers. Though it is possible to mine this data with a good OCR tool (the FineReader API provides tools for just this purpose), the tables are far more difficult to extract correctly once the original text information is lost.</p>
<p><strong>Final thoughts</strong></p>
<p>I wrote a few weeks about document formats, <a href="http://paperjammed.com/2009/09/29/are-your-portable-document-format-files-all-that/">mentioning the PDF/A document standard</a>. This is worth investigating, regardless of what your document needs are.</p>
<p>If our thousands of files had been originally generated as PDF/A, it is certain that we would have been able to copy/paste from them without problem: PDF/A prohibits such font shenanigans as were perpetrated on our garbled reports.</p>
<p>In the end, our OCR sledgehammer approach worked like a charm, and is probably sufficient for our needs. Text mining is a pretty slushy business, so no-one will complain if there are a few typos on each page—if they find the doc in a search, they can print it and read it the old fashioned way.</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/10/27/dodged-the-corrupt-document-bullet-this-time-just-barely/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Macworld: 7 tips for using Faces in iPhoto &#8217;09</title>
		<link>http://paperjammed.com/2009/07/20/macworld-7-tips-for-using-faces-in-iphoto-09/</link>
		<comments>http://paperjammed.com/2009/07/20/macworld-7-tips-for-using-faces-in-iphoto-09/#comments</comments>
		<pubDate>Tue, 21 Jul 2009 01:13:34 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Searching and Indexing]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Macintosh]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Photos]]></category>
		<category><![CDATA[Reviews]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=613</guid>
		<description><![CDATA[Thanks to its face-recognition tool, iPhoto ’09 can now put names to the faces in your photographs, letting you quickly sift through your library based on content rather than how photos are arranged. But putting this feature to work requires some effort on your part. A few months back I received my copy of iLife [...]]]></description>
			<content:encoded><![CDATA[<blockquote><p>Thanks to its face-recognition tool, iPhoto ’09 can now put names to the faces in your photographs, letting you quickly sift through your library based on content rather than how photos are arranged. But putting this feature to work requires some effort on your part.</p></blockquote>
<p>A few months back I received my copy of iLife &#8217;09 and <a href="http://paperjammed.com/2009/02/24/tagging-my-photos-just-got-a-little-bit-easier-on-the-mac/">was quite pleased with the new Faces feature</a>—I recall losing a couple of evenings sifting through photos and finding people I had forgotten.</p>
<p>Derrick Story from Macworld has written an article providing some tips for getting the most out of Faces. Even if you have been using the product since it was released, you may find these tips useful.</p>
<p>For example, after reading the article I immediately set up a smart folder that shows all unassigned faces—called &#8220;Missing Persons.&#8221; A simple idea that I would have never thought of.</p>
<p>I still don&#8217;t tire of refining the Faces gallery in my iPhoto collection!</p>
<p>Read the full article here: <a href="http://www.macworld.com/article/141746/2009/07/faces_tips.html">7 tips for using Faces in iPhoto ’09</a> (Macworld.com)</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/07/20/macworld-7-tips-for-using-faces-in-iphoto-09/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>HowStuffWorks — How Paperless Offices Work</title>
		<link>http://paperjammed.com/2009/07/03/howstuffworks-%e2%80%94-how-paperless-offices-work/</link>
		<comments>http://paperjammed.com/2009/07/03/howstuffworks-%e2%80%94-how-paperless-offices-work/#comments</comments>
		<pubDate>Sat, 04 Jul 2009 00:30:42 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Green Living]]></category>
		<category><![CDATA[Paperless Life]]></category>
		<category><![CDATA[Workflow]]></category>
		<category><![CDATA[Good Sites]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Online Services]]></category>
		<category><![CDATA[Organization]]></category>
		<category><![CDATA[Security]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=594</guid>
		<description><![CDATA[I have always been a big fan of HowStuffWorks, with their detailed in-depth articles describing such disparate topics as manual transmissions and money laundering. Anyway, author Diane Dannenfeldt has written a lengthy article on How Paperless Offices Work, giving ample coverage to myriad aspects of the topic: Introduction to How Paperless Offices Work Benefits of [...]]]></description>
			<content:encoded><![CDATA[<p><img class="size-full wp-image-595 alignnone" src="http://paperjammed.com/wp-content/uploads/2009/07/20090703-howstuffworks.jpg" alt="20090703-howstuffworks" width="492" height="352" /></p>
<p>I have always been a big fan of HowStuffWorks, with their detailed in-depth articles describing such disparate topics as <a href="http://auto.howstuffworks.com/transmission.htm">manual transmissions</a> and <a href="http://money.howstuffworks.com/money-laundering.htm">money laundering</a>.</p>
<p>Anyway, author Diane Dannenfeldt has written a lengthy article on How Paperless Offices Work, giving ample coverage to myriad aspects of the topic:</p>
<ul>
<li>Introduction to How Paperless Offices Work</li>
<li>Benefits of a Paperless Office</li>
<li>Transitioning to a Paperless Office</li>
<li>Managing Digital Documents</li>
<li>Going Paperless at Home</li>
<li>Paperless Office Solutions</li>
</ul>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Introduction to HoPaperless Offices Work</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Benefits of a Paperless Office</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Transitioning to a Paperless Office</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Managing Digital Documents</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Going Paperless at Home</div>
<div id="_mcePaste" style="position: absolute; left: -10000px; top: 0px; width: 1px; height: 1px; overflow-x: hidden; overflow-y: hidden;">Paperless Office Solutions</div>
<p>Take a look at the full article here: <a href="http://communication.howstuffworks.com/how-paperless-offices-work.htm">How Paperless Offices Work</a> (howstuffworks.com)</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/07/03/howstuffworks-%e2%80%94-how-paperless-offices-work/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pick a file name style and stick with it</title>
		<link>http://paperjammed.com/2009/02/07/pick-a-file-name-style-and-stick-with-it/</link>
		<comments>http://paperjammed.com/2009/02/07/pick-a-file-name-style-and-stick-with-it/#comments</comments>
		<pubDate>Sun, 08 Feb 2009 03:09:16 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Searching and Indexing]]></category>
		<category><![CDATA[Workflow]]></category>
		<category><![CDATA[Indexing]]></category>

		<guid isPermaLink="false">http://paperjammed.net/?p=115</guid>
		<description><![CDATA[As soon as you begin storing your paper life on your computer, you are faced with the question of how to name your files. If you carefully select one system and stick with it, your entire document workflow will improve. Here are some thoughts to consider, and a suggestion or two.]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-114" title="20090207-finder-1" src="http://paperjammed.net/wp-content/uploads/2009/02/20090207-finder-1.jpg" alt="20090207-finder-1" width="224" height="161" />One of the problems we all face in our digital lives is the question of naming and storing our files. We create possibly thousands of documents in many formats and try to store them in some meaningful fashion.</p>
<p>If you carefully select one system and stick with it, your entire document workflow will improve.</p>
<p>Here are a few things to consider as your paperless life grows.<span id="more-115"></span></p>
<p><strong>Consistent names are good</strong></p>
<p>You probably do need a consistent style for your document names. Arbitrary names such as &#8220;Homework assignment 1&#8243; work fine when you have a few dozen documents, but as your library grows, the file names become less helpful. Imagine trying to find that document in a deep folder structure with thousands of similar items.</p>
<p>Remember that anything of importance that you have in your personal digital library is likely to be moved around quite a bit in your lifetime. Operating systems change. Software companies go out of business. Having a good &#8220;lowest common denominator&#8221; in your file names can help protect you as your files are tossed about over the years.</p>
<p><strong>Pick one system and stick with it</strong></p>
<p>Once you have a substantial collection of documents, it becomes quite tedious to go through them on some great name-changing expedition. In addition, changing file names causes indigestion in your backups, and is particularly hard to implement if you keep permanent backups on media such as CDs and DVDs.</p>
<p><strong>Keep the name relatively computer friendly</strong></p>
<p>There was once a time when savvy users would never embed a space character in a filename. Those days are all but gone, and we can happily use spaces to improve the readability of filenames. This doesn&#8217;t mean that you don&#8217;t need to consider other limitations.</p>
<p>Don&#8217;t make big long cumbersome names that choke &#8220;open file&#8221; windows and folder views.</p>
<p>Avoid funny characters, even if the operating system allows them. For example, Mac OS X allows you to use characters such as &#8220;/&#8221; in names, but when you do so, OS X silently substitutes them, keeping a &#8220;display name&#8221; and a &#8220;real name&#8221; for each file.</p>
<p><img class="alignnone size-full wp-image-116" title="20090207-filenames-1" src="http://paperjammed.net/wp-content/uploads/2009/02/20090207-filenames-1.gif" alt="20090207-filenames-1" width="529" height="166" /></p>
<p>In the example above you can see that I named a file &#8220;bozo/clown.jpg&#8221; but the terminal window on the left shows that OS X kindly replaced the Unix-unfriendly &#8220;/&#8221; with a &#8220;:&#8221;.</p>
<p>Imagine the challenges you will face when you try sharing this file with a Windows user or searching for duplicates of this file in your backups.</p>
<p><strong>Make the name meaningful</strong></p>
<p>There are loads of content management applications out there, but they pretty much all depend on maintaining their own special repository of keywords, tags, and annotations for your files. Guess what happens if you ever have to change to a new document management application? All of those fancy duds are stripped away, leaving your documents bare, with only their names and their content.</p>
<p>There are many situations where the filename is all you have to go on; quite often, filenames are the only thing you can search for.</p>
<p>In other words, put some meaning in the name. Instead of &#8220;Homework assignment #1&#8243; try &#8220;EN-101 Writing assignment 1, My Summer Vacation.&#8221; I&#8217;ll admit, that&#8217;s a bit wordy, but you see that it includes the course name, the type of document, and a bit of the title.</p>
<p>Consider putting dates in your filenames. Files get copied all the time, and sometimes the &#8220;creation date&#8221; doesn&#8217;t survive the copy operation. In addition, if you are scanning in a paper document that is dated &#8220;March 25, 1992,&#8221; you might want to have the file name reflect the original paper document&#8217;s date and not an arbitrary file system date.</p>
<p><strong>My own preferred style</strong></p>
<p>This is how I like to name my documents:</p>
<p><em>yyyymmdd</em> {General Topic} &#8211; {Specific Title}.pdf</p>
<p>For example:</p>
<p><tt>20090207 Auto - Registration for Jetta.pdf</tt></p>
<p>When I scan in a document, I try to find the most appropriate date for the creation date; in the example, I might have chosen either the date that NJ DMV wrote on the renewal letter, or the postmark.</p>
<p>I include the eight-digit date in exactly that format because it sorts perfectly. If you have a folder with a hundred files that have been named in this way, you can sort on the file name and they will appear in chronological order.</p>
<p>There are as many different systems for naming files as there are people—find one that works for you and stick with it.</p>
<p>Here&#8217;s <a href="http://getitdone.quickanddirtytips.com/file-naming-conventions.aspx">a short page on the subject</a> from Get-it-Done Guy&#8217;s Quick and Dirty Tips.</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/02/07/pick-a-file-name-style-and-stick-with-it/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>It&#8217;s all about searching</title>
		<link>http://paperjammed.com/2009/02/05/its-all-about-searching/</link>
		<comments>http://paperjammed.com/2009/02/05/its-all-about-searching/#comments</comments>
		<pubDate>Fri, 06 Feb 2009 03:55:22 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Searching and Indexing]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Tools of the Trade]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Security]]></category>

		<guid isPermaLink="false">http://paperjammed.net/?p=85</guid>
		<description><![CDATA[The first time I searched for a random scanned document on my computer—and found it—I realized that this is one of the greatest benefits of the paperless home. This is one area where traditional paper lacks. And with the free products available today, there&#8217;s no need to be without. In this article, I describe two [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-86" title="istock_000008277336xsmall" src="http://paperjammed.net/wp-content/uploads/2009/02/istock_000008277336xsmall.jpg" alt="istock_000008277336xsmall" width="170" height="254" />The first time I searched for a random scanned document on my computer—and <em>found</em> it—I realized that this is one of the greatest benefits of the paperless home. This is one area where traditional paper lacks. And with the free products available today, there&#8217;s no need to be without.</p>
<p>In this article, I describe two searching options, one for the Mac and one for the PC.<span id="more-85"></span></p>
<p>A few days ago I wrote a post about <a href="http://paperjammed.net/2009/02/03/five-minutes-in-the-paperless-life/">five minutes in the paperless life</a>, where I described registering my wife&#8217;s car and dealing with the associated paperwork.</p>
<p>As I filled out the form, I needed my insurance information, but had no idea where it was. I simply hit ⌘-Space and typed in my insurance company&#8217;s name, immediately bringing up all documents that mention the company, including my recent policy renewal.</p>
<p>Until the past couple of years, such full-system document searching on a PC or Mac was hit-or-miss, slow and unreliable. Anyone serious about finding their documents needed special document indexing software.</p>
<p>Today, anyone can begin effortless full-system searching for free on either operating system.</p>
<p><strong>Spotlight</strong></p>
<p>If you are on a Macintosh, running Tiger or Leopard, you have access to an <a href="http://support.apple.com/kb/HT2531">excellent document indexing tool</a> that is built right in to your operating system. Just click on the little magnifying glass in the upper-right corner of the screen and start typing; you will see all kinds of documents, files, applications, etcetera, that match your search.</p>
<p><img class="alignright size-thumbnail wp-image-88" title="20090205-screenshot-1" src="http://paperjammed.net/wp-content/uploads/2009/02/20090205-screenshot-1-150x150.jpg" alt="20090205-screenshot-1" width="150" height="150" />This is especially nice because there is no sluggish indexing process running all the time. The Os X operating system provides filesystem events that tell any interested applications that &#8220;something has changed over here on the disk&#8221; so there is no need to laboriously crawl the filesystem looking for changes.</p>
<p>Spotlight will always search filenames and any metadata you have added to the file using the Finder.</p>
<p>If your documents are in PDF (that have been OCR&#8217;d) then Spotlight will search the entire contents of every single PDF on your system for the search string. This is good.</p>
<p>Of course, any word processing documents are indexed and searched as well.</p>
<p>It&#8217;s a thing of beauty when you type in some obscure account number and see five documents pop up that happen to mention the number.</p>
<p>If you prefer the keyboard to the mouse, you can hit ⌘-Space to bring up Spotlight.</p>
<p><strong>Google Desktop</strong></p>
<p>On the PC (and the Mac) you can use <a href="http://desktop.google.com/features.html">Google Desktop</a>. This is truly an amazing product, and it&#8217;s free. It does exactly what its name says: it puts Google on your desktop, in a very literal sense.</p>
<p>I use this tool regularly at work, where our machines all run Windows XP, and I love it. The only reason why I don&#8217;t use it on my Mac is because Spotlight is more tightly integrated with the operating system.</p>
<p><img class="alignright size-full wp-image-90" title="20090205-quicksearch" src="http://paperjammed.net/wp-content/uploads/2009/02/20090205-quicksearch.gif" alt="20090205-quicksearch" width="300" height="152" />Usage is simple: you type in a few words and are immediately presented with a dropdown list of possible document hits. If you want, you can see a full web page, appearing exactly like a Google search page (no surprise there), but with your local documents. In addition, when you do regular Google searches, you will see hits that have been found on your local machine. Very nice.</p>
<p>Just like Spotlight on the Macintosh, Google Desktop searches file names and file content. Again, if you have run OCR on your scanned PDF documents, the content will be searched.</p>
<p>Like Spotlight, there is a hotkey to launch the search. Just hit the <tt>Ctrl</tt> key twice.</p>
<p><strong>Privacy Issues</strong></p>
<p>It is important for you to understand that your data is easily searchable from your keyboard. For an illustration of what this means, try typing in your Social Security number in the search field—if you have scanned in many private documents (e.g. tax returns), you might be surprised how many hits you find.</p>
<p>This is not necessarily a bad thing since quite often you are searching for sensitive documents. Besides, the search tool itself provides an easy way to smoke out these kinds of documents in order to protect them better. </p>
<p>Both tools provide the ability to tweak the kind of content searched as well as the ability to identify which portions of your hard drive are accessible and which are private.</p>
<p>If you are very serious about locking down confidential files you need to carefully study the security concerns surrounding indexing tools. For example, <a href="http://www.heise-online.co.uk/news/Vista-Word-and-Google-Desktop-circumvent-TrueCrypt-function--/111118">here&#8217;s an article</a> discussing how tools such as Google Desktop might circumvent the privacy of encrypted volumes offered by <a href="http://en.wikipedia.org/wiki/TrueCrypt">TrueCrypt</a> (a popular open-source encryption product).</p>
<p>Key Points</p>
<blockquote><p>If you have never used either of these products, they are both free and you will be amazed with the results.</p>
<p>These search tools really shine when you run OCR on any scanned documents.</p>
<p>Remember that your data is now searchable by anyone sitting at your machine.</p></blockquote>
<p>If you aren&#8217;t using Google Desktop or Spotlight right now, what are you waiting for?</p>
<p>[Update: I guess I should practice what I preach—a few minutes ago I typed my SSN into the Spotlight search field and was quite surprised to find a decade-old Word document. It was a letter I sent to a former employer giving my address change information. Oops!]</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/02/05/its-all-about-searching/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
