<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Paper Jammed &#187; Scanning</title>
	<atom:link href="http://paperjammed.com/category/scanning/feed/" rel="self" type="application/rss+xml" />
	<link>http://paperjammed.com</link>
	<description>Has paper taken over your life?</description>
	<lastBuildDate>Wed, 30 Jun 2010 02:14:53 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Showin&#8217; your chops on those piles of sheet music</title>
		<link>http://paperjammed.com/2010/03/29/showin-your-chops-on-those-piles-of-sheet-music/</link>
		<comments>http://paperjammed.com/2010/03/29/showin-your-chops-on-those-piles-of-sheet-music/#comments</comments>
		<pubDate>Tue, 30 Mar 2010 00:34:23 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Paperless Life]]></category>
		<category><![CDATA[Scanning]]></category>
		<category><![CDATA[Searching and Indexing]]></category>
		<category><![CDATA[Files and Folders]]></category>
		<category><![CDATA[Music]]></category>
		<category><![CDATA[Organization]]></category>
		<category><![CDATA[PDF]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=957</guid>
		<description><![CDATA[Show me a musician and I&#8217;ll show you someone who has at least a three foot stack of sheet music squirreled away somewhere.
My situation is worse—both my wife and I are musicians, to one degree or another. Throw in the fact that she is a music teacher and you can imagine just how many pages [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-medium wp-image-959" title="Hollow Body" src="http://paperjammed.com/wp-content/uploads/2010/03/iStock_000000536065XSmall-300x257.jpg" alt="iStockphoto" width="300" height="257" />Show me a musician and I&#8217;ll show you someone who has at least a three foot stack of sheet music squirreled away somewhere.</p>
<p>My situation is worse—both my wife and I are musicians, to one degree or another. Throw in the fact that she is a music teacher and you can imagine just how many pages of sheet music there are filling bins and flexing cheap shelving in my house.</p>
<p><strong>What do I have and Where is it?</strong></p>
<p>The biggest problem we face is knowing what we have and where it is. I have hundreds and hundreds of pages of classical and jazz guitar sheet music, but if I need to find Villalobos&#8217; <em>Choros no. 1</em>, where do I look?<span id="more-957"></span></p>
<p>Shortly after I bought my ScanSnap, I began scanning in all of my sheet music (I have left much of my wife&#8217;s collection untouched—I&#8217;m sure you&#8217;ll understand). In most cases, I simply hacked the spine off of the original book and fed the sheets through the scanner. Now, I have less paper in the house and my music is searchable.</p>
<p>In most cases I didn&#8217;t bother to run OCR on the documents since there is little in the way of printed words on most sheet music that is worth indexing. I did take care to name the files well.</p>
<p>If you ever hope to find your music on your computer, make sure you include at least the composer/artist and song title in the file name.</p>
<p><strong>Is this really cutting down on paper?</strong></p>
<p>Whenever I find what I&#8217;m looking for I might play it directly off of the computer screen, but it is more likely that I&#8217;ll print it out. Doesn&#8217;t this kind of negate the idea of removing paper from my home? Not really. Think about it—most sheet music is never played. We have books with hundreds of songs in them and we play only  a handful. That&#8217;s just the way it is.</p>
<p>The fact that I print out five or ten pages in a month does not negate the many hundreds of pages that were scanned and then recycled.</p>
<p><strong>Great for Music Lessons</strong></p>
<p>I started taking jazz lessons again a month or two back, and my teacher gave me some lead sheets, with all kinds of useful annotations on them. As soon as I was home, I scanned those babies in, so I would not risk losing the valuable information. I also went through all of my notes from prior lessons and scanned them in as well. These kinds of things are precisely the sorts of paper that tend to get lost in some mismash of unsorted music.</p>
<p>Now, I can type in &#8220;Four&#8221; in my favorite PDF library application and find the lead sheet for Miles Davis&#8217; <em>Four</em>.</p>
<p><img class="alignnone size-full wp-image-958" title="20100329-yep" src="http://paperjammed.com/wp-content/uploads/2010/03/20100329-yep.png" alt="" width="535" height="404" /></p>
<p>Maybe you don&#8217;t have that many notebooks full of music lesson notes, but when you have been trying (poorly) to learn for as many years as I have, those notebooks begin to proliferate. Just scan them all in, give them some good filenames, add some keywords to help, and you&#8217;re in business.</p>
<p><strong>What about copyright?</strong></p>
<p>It seems that the jury is still out on digitizing works you own. There&#8217;s <a href="http://www.wired.com/gadgetlab/2009/12/diy-book-scanner/">one fellow who made a right awesome device</a> for scanning in textbooks in minutes, by photographing the pages. That guy&#8217;s machine has spurred much debate about whether or not you have the right to digitize your own stuff.</p>
<p>On the one hand, you bought the book and paid for it, so it would seem that fair use covers this; on the other hand, publishers are eager to monetize digital media, reselling the same works to you if they can.</p>
<p>So, is Daniel Reetz&#8217;s butt-kickin&#8217; book scanner legal?</p>
<p style="padding-left: 30px;">That would depend on who you talk to, says Pamela Samuelson, a professor at University of California at Berkeley, who specializes in digital-copyright law. Trade publishers are almost certain to cry copyright infringement, she says, though it may not necessarily be the case.</p>
<p style="padding-left: 30px;">Google was recently forced to pay $125 million to settle with angry book publishers and authors who claimed copyright infringement as a result of the search giant’s book-scanning project.</p>
<p style="padding-left: 30px;">But not so individual users who already own the book, says Samuelson. If you scan a book that you have already purchased, it is “fine, and fair use,” she says. “Personal-use copying should be deemed to be fair, unless there is a demonstrable showing of harm to the market for the copyright at work,” says Samuelson.</p>
<p style="padding-left: 30px;">(<a href="http://www.wired.com/gadgetlab/2009/12/diy-book-scanner/">Source</a>: wired.com)</p>
<p>Here&#8217;s another take on this:</p>
<p style="padding-left: 30px;">Question</p>
<p style="padding-left: 60px;">I bought a book for school, can I make a copy of the book for my own use to write on so I don&#8217;t write in the book and can get my money back when I return the book to the campus store.</p>
<p style="padding-left: 30px;">Accepted Answer</p>
<p style="padding-left: 60px;">You have the right to make a copy of the book you purchased as long as you are using the copy for your personal use. The copyright laws merely prevent you from making copies to sell or distribute.</p>
<p style="padding-left: 30px;">(<a href="http://www.justanswer.com/questions/2heyq-i-bought-a-book-for-school-can-i-make-a-copy-of-the-book-for">Source</a>: justanswer.com)</p>
<p>Of course, if you go passing your PDF documents around to all of your friends, all bets are off.</p>
<p><strong>Final thoughts</strong><br />
Music is a hobby that seems to accumulate great stacks of paper, but these music sheets are peculiar in that you only need one or two out of every hundred. Why not digitize the whole lot and keep those book shelves from sagging?</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2010/03/29/showin-your-chops-on-those-piles-of-sheet-music/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Snow Leopard Update for ScanSnap</title>
		<link>http://paperjammed.com/2009/11/13/snow-leopard-update-for-scansnap/</link>
		<comments>http://paperjammed.com/2009/11/13/snow-leopard-update-for-scansnap/#comments</comments>
		<pubDate>Sat, 14 Nov 2009 04:36:58 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Scanning]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Tools of the Trade]]></category>
		<category><![CDATA[Hardware]]></category>
		<category><![CDATA[Macintosh]]></category>
		<category><![CDATA[PDF]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=770</guid>
		<description><![CDATA[This evening I opened my email and found a most welcome message: Fujitsu has released their patched version of the ScanSnap software for Snow Leopard.
[UPDATE: I spoke too soon—they only delivered half of the goods. See below.]
[UPDATE 2: Hurray! It's fixed! The birds are chirping and the sun is shining and life is good!]
When Snow [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-773" title="20091113-scansnap-update" src="http://paperjammed.com/wp-content/uploads/2009/11/20091113-scansnap-update.gif" alt="20091113-scansnap-update" width="371" height="228" />This evening I opened my email and found a most welcome message: <a href="http://www.fujitsu.com/us/services/computing/peripherals/scanners/support/sl_download.html">Fujitsu has released their patched version of the ScanSnap software for Snow Leopard</a>.</p>
<p>[UPDATE: I spoke too soon—they only delivered half of the goods. See below.]</p>
<p>[UPDATE 2: Hurray! It's fixed! The birds are chirping and the sun is shining and life is good!]</p>
<p>When Snow Leopard came out back in August, I ordered my copy the first week and was so excited that I installed it the day it arrived. <a href="http://paperjammed.com/2009/09/07/when-migrating-to-a-new-operating-system-look-before-you-leap/">My joy was short-lived</a>, however: the most important software package I use did not work!<span id="more-770"></span></p>
<p>I depend greatly on the OCR capabilities of the ABBYY FineReader software that comes with the ScanSnap scanners, and this was one of the many pieces of software that did not smoothly transition to Snow Leopard. I could scan documents, with limited functionality, but the OCR feature did not work.</p>
<p>Now that Fujitsu has released their official update, I will probably be installing Snow Leopard tomorrow evening. Now, do I do an upgrade or a full install? Hmmmm&#8230;</p>
<p>UPDATE</p>
<p>Well, they only delivered half of the goods. <img src='http://paperjammed.com/wp-includes/images/smilies/icon_sad.gif' alt=':-(' class='wp-smiley' /> </p>
<p>I was reading through the seven-step process for updating the ScanSnap drivers and I arrived at step seven:</p>
<blockquote><p><strong>Step 7:</strong> The download for FineReader for ScanSnap update to Snow Leopard will be hosted by ABBYY but is not yet available. If you have already subscribed to be notified by Fujitsu regarding the Snow Leopard updates, an email will be sent to you when it is posted.</p></blockquote>
<p>How displeasing. The only thing I really cared about was getting the OCR to work, and apparently Abbyy has not yet delivered their part (How hard can it be to update the part in your code that says &#8220;If the PDF metadata doesn&#8217;t match X then the document isn&#8217;t a ScanSnap doc&#8221; ?).</p>
<p>UPDATE 2</p>
<p>Instead of making us wait another month or two, ABBYY has delivered their patch in record time <img src='http://paperjammed.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> . I quickly installed the update and tested my ScanSnap script&#8230;</p>
<p>There are a few kinks to work out, but it is pretty clear that I will have my same old workflow AppleScript folder action up and running in short order. I will post the newer script as soon as I have it running properly.</p>
<p>And, as a side note, I had already purchased <a href="http://www.smileonmymac.com/PDFpen/index.html">PDFpen</a> from <a href="http://www.smileonmymac.com/">SmileOnMyMac</a> as a backup plan. Their tool incorporates the <a href="http://www.nuance.com/imaging/omnipage/omnipage-professional.asp">OmniPage OCR engine</a>, an engine that rivals that of ABBYY. My script was already running with PDFpen, but there were some issues with tables in documents that I forwarded on to SmileOnMyMac. One of these days I&#8217;ll post my scripts for PDFpen.</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/11/13/snow-leopard-update-for-scansnap/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Are your Portable Document Format files all that?</title>
		<link>http://paperjammed.com/2009/09/29/are-your-portable-document-format-files-all-that/</link>
		<comments>http://paperjammed.com/2009/09/29/are-your-portable-document-format-files-all-that/#comments</comments>
		<pubDate>Wed, 30 Sep 2009 00:41:36 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Green Living]]></category>
		<category><![CDATA[Paperless Life]]></category>
		<category><![CDATA[Scanning]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Data Loss]]></category>
		<category><![CDATA[PDF]]></category>
		<category><![CDATA[Printing]]></category>
		<category><![CDATA[Searching and Indexing]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=692</guid>
		<description><![CDATA[Like most people who are trying to archive reams of paper, the one reliable tool I always turn to is Adobe Portable Document Format.
I trust my digital life to PDF. Almost everything I scan and most documents I write eventually end up squirreled away somewhere as PDF documents.
Have you ever considered just how portable those [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-medium wp-image-696" src="http://paperjammed.com/wp-content/uploads/2009/09/iStock_000009658438XSmall-201x300.jpg" alt="Lost keys at the beach" width="201" height="300" />Like most people who are trying to archive reams of paper, the one reliable tool I always turn to is Adobe Portable Document Format.</p>
<p>I trust my digital life to PDF. Almost everything I scan and most documents I write eventually end up squirreled away somewhere as PDF documents.</p>
<p>Have you ever considered just how portable those documents really are?</p>
<p><strong>What&#8217;s wrong with PDF?</strong></p>
<p><strong></strong>It seems strange to question the portability of these files, doesn&#8217;t it?</p>
<p>For the past ten or fifteen years Adobe has been providing Acrobat Reader and singing the wonders of their new universal document format. And it seemed to be all that, too—regardless of the audible groan we give when Acrobat launches after we click a link, isn&#8217;t it amazing that we can download press-ready copies of our income tax forms, that are guaranteed to look exactly the same when you print them as when I print them? Read on to see what dangers lurk within.<span id="more-692"></span></p>
<p><strong>What&#8217;s the Problem?</strong></p>
<p>In order to understand the nature of the PDF portability issues, one need only look as far as the web browser for an analogy. Consider how the web browser went from a barebones tool that could display a simple language, HTML, in a neutral way, fitting the web content onto each user&#8217;s screen, to a memory hogging behemoth that is an integral part of your operating system. It didn&#8217;t happen all at one; it has been death by a thousand cuts.</p>
<p>Mirroring the evolution of web browsers, the PDF document standard has adapted over the years to include many bells and whistles such as embedded audio, video, and JavaScript. It is these features that chip away at the core purpose and <em>raison d&#8217;être</em> of the PDF standard.</p>
<p><strong>An example: Font Issues</strong></p>
<p><strong></strong>A simple example of the weakness of these extended PDF features is the humble text font. When your application generates a PDF document, there is the option of using 14 standard PDF fonts, local machine fonts, or embedded TTF or Postscript fonts.</p>
<blockquote><p>There are 14 standard fonts that should be available by default in each PDF reader. These fonts are Courier, Courier Bold, Courier Italic (Oblique), Courier Bold and Italic, Helvetica, Helvetica Bold, Helvetica Italic (Oblique), Helvetica Bold and Italic, Times Roman, Times Roman Bold, Times Roman Italic, Times Roman Bold and Italic, Symbol and ZapfDingBats® (<a href="http://itextdocs.lowagie.com/tutorial/fonts/index.php">source</a>)</p></blockquote>
<p>Guess what happens when you set your document in <em><a href="http://new.myfonts.com/fonts/linotype/itc-mona-lisa/">Mona Lisa Solid ITC</a></em> and then print to PDF and send to all of your colleagues? Does your friend&#8217;s machine have a copy of this font? Maybe, and maybe not.</p>
<p>As I was writing this, I planned on putting together a cute demo by saving a document set in Mona Lisa Solid ITC in PDF from my Mac and then opening it on a PC. Much to my surprise (and delight), I found that the default &#8220;Print to PDF&#8221; functionality on my Mac does, in fact, embed the font within the document.</p>
<p>Regardless, if you have always just trusted that the fonts would be identical across platforms, you could get quite a surprise when your friend tries to print your beautiful document.</p>
<p><strong>PDF/A Standard</strong></p>
<p>Some time back, Adobe recognized the need for a more tightly controlled standard, for creating <em>really portable</em> documents, instead of mere <em>portable</em> documents. This standard, dating from 2005, is referred to as <a href="http://en.wikipedia.org/wiki/PDF/A">PDF/A</a>, where the A stands for Archive.</p>
<blockquote><p>A key element to &#8230; reproducibility is the requirement for PDF/A documents to be 100 % self-contained. All of the information necessary for displaying the document in the same manner every time is embedded in the file. This includes, but is not limited to, all content (text, raster images and vector graphics), fonts, and color information. A PDF/A document is not permitted to be reliant on information from external sources (e.g. font programs and hyperlinks). (<a href="http://en.wikipedia.org/wiki/PDF/A#Description">Wikipedia</a>)</p></blockquote>
<p>Basically PDF/A forbids all of the flashy stuff and sticks to the basics: good solid document rendering.</p>
<p>Banned features include:</p>
<ul>
<li>Audio and Video</li>
<li>JavaScript</li>
<li>Encryption</li>
<li>Nonstandard metadata</li>
<li>Transparent images</li>
</ul>
<p>In addition to the loss of several features, PDF/A documents can be somewhat larger, due to the embedded fonts, and they might have rendering issues with images that depend on transparency.</p>
<p>With all that, it still sounds like an enticing concept. Many PDF tools speak fluent PDF/A. Check out your own toolkit and see if you can future-proof your documents a little more</p>
<p><strong>Here&#8217;s more on PDF/A documents</strong></p>
<p><a href="http://blog.nitropdf.com/index.php/2009/07/13/longterm-digital-archiving-pdfa/">Long-term digital archiving with PDF/A</a> (The PDF Blog)<br />
<a href="http://en.wikipedia.org/wiki/PDF/A">PDF/A</a> (Wikipedia)<br />
<a href="http://www.pdfa.org/doku.php?id=pdfa:en:pdfa_whitepaper">PDF/A &#8211; A new Standard for Long-Term Archiving</a> (PDF/A Competence Center)</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/09/29/are-your-portable-document-format-files-all-that/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>How to simplify your tech life</title>
		<link>http://paperjammed.com/2009/05/29/how-to-simplify-your-tech-life/</link>
		<comments>http://paperjammed.com/2009/05/29/how-to-simplify-your-tech-life/#comments</comments>
		<pubDate>Fri, 29 May 2009 20:28:03 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Paperless Life]]></category>
		<category><![CDATA[Scanning]]></category>
		<category><![CDATA[Searching and Indexing]]></category>
		<category><![CDATA[Good Sites]]></category>
		<category><![CDATA[Green Living]]></category>
		<category><![CDATA[Online Services]]></category>
		<category><![CDATA[Reviews]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=553</guid>
		<description><![CDATA[23 tips for getting organized, streamlining your online time, managing your media and more
In this Computerworld article, the writer gives several great tips on getting your geeky side in order. I&#8217;m happy to note that procurement of a Fujitsu ScanSnap and scanning your life to PDF made number 4 on his list.
Other useful tips include [...]]]></description>
			<content:encoded><![CDATA[<blockquote><p>23 tips for getting organized, streamlining your online time, managing your media and more</p></blockquote>
<p>In this Computerworld article, the writer gives several great tips on getting your geeky side in order. I&#8217;m happy to note that procurement of a Fujitsu ScanSnap and scanning your life to PDF made number 4 on his list.</p>
<p>Other useful tips include topics such as cable management solutions (for that rat&#8217;s nest of wires), Google Desktop search and Google Sync, online identity management, and protecting your children online.</p>
<p>Check it out: <a href="http://www.computerworld.com/action/article.do?command=viewArticleBasic&amp;taxonomyName=Networking+and+Internet&amp;articleId=9133521&amp;taxonomyId=16&amp;pageNumber=1">How to simplify your tech life</a> (Computerworld)</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/05/29/how-to-simplify-your-tech-life/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why you should digitize &#8216;everything&#8217;</title>
		<link>http://paperjammed.com/2009/05/11/why-you-should-digitize-everything/</link>
		<comments>http://paperjammed.com/2009/05/11/why-you-should-digitize-everything/#comments</comments>
		<pubDate>Mon, 11 May 2009 23:53:13 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Backups]]></category>
		<category><![CDATA[Paperless Life]]></category>
		<category><![CDATA[Scanning]]></category>
		<category><![CDATA[Searching and Indexing]]></category>
		<category><![CDATA[Data Loss]]></category>
		<category><![CDATA[Good Sites]]></category>
		<category><![CDATA[Online Services]]></category>
		<category><![CDATA[Photos]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=532</guid>
		<description><![CDATA[
&#8220;How a lifestyle experiment and a disaster made me realize the value of turning atoms into bits&#8221; — Mike Elgin

A couple of months back, Mike Elgin of Computerworld posted an article on his foray into the paperless world: Paperless office? Ha! How about a paperless life?
In this followup article, he considers how lifestyle changes and the raging [...]]]></description>
			<content:encoded><![CDATA[<blockquote>
<p style="text-align: left;">&#8220;How a lifestyle experiment and a disaster made me realize the value of turning atoms into bits&#8221; — Mike Elgin</p>
</blockquote>
<p>A couple of months back, Mike Elgin of Computerworld posted an article on his foray into the paperless world: <a href="http://www.computerworld.com/action/article.do?command=viewArticleBasic&amp;articleId=9128737">Paperless office? Ha! How about a paperless life?</a></p>
<p>In this followup article, he considers how lifestyle changes and the raging wildfires closing in on his city have made it clear to him that it is critical to protect whatever can be easily preserved in digital form.</p>
<p>Mike gives a sound strategy, starting off with a fast pass at just getting it digitized with little regard for perfection, followed by more focused efforts.</p>
<p>He pointed out how some objects just aren&#8217;t as important as the memories that they represent, illustrating this with a photograph of his son&#8217;s martial arts trophies that he has since donated to various organizations.</p>
<p>Mike makes it very clear that backups are mandatory. My kind of guy!</p>
<p>Here&#8217;s the whole article: <a href="http://www.computerworld.com/action/article.do?command=viewArticleBasic&amp;articleId=9132739">Why you should digitize &#8216;everything&#8217;</a> (Computerworld.com)</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/05/11/why-you-should-digitize-everything/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Face it—Your great CD Collection Ripping Project is never going to end!</title>
		<link>http://paperjammed.com/2009/05/03/your-great-cd-collection-ripping-project-is-never-going-to-end/</link>
		<comments>http://paperjammed.com/2009/05/03/your-great-cd-collection-ripping-project-is-never-going-to-end/#comments</comments>
		<pubDate>Mon, 04 May 2009 01:44:49 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Paperless Life]]></category>
		<category><![CDATA[Scanning]]></category>
		<category><![CDATA[Workflow]]></category>
		<category><![CDATA[Backups]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Organization]]></category>
		<category><![CDATA[Ripping]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=524</guid>
		<description><![CDATA[This afternoon was kind of lazy and rainy, and I found myself sifting through stacks of CD cases again, full of enthusiasm as I discovered some lost Rolling Stones and David Bowie albums, imagining how few discs remained before I could declare victory. But then I stumbled across a huge cache of classical music discs [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-medium wp-image-527" src="http://paperjammed.com/wp-content/uploads/2009/05/istock_000008144609xsmall-200x300.jpg" alt="" width="200" height="300" />This afternoon was kind of lazy and rainy, and I found myself sifting through stacks of CD cases again, full of enthusiasm as I discovered some lost Rolling Stones and David Bowie albums, imagining how few discs remained before I could declare victory. But then I stumbled across a huge cache of classical music discs which immediately dampened my spirits, because these are usually the kind that come up as &#8220;Unknown Disc/Unknown Artist&#8221; when you try to download the titles. And there were loads of them.</p>
<p>Here are a few thoughts I have to share on these great media library projects. Not only are they never-ending, but the longevity of the task should guide some key decisions you make as you plod along.<span id="more-524"></span></p>
<p><strong>Accept that it&#8217;s Never Going to End</strong></p>
<p>I currently have ongoing projects to scan in every scrap of paper in my house, rip every CD to MP3, and scan in every pre-digital-era photograph. But the corpus is just too large. And I haven&#8217;t even considered ripping video; I have accepted that a full digital video library is a little too ambitious for me these days.</p>
<p>We need to be aware of this from the very beginning; if you feel like you bit off more than you can chew, you are normal.</p>
<p><strong>Never Give Up</strong></p>
<p>Try to keep your goals clear, so that you don&#8217;t lose steam and give up. Perhaps you are looking forward to being able to place the plastic boxes all in storage. Perhaps you can&#8217;t wait to be able to make those uber party mixes from your rich collection. In any case, every hour you spend doing the drudgery of ripping CDs brings you a little closer to these goals.</p>
<p><strong>Prioritize Your Work</strong></p>
<p>It is a whole lot better to spend your first days ripping your favorite music to MP3 than to be ripping some weird CDs your roommate left behind in college. Today I happily skipped over the Talking Heads CDs that I inherited from my brother—those don&#8217;t really need to be in my iTunes library, do they?</p>
<p><strong>Keep it Simple</strong></p>
<p>Make your work flow as simple as possible. You may still be doing this in three years—don&#8217;t over complicate things.</p>
<p>I have been working on my own collection for the past three years. When I first started, I was ripping the music on a PC using a tool called <a href="http://cdexos.sourceforge.net/?q=screenshots">cdex</a> to rip the MP3s and another tool called <a href="http://www.mp3tag.de/en/">Mp3tag</a> to edit the tags. These days I use <a href="http://www.apple.com/itunes/">iTunes</a> on my Mac to do both of these tasks. If I had some really fancy work flow going, it might not have been so easy to change machines. By keeping things simple, I was able to do exactly the same thing today that I did three years ago.</p>
<p><strong>Choose Good Settings</strong></p>
<p>&#8230;for your software. You don&#8217;t want to have to do this again, do you? And don&#8217;t go cheap on the bit rate, if you have the space.</p>
<p>In addition, you will probably want to decide between the convenience of a universal format versus the power of a proprietary or niche format. In my example, I opted to use MP3 for all of my music instead of some of the other, more powerful, formats available. I was stung once by the switch from PC to Mac where the handful of CDs that I had ripped to .wmf files were no longer usable.</p>
<p>If you are ripping <em>everything</em> to some killer lossless format, be sure you are looking far down the road and considering compatibility issues.</p>
<p><strong>Back it up!</strong></p>
<p>Make sure your <a href="http://paperjammed.com/2009/01/29/backup-your-life/">backup process</a> is ready for the increased load. If your backups consist of burning DVDs (or worse, CDs) of your data, you will very quickly tire of this, and you may just abandon one of the most important parts of your work flow. Oh, and remember, <a href="http://paperjammed.com/2009/02/19/if-there-arent-two-copies-in-separate-places-it-isnt-a-backup/">if there aren&#8217;t two copies in separate places, it isn&#8217;t a backup</a>.</p>
<p>A couple of months ago <a href="http://paperjammed.com/2009/02/27/one-step-closer-to-sleeping-well-at-night/">I bought two of those nice pocket-size portable hard drives</a>, and I always leave one at the office and I swap them once per week. This works for me: they are 320GB hard drives, providing plenty of room for all of my thousands of scanned documents, tens of thousands of photographs, and many thousands of MP3s, with plenty of elbow room.</p>
<p>When ripping a CD collection, the data risk is lower since you still retain the original CDs, but it still is a lot of work to rip them all. Don&#8217;t overlook the backup, or you will cry when the hard drive crashes.</p>
<p><strong>Know Where You Have Been</strong></p>
<p><strong></strong>Come up with a simple &#8220;bread crumb&#8221; mechanism to know where you have already been. You want to be able to quickly identify the music CDs you already ripped, without having to bring up iTunes every time. Keep the three year horizon in mind, so you choose a technique that will not be mysterious to you after some time passes (Hmmmm&#8230;. did I dog ear the corner of the CD insert to mark it? Where did I put that notebook where I wrote them down?)</p>
<p>I chose simplicity: I have one of those nice metallic Sharpie markers, the silvery kind. I put a small dot on the hub of each CD after I ripped it, just above the center hole. This way, several years from now, I should easily be able to spot the CDs that haven&#8217;t been introduced to my digital library.</p>
<p>I do this with photographs as well. As I scan them, I put a small marker dot on the back of each photograph, in the corner.</p>
<p><strong>Finally, Enjoy Your Music!</strong></p>
<p>I am quite capable of getting all wrapped up in the process for the sake of the process. Don&#8217;t forget to actually listen to the music you ripped!</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/05/03/your-great-cd-collection-ripping-project-is-never-going-to-end/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Keeping your secrets to yourself—what can your shared documents tell others?</title>
		<link>http://paperjammed.com/2009/04/21/keeping-your-secrets-to-yourself%e2%80%94what-can-your-shared-documents-tell-others/</link>
		<comments>http://paperjammed.com/2009/04/21/keeping-your-secrets-to-yourself%e2%80%94what-can-your-shared-documents-tell-others/#comments</comments>
		<pubDate>Wed, 22 Apr 2009 02:23:16 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Paperless Life]]></category>
		<category><![CDATA[Scanning]]></category>
		<category><![CDATA[Security]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Good Sites]]></category>
		<category><![CDATA[Privacy]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=505</guid>
		<description><![CDATA[Do you ever send documents to other people that might have &#8230; sensitive information embedded in them?
Not everyone who works with documents in the home will run into this problem, but sooner or later you are probably going to find yourself in a situation where you would like to email someone a useful document that [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-medium wp-image-507" src="http://paperjammed.com/wp-content/uploads/2009/04/istock_000004573310xsmall-300x199.jpg" alt="" width="300" height="199" />Do you ever send documents to other people that might have &#8230; sensitive information embedded in them?</p>
<p>Not everyone who works with documents in the home will run into this problem, but sooner or later you are probably going to find yourself in a situation where you would like to email someone a useful document that just happens to have your social security number embedded in it, or your full name and address, or some other info that you would rather keep private.</p>
<p>This process of editing documents to remove sensitive content is referred to as <em>redaction</em>—that&#8217;s the keyword you probably want to be searching for as you tip toe through Google for guidance.</p>
<p>In this article I discuss the obvious problems we face using the most naïve approach toward document redaction, and provide some resources for better options.<span id="more-505"></span></p>
<p><strong>The only sure way</strong></p>
<p>The only absolutely certain way of guaranteeing that you cut out secret information would be to print the document, physically cut out the bad bits, scan in the document, and send the scanned PDF to your colleague. This may seem a bit extreme, but if you were an anonymous tipster sending the media a document full of mob-related evidence, containing <em>your name</em>, you might go this route (You probably don&#8217;t want to send the email from your personal account. Try a throwaway email account at the library.)</p>
<p><strong>Other options&#8230; Microsoft Word</strong></p>
<p>Don&#8217;t even think about sending a raw MS Word document to your recipient. There&#8217;s <a href="http://www.usatoday.com/tech/columnist/kimkomando/2006-01-19-hidden-msword-data_x.htm">loads of hidden stuff</a> within those documents that you might forget. If you really must, you can look into some <a href="http://support.microsoft.com/kb/223396">recommendations from Microsoft</a>, and consider tools such as Microsoft&#8217;s <a href="http://www.microsoft.com/downloads/details.aspx?familyid=144e54ed-d43e-42ca-bc7b-5446d34e5360&amp;displaylang=en">free Office add-in</a> for removing hidden data.</p>
<p><strong>Danger lurking in PDF documents</strong></p>
<p>Since my paperless life really revolves around PDF documents, this is the most likely kind of document that I would be sending via email. Unfortunately, PDF documents have even more hidden data within than MS Office documents. Many people have been burned when they tried simple attempts at obscuring parts of a PDF.</p>
<p><strong>A Simple Demonstration</strong></p>
<p>I started with a nice PDF of the Declaration of Independence.</p>
<p><img class="alignnone size-full wp-image-508" src="http://paperjammed.com/wp-content/uploads/2009/04/20090421-independence1.gif" alt="" width="403" height="210" /></p>
<p>Now, supposing that we needed to send this document to a colleague, but we must not reveal the name of the original signer, we might try opening up the PDF in our favorite PDF markup tool and slapping a big fat rectangle over the sensitive information.</p>
<p><img class="alignnone size-full wp-image-509" src="http://paperjammed.com/wp-content/uploads/2009/04/20090421-independence2.gif" alt="" width="448" height="330" /></p>
<p>Now, all is good. But the enemy is crafty and they exploit the huge flaw in our thinking: the information never left the document. All they need to do is copy and paste:</p>
<p><img class="alignnone size-full wp-image-510" src="http://paperjammed.com/wp-content/uploads/2009/04/20090421-independence3.gif" alt="" width="673" height="448" /></p>
<p>A quick copy/paste from the PDF viewer application to Microsoft Word lets the whole world see that John Hancock is to blame! Better let him know we slipped up so he can take appropriate actions.</p>
<p>This sounds trivial, right?</p>
<p>In February, <a href="http://www.techcrunch.com/2009/02/11/the-ap-reveals-details-of-facebookconnectu-settlement-with-best-hack-ever/">the Associated Press was able to uncover the secret details of the Facebook/ConnectU settlement</a> using this same technique.</p>
<p>Apparently, the U.S. military has been <a href="http://www.schneier.com/blog/archives/2005/05/pdf_radacting_f.html">caught in the same trap</a>.</p>
<p>Last year, Google founder Larry Page&#8217;s home address info was <a href="http://hackaday.com/2008/08/01/exposing-poorly-redacted-pdfs/">leaked in a similar fashion</a>.</p>
<p><strong>How about Scanned Documents?</strong></p>
<p>Up to this point I was working with a document that had been printed to PDF, thereby preserving the document text perfectly.</p>
<p>What about a document that we scan in?</p>
<p>Here&#8217;s some honest-to-goodness missile plans&#8230;</p>
<p><img class="alignnone size-full wp-image-511" src="http://paperjammed.com/wp-content/uploads/2009/04/20090421-missile-plans1.gif" alt="" width="547" height="448" /></p>
<p>This is an excerpt from a scanned copy of the U.S. patent for the venerable Sidewinder Missile, complete with a black square that I have added to obscure some special information.</p>
<p><img class="alignnone size-full wp-image-512" src="http://paperjammed.com/wp-content/uploads/2009/04/20090421-missile-plans2.gif" alt="" width="616" height="237" /></p>
<p>As seen here, the copy/paste trick still worked.</p>
<p>But why does it still work? Because the document had OCR run on it in the past.</p>
<p>A brief look at Acrobat&#8217;s document inspector tool shows the hidden secrets:</p>
<p><img class="alignnone size-full wp-image-513" src="http://paperjammed.com/wp-content/uploads/2009/04/20090421-hidden-text.gif" alt="" width="588" height="364" /></p>
<p>All of the red text above is hidden text. The actual hidden text is displayed by itself in the box on the right side of the screen above. It isn&#8217;t very pretty, but it has all of the details.</p>
<p><strong>Proper Redaction</strong></p>
<p>If you are concerned about keeping your secrets secret, do a bit of research into the tools available. You want to be absolutely certain that you don&#8217;t pass along any more information than you intend to.</p>
<p>Adobe Acrobat Professional comes with tools to do just this, and I show their use here:</p>
<p><img class="alignnone size-full wp-image-514" src="http://paperjammed.com/wp-content/uploads/2009/04/20090421-redaction2.gif" alt="20090421-redaction2" width="477" height="175" /></p>
<p>You can see that I have used a redaction tool to select scanned text. Acrobat is selecting the hidden text as well as the bitmap image of the page. Once I apply the redaction, you can see the result below:</p>
<p><img class="alignnone size-full wp-image-515" src="http://paperjammed.com/wp-content/uploads/2009/04/20090421-redaction3.gif" alt="" width="609" height="175" /></p>
<p>Now when my enemy tries the old copy/paste trick, the stuff between <strong>38</strong> and <strong>said means</strong> is totally blank, as intended.</p>
<p><strong>Summary</strong></p>
<p>I covered a very simplistic form of redaction here as well as a very simple way of getting around someone&#8217;s naïve censoring. Don&#8217;t stop here. You should use your PDF editor to search the metadata and hidden text for any terms you don&#8217;t want made public. You may wish to strip all metadata from your documents.</p>
<p>This is a topic that has been covered in depth by many, particularly in the legal field. Here&#8217;s a few articles worth reading on the topic:</p>
<p><a href="http://office.microsoft.com/en-us/help/HA011400341033.aspx">Control metadata in your legal documents</a> (Microsoft)</p>
<p><a href="http://seminars.adobe.acrobat.com/p95867520">Redaction and Metadata Removal eSeminar</a> (<a href="http://blogs.adobe.com/acrolaw/2009/02/acrobat_legal_training_movies.html">Acrobat for Legal Professionals</a>)</p>
<p><a href="http://www.acrobatusers.com/articles/2006/10/redacting_pdf">Redacting PDF files with Acrobat 8</a> (AcrobatUsers.com)</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/04/21/keeping-your-secrets-to-yourself%e2%80%94what-can-your-shared-documents-tell-others/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Smooth out the bumps in your workflow with desktop scripting tools</title>
		<link>http://paperjammed.com/2009/03/10/smooth-out-the-bumps-in-your-workflow-with-desktop-scripting-tools/</link>
		<comments>http://paperjammed.com/2009/03/10/smooth-out-the-bumps-in-your-workflow-with-desktop-scripting-tools/#comments</comments>
		<pubDate>Wed, 11 Mar 2009 02:06:05 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Scanning]]></category>
		<category><![CDATA[Searching and Indexing]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Tools of the Trade]]></category>
		<category><![CDATA[Workflow]]></category>
		<category><![CDATA[Geeky]]></category>
		<category><![CDATA[Macintosh]]></category>
		<category><![CDATA[Macros]]></category>
		<category><![CDATA[Scripting]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=339</guid>
		<description><![CDATA[Work flow is inherent in the kind of work that we do when scanning, indexing, searching, filing, tagging, and backing up all of our documents, photos, music, and video. Once you are committed to digital media, you will find that you often need to cobble together different programs in order to do away with some of the tedious manual labor.]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-340" src="http://paperjammed.com/wp-content/uploads/2009/03/istock_000004776175xsmall.jpg" alt="" width="341" height="226" />I love new software—when I order something new, such as the latest iLife suite, I eagerly check the UPS website to see if the package has been delivered, and I race home to try it out.</p>
<p>Of course, it is only a matter of days, or even hours, before the new glow has worn off and I begin to grouse about the things I wish they had done and I grumble about the nit-picky little bugs that are always there.</p>
<p>Even when the program works perfectly, we often want to use it in a work flow, using each program for what it does best.</p>
<p>Work flow is inherent in the kind of work that we do when scanning, indexing, searching, filing, tagging, and backing up all of our documents, photos, music, and video. Once you are committed to digital media, you will find that you often need to cobble together different programs in order to do away with some of the tedious manual labor.</p>
<p>Scripting tools can be as simple as macro recorders, as fancy as drag-and-drop pipelining tools, and as complex as full programming languages. Choose one that fits you and use it to bridge the gaps in your work flow.<span id="more-339"></span></p>
<p><strong>An Example</strong></p>
<p>I have never had a harsh thought about my sheet-fed scanner, the venerable <a href="http://www.fujitsu.com/us/services/computing/peripherals/scanners/scansnap/s510m.html">ScanSnap S510m</a>. But one thing has always annoyed me about the software package they provide: you can’t start scanning a new document until the OCR process is complete on the first.</p>
<p>What happens is this: you scan in a document; ScanSnap Manager receives it as a PDF and immediately passes it to Abbyy FineReader for OCR. All is good at this point.</p>
<p>But, the OCR process can take anywhere from 10 seconds to a few minutes, and if you were to scan in another document, FineReader complains:</p>
<p><img class="alignnone size-full wp-image-341" src="http://paperjammed.com/wp-content/uploads/2009/03/20090310-finereader.gif" alt="" width="434" height="199" /></p>
<p>This is where a little bit of AppleScript magic smoothed over this minor work flow annoyance.</p>
<p>I found that you can attach a bit of AppleScript as a “folder action” to a folder, and when a file appears in that folder, your AppleScript gets called. All I needed to do was figure out a way to get AppleScript to do the following:</p>
<ul>
<li>Wake up when a file appears</li>
<li>Grab the first unprocessed PDF file</li>
<li>Tell FineReader to start up, if it isn&#8217;t running</li>
<li>Tell FineReader to open the file</li>
<li>Wait for the processing to finish</li>
<li>Go back to step 2 until all files are done.</li>
</ul>
<p>It wasn’t exactly trivial—FineReader doesn’t actually have any special AppleScript support built in. But after a few experiments (i.e. a full weekend of fussing around) it was working smoothly.</p>
<p>Now I can scan as many documents as I want, knowing that OCR is happening at its own pace in a totally different flow.</p>
<p>Some day, when I have cleaned up the embarrassing bits of the code, I will share it here for fellow ScanSnap users.</p>
<p><strong>What scripting tools are out there</strong></p>
<p>The automation tools you use will depend on what software you are working with and what operating system you are using. On the Macintosh side, AppleScript and Automator will probably cover the bases. Things aren’t so pretty on the Windows side, but if you are willing to get a little dirty, Windows scripting can work miracles.</p>
<p>Whatever tool you use, I recommend finding as many simple examples on the Web as you can and running them and tweaking them slightly until you make them do what you want.</p>
<p>I am not an AppleScript programmer; I simply downloaded other people’s sample scripts that did file manipulations and poked and prodded them into shape.</p>
<p><strong>AppleScript</strong></p>
<p>In the Macintosh world, the old standby has always been AppleScript, with its quirky syntax that attempts to look like English. It is surprisingly easy to do some fairly heavy lifting with a minimal of effort.</p>
<p>In fact, though I find its syntax challenging and condescending at the same time, I heartily recommend AppleScript. The key is that Apple has built in AppleScript support to most of the main Macintosh applications.</p>
<p>For a taste of what AppleScript can do, here’s a whole mess of <a href="http://dougscripts.com/itunes/">cool scripts for automating iTunes</a>.</p>
<p>Imagine running a script that builds playlists for you or tags files automatically or finds duplicates and marks them.</p>
<p>This is the main <a href="http://www.apple.com/applescript/">AppleScript site</a>, and here&#8217;s a <a href="http://www.apple.com/applescript/firsttutorial/index.html">tutorial from Apple</a>.</p>
<p><strong>Automator</strong></p>
<p>Another Macintosh product, that provides a more graphical approach to hooking your apps together, is Automator. Here is an article that gives <a href="http://www.macdevcenter.com/pub/a/mac/2005/09/06/what-is-automator.html?CMP=OTC-13IV03560550">a good overview of the tool</a>.</p>
<p>This is what a simple work flow looks like in designer mode:</p>
<p><img class="alignnone size-full wp-image-343" src="http://paperjammed.com/wp-content/uploads/2009/03/20090310-automator.gif" alt="" width="555" height="375" /></p>
<p>You can see more or less what is going on here: You use a predefined set of actions, on the left side, to build up a pipeline on the right side.</p>
<p>In this case, the new automation action will get the selected documents, sort them, combine them into one PDF document, and then open that document in a PDF viewer.</p>
<p>I have always been a little confused with the utility of this product: it’s great if your application provides Automator actions, but what do you do if your app is like FineReader, with no special hooks built in?</p>
<p>Nevertheless, with Automator, you can do some pretty fancy things with many common tasks such as file management, email, and browsing. And there are plenty of custom Automator actions out there for the downloading.</p>
<p>Both Automator and AppleScript are Macintosh OS X built-ins.</p>
<p>For the serious Mac user, there is Bash Unix shell scripting, but that’s all I’m going to say about that. It’s there if you want it!</p>
<p><strong>Windows Scripting</strong></p>
<p>Things are somewhat different on the Windows side.</p>
<p>On the positive side, I believe that Microsoft’s <a href="http://www.microsoft.com/com/default.mspx">COM object architecture</a> makes it possible to script some pretty fancy things using off-the-shelf software.</p>
<p>In addition, the Office suite has amazing macro facilities that have been used by many as a full programming language, as the core of many successful commercial products.</p>
<p>But Windows just doesn’t seem to have anything analogous to AppleScript for gluing things together without getting your hands too dirty.</p>
<p>As a trivial example of the power of the Windows COM architecture, this <a href="http://cwashington.netreach.net/depo/view.asp?Index=142">short script</a> will launch Excel, create a fresh spreadsheet, and automatically feed in some data. To try it out, save the script to your desktop and double click it.</p>
<p>This is what I saw on my machine when I ran the script:</p>
<p><img class="alignnone size-full wp-image-344" src="http://paperjammed.com/wp-content/uploads/2009/03/20090310-excel-spreadsheet.gif" alt="" width="520" height="226" /><br />
Just like AppleScript on the Macintosh side, many Windows applications provide hooks for scripting.</p>
<p>The down side is that the scripting language is much more like a programming language, so it’s more comparable to writing shell scripts than to using a drag-and-drop automation tool like Automator.</p>
<p>But when you do the same thing over and over every day, it may be worth it to roll up your sleeves and get down and dirty with some VBScript.</p>
<p>An added benefit of this approach is that the language shares much of its functionality with Visual Basic for Applications, found in Word and Excel macros. If you are writing standalone Windows scripts, you can also write very complex Word and Excel macros. You also have the ability to do much of the same application automation within Word/Excel macros.</p>
<p><strong>AutoIT</strong></p>
<p>This is a freeware tool that many use for scripting their applications.</p>
<p>From the <a href="http://www.autoitscript.com/autoit3/">product website</a>:</p>
<ul>
<li>Easy to learn BASIC-like syntax</li>
<li>Simulate keystrokes and mouse movements</li>
<li>Manipulate windows and processes</li>
<li>Interact with all standard windows controls</li>
<li>Scripts can be compiled into standalone executables</li>
<li>Create Graphical User Interfaces (GUIs)</li>
</ul>
<p>What more could you ask for?</p>
<p><strong>Closing Thoughts</strong></p>
<p>There are other tools out there; don’t be afraid to check them out and try some of the examples. If you can automate just one annoying task that you do every day, it may be worth the hassle of spending a weekend writing the script.</p>
<p><strong>Epilogue</strong></p>
<p>Yesterday at work I finally rolled up my sleeves at work to tackle a nagging document problem that had been lingering for months.</p>
<p>We have a large quantity of PowerPoint slide decks that have critical company information embedded within. I work in the pharmaceutical industry, so many of those PowerPoint slides had chemical structures for various drug molecules embedded within as OLE objects.</p>
<p>The problem was that we wanted to extract that knowledge (the molecule files) electronically and text mine the slides, but our tools did not support PowerPoint. They did, however, support Microsoft Word.</p>
<p>How could I convert the slides to Word documents? There are dozens of ways on the Net to go in the other direction, but the only way to do what I wanted resulted in the OLE objects being converted to pictures—no good for my purpose. I did find that a copy/paste of each molecule from PowerPoint to Word worked correctly, but who wants to do that by hand with hundreds of documents?</p>
<p>After weeks of trying to find a real Microsoft developer at work to do the task, I finally gave in and tried my hand using VBScript and the Windows Scripting Host. In four hours I had the basics of a script going.</p>
<p>My simple VBScript file does the following:</p>
<ul>
<li>Prompt the user for a PowerPoint file and target directory</li>
<li>Launch PowerPoint and Word in the background</li>
<li>Loop through every slide, inspecting any OLE objects found within to see if it was created by ChemDraw or IsisBase Sketch, two common chemistry editor tools</li>
<li>If any such molecules are found, the following happens:
<ul>
<li>All of the &#8220;shapes&#8221; on the slide are copied to the clipboard</li>
<li>A new Word document is created with &#8220;Slide (1..n)&#8221; appended to the file name</li>
<li>The clipboard contents are pasted into the document</li>
</ul>
</li>
</ul>
<p>Sure, there are plenty of warts to work out, but I was happy to be able to run my script and watch as it magically consumed huge slide decks and spit out Word documents containing the exact data I was looking for. Windows Scripting Host is a powerful tool, as good as, if not better than AppleScript.</p>
<p>[Update: Added epilogue singing the joys of Windows Scripting Host]</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/03/10/smooth-out-the-bumps-in-your-workflow-with-desktop-scripting-tools/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Organize Your Digital Life: How to Store Your Photographs, Music, Videos, and Personal Documents in a Digital World</title>
		<link>http://paperjammed.com/2009/03/08/organize-your-digital-life-how-to-store-your-photographs-music-videos-and-personal-documents-in-a-digital-world/</link>
		<comments>http://paperjammed.com/2009/03/08/organize-your-digital-life-how-to-store-your-photographs-music-videos-and-personal-documents-in-a-digital-world/#comments</comments>
		<pubDate>Mon, 09 Mar 2009 00:18:39 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Green Living]]></category>
		<category><![CDATA[Paperless Life]]></category>
		<category><![CDATA[Scanning]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Books]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Organization]]></category>
		<category><![CDATA[Reviews]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=331</guid>
		<description><![CDATA[Organize Your Digital Life: How to Store Your Photographs, Music, Videos, and Personal Documents in a Digital World by Aimee Baldridge
If you are looking for step-by-step checklists and good solid advice about putting your digital life in order, then this book is for you.
The author covers a very broad swath of digital media, discussing topics such [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-332" title="31i3ducaavl_sl500_aa180_" src="http://paperjammed.com/wp-content/uploads/2009/03/31i3ducaavl_sl500_aa180_.jpg" alt="31i3ducaavl_sl500_aa180_" width="180" height="180" />Organize Your Digital Life: How to Store Your Photographs, Music, Videos, and Personal Documents in a Digital World by Aimee Baldridge</p>
<p>If you are looking for step-by-step checklists and good solid advice about putting your digital life in order, then this book is for you.</p>
<p>The author covers a very broad swath of digital media, discussing topics such as video, music, and photography. She provides clear guidelines for prioritizing your efforts and crisp checklists.</p>
<p>In addition, this up-to-date book (January 2009) provides a good source of information about the leading software products available, with advice on features to look for. For example, while discussing photo management software, she mentioned the advantage of being able to select a large group of photographs and batch-rename them to something like &#8220;Egypt 001.jpg&#8221; through &#8220;Egypt 100.jpg&#8221;</p>
<p>The book gets geeky enough to whet the palate, without going too far. Topics such as EXIF data and nitty gritty settings for scanning photos are covered in depth.</p>
<p><a href="http://www.amazon.com/Organize-Your-Digital-Life-Photographs/dp/1426203349/ref=pd_bbs_sr_1?ie=UTF8&amp;s=books&amp;qid=1236558082&amp;sr=8-1">Here&#8217;s the book on Amazon</a></p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/03/08/organize-your-digital-life-how-to-store-your-photographs-music-videos-and-personal-documents-in-a-digital-world/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What should you have in your toolbox?</title>
		<link>http://paperjammed.com/2009/02/22/what-should-you-have-in-your-toolbox/</link>
		<comments>http://paperjammed.com/2009/02/22/what-should-you-have-in-your-toolbox/#comments</comments>
		<pubDate>Sun, 22 Feb 2009 20:50:38 +0000</pubDate>
		<dc:creator>Tad</dc:creator>
				<category><![CDATA[Backups]]></category>
		<category><![CDATA[Scanning]]></category>
		<category><![CDATA[Searching and Indexing]]></category>
		<category><![CDATA[Security]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Tools of the Trade]]></category>

		<guid isPermaLink="false">http://paperjammed.com/?p=231</guid>
		<description><![CDATA[ 
A list of several useful hardware and software tools with which to arm yourself before you attack the file cabinet.
When I first became interested in woodworking, I checked out several books on the subject from the library. Invariably, within the first two or three chapters, there was an illustrated list of desirable hand tools for [...]]]></description>
			<content:encoded><![CDATA[<p> </p>
<p><img class="alignright size-full wp-image-242" src="http://paperjammed.com/wp-content/uploads/2009/02/istock_000008018163xsmall-sm.gif" alt="istock_000008018163xsmall-sm" width="250" height="165" />A list of several useful hardware and software tools with which to arm yourself before you attack the file cabinet.</p>
<p>When I first became interested in woodworking, I checked out several books on the subject from the library. Invariably, within the first two or three chapters, there was an illustrated list of desirable hand tools for a woodworker’s workbench.</p>
<p>No one ever actually goes out and buys everything shown on those pages; they start with the essentials and build up a collection over time. And so it is with this list I give you of useful tools of the trade.<span id="more-231"></span></p>
<p><strong>Staple Remover</strong></p>
<p><img class="alignright size-full wp-image-246" title="20090222-staple-remover" src="http://paperjammed.com/wp-content/uploads/2009/02/20090222-staple-remover.jpg" alt="20090222-staple-remover" width="150" height="150" />Such a simple little device, but so important. My wife staples everything together, then she hands me a stack of old bills to be scanned in, complete with staples.</p>
<p>Don’t bother with those ones that have jaws. Get one of the stick kind, like this one.</p>
<p><strong>Paper Cutter</strong></p>
<p>I wouldn’t buy one unless I had lots of manuals and such to scan, but this is a device that can really speed things up. Just remove the staples from the spine and then start chopping through the centers of the pages, in small stacks.</p>
<p>If you have too much work for a paper cutter, you can always take the manuals/magazines to a professional office center and <a href="http://paperjammed.com/2009/02/12/the-guillotine-and-the-saw/">have them do the job</a>.</p>
<p><strong>Paper shredder</strong></p>
<p>It’s a sad commentary on modern times that most people these days have a paper shredder in the house. Yours should be strong enough to do the job without overheating and should chop the paper into proper tiny bits.</p>
<p><strong>Flatbed Scanner</strong></p>
<p>This is where everyone starts. Nobody goes out and buys a sheet-fed scanner as their first scanner; the price is often prohibitive, and the devices are very task-specific.</p>
<p>Even if you have invested in a sheet-fed scanner, it is good to keep a flatbed scanner around to scan in things such as hard book covers or pages that you don’t want to remove from a book.</p>
<p><strong>Sheet-fed Scanner</strong></p>
<p><img class="alignright size-full wp-image-248" title="20090222-scansnap" src="http://paperjammed.com/wp-content/uploads/2009/02/20090222-scansnap.jpg" alt="20090222-scansnap" width="150" height="150" />The <a href="http://paperjammed.com/2009/02/01/tools-of-the-trade-your-scanner/">most important part of a serious document scanning exercise</a>.</p>
<p>The ability to scan in twenty or thirty pages in a minute, both sides, makes this expensive device stand head and shoulders above all flatbed scanners.</p>
<p>I use the <a href="http://www.fujitsu.com/us/services/computing/peripherals/scanners/scansnap/s510m.html">Fujitsu ScanSnap S510M</a>.</p>
<p>You might want to check out Terry White&#8217;s Tech Blog&#8217;s comparison of the <a href="http://terrywhite.com/techblog/?p=1121">NeatReceipts, ScanSnap S300, and ScanSnap S510M</a>. He did a great video showing the three in operation.</p>
<p><strong>OCR software</strong></p>
<p>Without OCR software, your documents <a href="http://paperjammed.com/2009/02/05/its-all-about-searching/">are not searchable</a>. Fortunately, most scanners come bundled with some kind of OCR software. Mine came with both <a href="http://finereader.abbyy.com/">Abbyy FineReader</a> as well as <a href="http://www.adobe.com/products/acrobatpro/">Adobe Acrobat Professional 8</a>.</p>
<p><strong>External backup drive</strong></p>
<p>So important. Your data <a href="http://paperjammed.com/2009/02/19/if-there-arent-two-copies-in-separate-places-it-isnt-a-backup/">has to be in two places</a>.</p>
<p><strong>Backup software</strong></p>
<p>Sure, you can manually copy files from your desktop to the backup drive, and I did this for a long time. It’s better to automate this process.</p>
<p>Backup software helps prevent accidental erasure of either the source or the target data, and it helps you perform the backups regularly and painlessly.</p>
<p>I personally use rsync scripts to do the job, though this may be a bit too geeky for some. Many external hard drives come bundled with quality utilities. You can even consider using online backups such as Mozy or Carbonite.</p>
<p><strong>PDF editing software</strong></p>
<p>A definite plus once you begin scanning larger documents. You want to be able to merge documents, rotate pages, and move pages around with ease.</p>
<p>For example, I just finished scanning in a tri-fold owner’s manual that I cut on the folds. The first sheet had page 1 and 6 on it, the middle sheet had page 2 and 5 on it, and the last sheet had page 3 and 4 on it.</p>
<p>When the document went through the scanner, I had a PDF with 1,6,2,5,3,4 page order. It was a breeze to drag the pages into the correct order.</p>
<p>I use the <a href="http://www.macworld.com/article/132468/2008/04/workingmac2504.html">Preview app from Mac OS X</a> for this. Adobe Acrobat does the job quite nicely as well.</p>
<p><strong>Document Management software</strong></p>
<p>This is the repository where you keep your documents. I have mixed feelings about document management software.</p>
<p>On the one hand, these products are very good at what they do and they certainly make it easier to organize documents.</p>
<p>On the other hand, as soon as you commit yourself to one document management tool, it is difficult to migrate your collection to a new tool, should you need to switch in the future.</p>
<p>That said, I use a tool called <a href="http://www.ironicsoftware.com/yep/index.html">Yep</a> for this purpose. One of the other major players out there is <a href="http://www.devon-technologies.com/products/devonthink/">DEVONthink</a>.</p>
<p><strong>Keyboard macro software</strong></p>
<p>I find myself doing certain repetitive tasks from time to time that require multiple clicks or keystrokes. For example, when I am reviewing a scanned document, I scroll through the document looking for pages that were accidentally rotated 90 or 180 degrees by the scanning software.</p>
<p>The tool I use for previewing and editing (the Preview app from Mac OS X) supports rotating single pages, but it asks you each time “Do you want to rotate the current page only or the entire document?”</p>
<p>After tiring of clicking through that question every time, I used my macro software to write three keyboard macros, and I assigned them to ⌘-Left, ⌘-Right, and ⌘-Up, for Rotate Left, Rotate Right, and Rotate 180. Now I can quickly browse through a document, flipping and rotating any pages that need it.</p>
<p>I use <a href="http://www.keyboardmaestro.com/main/">Keyboard Maestro</a> for this job.</p>
<p><strong>File management utilities</strong></p>
<p>As soon as you start juggling hundreds of files around and renaming them and moving them into different folders, you find yourself performing certain repetitive tasks, that can often be made easier by software.</p>
<p>I use a tool called <a href="http://www.publicspace.net/ABetterFinderRename/">A Better File Renamer</a> for the Mac that allows me to rename large numbers of files, following dozens of customizable rules. This tool allows you to package renaming scripts into a “droplet” on the desktop, where you can drop files.</p>
<p>One of my droplets adds the file creation date to the filename as a prefix. Another droplet converts filenames with embedded dashes and underscores into “title case” names with spaces.</p>
<p><img class="alignright size-full wp-image-237" title="20090222-bmfm-icon" src="http://paperjammed.com/wp-content/uploads/2009/02/20090222-bmfm-icon.jpg" alt="20090222-bmfm-icon" width="100" height="100" />I recently bought a folder management utility called <a href="http://www.publicspace.net/BigMeanFolderMachine/">The Big Mean Folder Machine</a> that allows easy creation of folder hierarchies, such as by automatically splitting files into groups of 100. I haven&#8217;t used it much yet, but I sure love the icon!</p>
<p><strong>Encryption utility</strong></p>
<p>Gotta have it if you are putting your intimate personal information into digital form. Anything with your SSN or your credit card numbers should be protected. Remember, even if it’s on your home desktop machine, a burglar would take the whole machine, and happily sift through your personal data looking for info that could be stolen.</p>
<p>For some things I use encrypted volumes on Mac OS X; for others, I use <a href="http://paperjammed.com/2009/02/17/what-would-happen-if-that-thumb-drive-slipped-out-of-your-pocket/">TrueCrypt</a>.</p>
<p><strong>Assortment of Thumb Drives</strong></p>
<p><img class="alignright size-full wp-image-240" title="20090222-619088_usb_drive-sm" src="http://paperjammed.com/wp-content/uploads/2009/02/20090222-619088_usb_drive-sm.gif" alt="20090222-619088_usb_drive-sm" width="200" height="133" />What better way to move your digital documents around? They don’t even need to be that large. Cheap throwaway thumb drives these days have the capacity to hold a lifetime of digital documents. But make sure you protect sensitive data!</p>
<p>[Update: Added a link to Terry White's scanner review]</p>
]]></content:encoded>
			<wfw:commentRss>http://paperjammed.com/2009/02/22/what-should-you-have-in-your-toolbox/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
	</channel>
</rss>
