<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Creating ebooks from book scans &#8230;. on Linux</title>
	<atom:link href="http://www.danielstender.com/granthinam/564/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.danielstender.com/granthinam/564/</link>
	<description>Daniel Stender&#039;s blog on Sanskrit philology and associated issues (e-philology, textual criticism, TeX, Open source, etc.)</description>
	<lastBuildDate>Wed, 28 Jul 2010 11:42:51 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Daniel Stender</title>
		<link>http://www.danielstender.com/granthinam/564/comment-page-1/#comment-1236</link>
		<dc:creator>Daniel Stender</dc:creator>
		<pubDate>Mon, 21 Jun 2010 09:25:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.danielstender.com/granthinam/?p=564#comment-1236</guid>
		<description>Update:
(1) I really haven&#039;t thought about kicking off such a thing with this but the American &lt;em&gt;Linux Magazine&lt;/em&gt; has bought the article from German &lt;em&gt;Linux User&lt;/em&gt; which has emerged from this contrib (issue 06/2010), and it&#039;s going to appear in no. 117 in August.
(2) In the meanwhile Abbyy published a Linux command line version of their popular &lt;em&gt;FineReader&lt;/em&gt; software (see &lt;a href=&quot;http://translate.google.de/translate?js=y&amp;prev=_t&amp;hl=de&amp;ie=UTF-8&amp;layout=1&amp;eotf=1&amp;u=http%3A%2F%2Fwww.pro-linux.de%2Fnews%2F1%2F15418%2Fabbyy-veroeffentlicht-command-line-interface-ocr-fuer-linux.html&amp;sl=de&amp;tl=en&quot; rel=&quot;nofollow&quot;&gt;here&lt;/a&gt;), which seems to work really fine (see the review in the recent &lt;em&gt;Linux-Magazin&lt;/em&gt;, 07/2010, 68-71). The reader spits out also hOCR but also finished PDFs.
(3) There is a comparison of the free OCR engines in the &lt;em&gt;Linux-Magazin&lt;/em&gt; 12/2006 (see &lt;a href=&quot;http://translate.google.de/translate?js=y&amp;prev=_t&amp;hl=de&amp;ie=UTF-8&amp;layout=1&amp;eotf=1&amp;u=http%3A%2F%2Fwww.linux-magazin.de%2Flayout%2Fset%2Fprint%2Fcontent%2Fview%2Ffull%2F487&amp;sl=de&amp;tl=en&quot; rel=&quot;nofollow&quot;&gt;here&lt;/a&gt;, might be a little bit outdated).
(4) Jakub Wilk also implemented a little GUI editor for DjVu metadata, &lt;a href=&quot;http://jwilk.net/software/djvusmooth&quot; rel=&quot;nofollow&quot;&gt;Djvusmooth&lt;/a&gt;, which is available as I can see so far only for Debian Testing (that&#039;s anyway the choice if you want to run the latest stuff), see &lt;a href=&quot;http://www.telemedia.ch/cgi-bin/whohas.wh?dist=alldists&amp;q=djvusmooth&amp;go=Search!&quot; rel=&quot;nofollow&quot;&gt;here&lt;/a&gt;.</description>
		<content:encoded><![CDATA[<p>Update:<br />
(1) I really haven&#8217;t thought about kicking off such a thing with this but the American <em>Linux Magazine</em> has bought the article from German <em>Linux User</em> which has emerged from this contrib (issue 06/2010), and it&#8217;s going to appear in no. 117 in August.<br />
(2) In the meanwhile Abbyy published a Linux command line version of their popular <em>FineReader</em> software (see <a href="http://translate.google.de/translate?js=y&#038;prev=_t&#038;hl=de&#038;ie=UTF-8&#038;layout=1&#038;eotf=1&#038;u=http%3A%2F%2Fwww.pro-linux.de%2Fnews%2F1%2F15418%2Fabbyy-veroeffentlicht-command-line-interface-ocr-fuer-linux.html&#038;sl=de&#038;tl=en" rel="nofollow">here</a>), which seems to work really fine (see the review in the recent <em>Linux-Magazin</em>, 07/2010, 68-71). The reader spits out also hOCR but also finished PDFs.<br />
(3) There is a comparison of the free OCR engines in the <em>Linux-Magazin</em> 12/2006 (see <a href="http://translate.google.de/translate?js=y&#038;prev=_t&#038;hl=de&#038;ie=UTF-8&#038;layout=1&#038;eotf=1&#038;u=http%3A%2F%2Fwww.linux-magazin.de%2Flayout%2Fset%2Fprint%2Fcontent%2Fview%2Ffull%2F487&#038;sl=de&#038;tl=en" rel="nofollow">here</a>, might be a little bit outdated).<br />
(4) Jakub Wilk also implemented a little GUI editor for DjVu metadata, <a href="http://jwilk.net/software/djvusmooth" rel="nofollow">Djvusmooth</a>, which is available as I can see so far only for Debian Testing (that&#8217;s anyway the choice if you want to run the latest stuff), see <a href="http://www.telemedia.ch/cgi-bin/whohas.wh?dist=alldists&#038;q=djvusmooth&#038;go=Search!" rel="nofollow">here</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Stender</title>
		<link>http://www.danielstender.com/granthinam/564/comment-page-1/#comment-1053</link>
		<dc:creator>Daniel Stender</dc:creator>
		<pubDate>Tue, 27 Apr 2010 08:07:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.danielstender.com/granthinam/?p=564#comment-1053</guid>
		<description>Thanks Edgard. Indeed everything here is under heavy development (actually Debian [Testing] is the best distrib for running all the newest stuff). I&#039;ve been in contact with Jakub Wilk and he told me grayscale processing will be also available soon (with 0.4.2, option &lt;em&gt;-render&lt;/em&gt;). BTW for proper single word highlighting &lt;em&gt;hocr2djvused&lt;/em&gt; needs data with has been produced with &lt;em&gt;ocroscript --charboxes&lt;/em&gt;!</description>
		<content:encoded><![CDATA[<p>Thanks Edgard. Indeed everything here is under heavy development (actually Debian [Testing] is the best distrib for running all the newest stuff). I&#8217;ve been in contact with Jakub Wilk and he told me grayscale processing will be also available soon (with 0.4.2, option <em>-render</em>). BTW for proper single word highlighting <em>hocr2djvused</em> needs data with has been produced with <em>ocroscript &#8211;charboxes</em>!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Edgard Bikelis</title>
		<link>http://www.danielstender.com/granthinam/564/comment-page-1/#comment-1051</link>
		<dc:creator>Edgard Bikelis</dc:creator>
		<pubDate>Tue, 27 Apr 2010 03:51:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.danielstender.com/granthinam/?p=564#comment-1051</guid>
		<description>Excellent post, Daniel. The Ocrodjvu is really a good finding, as the last time I&#039;ve checked, we could just add OCR to each line, not to each word. Now, being able to search all my .djvu files will be neat indeed!</description>
		<content:encoded><![CDATA[<p>Excellent post, Daniel. The Ocrodjvu is really a good finding, as the last time I&#8217;ve checked, we could just add OCR to each line, not to each word. Now, being able to search all my .djvu files will be neat indeed!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Stender</title>
		<link>http://www.danielstender.com/granthinam/564/comment-page-1/#comment-559</link>
		<dc:creator>Daniel Stender</dc:creator>
		<pubDate>Tue, 02 Mar 2010 15:37:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.danielstender.com/granthinam/?p=564#comment-559</guid>
		<description>Latest Gscan2pdf 0.9.30 employs a port to OCRopus, cf. http://lwn.net/Articles/372405/</description>
		<content:encoded><![CDATA[<p>Latest Gscan2pdf 0.9.30 employs a port to OCRopus, cf. <a href="http://lwn.net/Articles/372405/" rel="nofollow">http://lwn.net/Articles/372405/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Harunaga Isaacson</title>
		<link>http://www.danielstender.com/granthinam/564/comment-page-1/#comment-511</link>
		<dc:creator>Harunaga Isaacson</dc:creator>
		<pubDate>Mon, 08 Feb 2010 11:50:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.danielstender.com/granthinam/?p=564#comment-511</guid>
		<description>Thanks for the post. gscan2pdf is indeed very nice; I used to use it a lot. Have you tried Scan Tailor (http://scantailor.sourceforge.net/ )? I have found this useful too.</description>
		<content:encoded><![CDATA[<p>Thanks for the post. gscan2pdf is indeed very nice; I used to use it a lot. Have you tried Scan Tailor (<a href="http://scantailor.sourceforge.net/" rel="nofollow">http://scantailor.sourceforge.net/</a> )? I have found this useful too.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk (enhanced) (user agent is rejected)
Database Caching 3/13 queries in 0.016 seconds using disk

Served from: www.danielstender.com @ 2010-09-08 16:48:55 -->