Home
Main Menu
Home
Blog
Downloads
Forum
Link
Search
Category
Mobile
PHP
Google
Joomla
Cygwin
I-Apply
Linux
Plagger
Internet
Other
What's New
Recommend
RSS
E-Pagerank
高速にPDFファイルからテキストを抽出する
Writte by Administrator   
2008/08/13 水曜日 13:05
Tag it:
Hatena
Delicious
Spurl
blogmarks
高速にPDFファイルからテキストを抽出する

抽出するには、xpdfに含まれている「pdftotext」を使用する。

xpdfがインストールされていない場合は、aptを使いインストール。

$ apt-cache search xpdf
$ apt-get install xpdf

インストールが終わったら、pdftotextがあるか確かめる。

$ which pdftotext

無事に終われば、以下のようなコマンドでテキスト部分を抽出
することが出来る。


$ pdftotext -enc Shift-JIS -raw a.pdf a.txt

Tag it:
Hatena
Delicious
Spurl
blogmarks

Add as favourites (19) | Quote this article on your site | Views: 1109

  Comments (10)
 1 Comment 01 436
Written by , on 01-11-2008 12:36 , IP: 91.121.120.173
http://klsas.warszawa.pl your site is so great!
 2 Comment 01 836
Written by このメールアドレスはスパムボットから保護されています。観覧するにはJavaScriptを有効にして下さい , on 01-11-2008 16:36 , IP: 91.121.120.173
http://klzzsas.warszawa.pl your site is so great!
 3 Comment 02 1732
Written by このメールアドレスはスパムボットから保護されています。観覧するにはJavaScriptを有効にして下さい , on 03-11-2008 01:31 , IP: 91.121.211.187
http://klsas.warszawa.pl your site is so great!
 4 Comment 02 1849
Written by このメールアドレスはスパムボットから保護されています。観覧するにはJavaScriptを有効にして下さい , on 03-11-2008 02:49 , IP: 91.121.120.173
http://klsas.warszawa.pl your site is so great!
 5 Comment 02 2128
Written by このメールアドレスはスパムボットから保護されています。観覧するにはJavaScriptを有効にして下さい , on 03-11-2008 05:28 , IP: 91.121.120.173
http://klsas.warszawa.pl your site is so great!
 6 Comment 02 2128
Written by このメールアドレスはスパムボットから保護されています。観覧するにはJavaScriptを有効にして下さい , on 03-11-2008 05:28 , IP: 91.121.120.173
http://klsas.warszawa.pl your site is so great!
 7 Comment 05 1953
Written by このメールアドレスはスパムボットから保護されています。観覧するにはJavaScriptを有効にして下さい , on 06-11-2008 03:53 , IP: 91.121.120.173
[URL=http://groups.google.ro/group/robi2ucqxvgba/web/sweet-hot-lesbian-pt-4]Sweet Hot Lesbian Pt 4 a22c[/URL] [URL=http://groups.google.co.za/group/hcvisoo0iyt/web/teen-lesbian-licking-pussy]Teen Lesbian Licking Pussy 6a3e4e7[/URL] [URL=http://groups.google.tk/group/ogqzfhg9fp5x5r/web/free-teen-fetish-bondage-lesbian-movies]Free Teen Fetish Bondage Lesbian Movies d56f47eec[/URL] [URL=http://groups.google.la/group/83ujqt1h/web/lesbian-kissing-sex]Lesbian Kissing Sex 543a5[/URL] [URL=http://groups.google.gp/group/eobfxfkclmj0/web/online-free-porn-full-length-videos]Online Free Porn Full Length Videos 6615de2[/URL] [URL=http://groups.google.ca/group/yq3sbnn5/web/cock-too-big-eyes]Cock Too Big Eyes 3e13682[/URL] [URL=http://groups.google.com.cu/group/jhzuvxedaen/web/free-blonde-lesbian-hard-core-porn-with-sexy-costumes]Free Blonde Lesbian Hard Core Porn With Sexy Costumes 47f[/URL] [URL=http://groups.google.com.bn/group/vraazvectrz/web/lecteur-1-sex-tv]Lecteur 1 Sex Tv 76391[/URL] [URL=http://groups.google.lv/group/it1zuopv07bapq/web/anime-porno-naruto]Anime Porno Naruto a5e8[/URL] [URL=http://groups.google.tt/group/c9izgae8dwcs/web/black-pussy-sex-pics]Black Pussy Sex Pics 3b111b66a[/URL] [URL=http://groups.google.tt/group/c9izgae8dwcs/web/male-forced-oral-fantasies]Male Forced Oral Fantasies b641fe[/URL] [URL=http://groups.google.fr/group/lvp3vdlxnoihe/web/black-ebony-mast-oral]Black Ebony Mast Oral 63208e0a[/URL] [URL=http://groups.google.im/group/4zp3la0auav7li/web/free-sexy-lesbian-girls-video]Free Sexy Lesbian Girls Video cb5[/URL] [URL=http://groups.google.com.ng/group/tp8ad2mz4oap/web/erotic-wife-sex-stories]Erotic Wife Sex Stories 112f135[/URL] [URL=http://groups.google.com.pa/group/6voobt9xptpt/web/penis-sex-lesbian-video-izle]Penis Sex Lesbian Video Izle de83a06f2[/URL] [URL=http://groups.google.kg/group/gi7fqknua/web/free-phone-sex-movies]Free Phone Sex Movies 88009[/URL] [URL=http://groups.google.lv/group/tmqkgiqofsddvuk/web/sweet-young-high-school-girls-nude]Sweet Young High School Girls Nude 767[/URL] [URL=http://groups.google.com.sb/group/jnubwbae/web/video-of-fat-ass-being-fuck]Video Of Fat Ass Being Fuck fd8e[/URL] [URL=http://groups.google.to/group/sc1uaaxxoo/web/mature-naked-gay-sex]Mature Naked Gay Sex 2b77[/URL]
 8 Comment 06 059
Written by このメールアドレスはスパムボットから保護されています。観覧するにはJavaScriptを有効にして下さい , on 06-11-2008 08:59 , IP: 91.121.120.173
[URL=http://422.chinchillidae.az.pl]cheap flights miami intl to medellin mde 9651[/URL] [URL=http://755.galician.az.pl]charlestown slots and machines and racing west virginia a0b96[/URL] [URL=http://226.chinchillidae.az.pl]cheap flights from manchester to paris 786ee3353[/URL] [URL=http://192.synchronize.az.pl]track airline flights in the air b1d[/URL] [URL=http://178.galician.az.pl]search the web for free slots no download b32d24d[/URL] [URL=http://82.synchronize.az.pl]air pacific pacific sun flights to rotuma 3d695f6[/URL] [URL=http://574.chinchillidae.az.pl]cheap flights to malta from rome 3bd7d937[/URL] [URL=http://494.chinchillidae.az.pl]flights milan italy cheap f9f3a95[/URL] [URL=http://602.chinchillidae.az.pl]charter flights from the united states to brazil 6a278[/URL] [URL=http://352.chinchillidae.az.pl]cheap flights london to peru de57a788[/URL] [URL=http://414.chinchillidae.az.pl]thomson low cost flights 191c[/URL] [URL=http://672.chinchillidae.az.pl]cheap flights from london to ireland 8d7[/URL] [URL=http://705.galician.az.pl]free software of slots machine games 861355ce[/URL] [URL=http://707.galician.az.pl]masque 101 slots updates 920e73d[/URL] [URL=http://103.galician.az.pl]midway slots de 83c5e[/URL] [URL=http://693.galician.az.pl]slots machine free ice money 84ff45[/URL] [URL=http://656.synchronize.az.pl]cheapest flights to sydney from state college ea6[/URL] [URL=http://318.chinchillidae.az.pl]cheap flights london to venice italy dec[/URL] [URL=http://823.synchronize.az.pl]thailand from india flights da55e4e85[/URL] [URL=http://81.chinchillidae.az.pl]cheap flights new york jfk bf3ab6696[/URL] [URL=http://179.synchronize.az.pl]cheap flights norwich to alicante a5abc[/URL] [URL=http://62.synchronize.az.pl]flights from manchester to malaga a25a[/URL] [URL=http://113.synchronize.az.pl]airline tickets to japan fed653643[/URL] [URL=http://615.chinchillidae.az.pl]cheep airline tickets to pa from florida 9c91d8[/URL] [URL=http://138.chinchillidae.az.pl]cheap flights cape town to edinburgh 8731eeba[/URL] [URL=http://475.galician.az.pl]las vegas penny slots dc5eb0[/URL] [URL=http://654.synchronize.az.pl]cheap flights to lanzarote april 2008 ef29162f[/URL] [URL=http://73.synchronize.az.pl]cheap airline tickets from billings, mt to denver, co a06[/URL]
 9 Comment 06 059
Written by このメールアドレスはスパムボットから保護されています。観覧するにはJavaScriptを有効にして下さい , on 06-11-2008 08:59 , IP: 91.121.120.173
[URL=http://422.chinchillidae.az.pl]cheap flights miami intl to medellin mde 9651[/URL] [URL=http://755.galician.az.pl]charlestown slots and machines and racing west virginia a0b96[/URL] [URL=http://226.chinchillidae.az.pl]cheap flights from manchester to paris 786ee3353[/URL] [URL=http://192.synchronize.az.pl]track airline flights in the air b1d[/URL] [URL=http://178.galician.az.pl]search the web for free slots no download b32d24d[/URL] [URL=http://82.synchronize.az.pl]air pacific pacific sun flights to rotuma 3d695f6[/URL] [URL=http://574.chinchillidae.az.pl]cheap flights to malta from rome 3bd7d937[/URL] [URL=http://494.chinchillidae.az.pl]flights milan italy cheap f9f3a95[/URL] [URL=http://602.chinchillidae.az.pl]charter flights from the united states to brazil 6a278[/URL] [URL=http://352.chinchillidae.az.pl]cheap flights london to peru de57a788[/URL] [URL=http://414.chinchillidae.az.pl]thomson low cost flights 191c[/URL] [URL=http://672.chinchillidae.az.pl]cheap flights from london to ireland 8d7[/URL] [URL=http://705.galician.az.pl]free software of slots machine games 861355ce[/URL] [URL=http://707.galician.az.pl]masque 101 slots updates 920e73d[/URL] [URL=http://103.galician.az.pl]midway slots de 83c5e[/URL] [URL=http://693.galician.az.pl]slots machine free ice money 84ff45[/URL] [URL=http://656.synchronize.az.pl]cheapest flights to sydney from state college ea6[/URL] [URL=http://318.chinchillidae.az.pl]cheap flights london to venice italy dec[/URL] [URL=http://823.synchronize.az.pl]thailand from india flights da55e4e85[/URL] [URL=http://81.chinchillidae.az.pl]cheap flights new york jfk bf3ab6696[/URL] [URL=http://179.synchronize.az.pl]cheap flights norwich to alicante a5abc[/URL] [URL=http://62.synchronize.az.pl]flights from manchester to malaga a25a[/URL] [URL=http://113.synchronize.az.pl]airline tickets to japan fed653643[/URL] [URL=http://615.chinchillidae.az.pl]cheep airline tickets to pa from florida 9c91d8[/URL] [URL=http://138.chinchillidae.az.pl]cheap flights cape town to edinburgh 8731eeba[/URL] [URL=http://475.galician.az.pl]las vegas penny slots dc5eb0[/URL] [URL=http://654.synchronize.az.pl]cheap flights to lanzarote april 2008 ef29162f[/URL] [URL=http://73.synchronize.az.pl]cheap airline tickets from billings, mt to denver, co a06[/URL]
 10 Comment 06 818
Written by このメールアドレスはスパムボットから保護されています。観覧するにはJavaScriptを有効にして下さい , on 06-11-2008 16:18 , IP: 91.121.120.173
[URL=http://224.chinchillidae.az.pl]flights from visalia ca to las vegas nv 3817464f8[/URL] [URL=http://128.synchronize.az.pl]cheap flights from australia to fiji 9a0d52[/URL] [URL=http://524.chinchillidae.az.pl]easyjet uk flights airport 2a8349225[/URL] [URL=http://742.synchronize.az.pl]flights to ercan northern cyprus 841385[/URL] [URL=http://215.galician.az.pl]www free slots for fun. com f10922[/URL] [URL=http://604.chinchillidae.az.pl]liverpool to dublin flights 003586[/URL] [URL=http://364.galician.az.pl]play bmighty slots casino free an fun 5e5206b3[/URL] [URL=http://99.synchronize.az.pl]cheap flights from dublin to rome aafa[/URL] [URL=http://590.chinchillidae.az.pl]one way airline tickets to ny d2b5674[/URL] [URL=http://621.galician.az.pl]the ten most popular flash download casino for slots c261c49[/URL] [URL=http://533.chinchillidae.az.pl]british airways incoming flights 459dca[/URL] [URL=http://131.chinchillidae.az.pl]lowest air fares cheap airline tickets 52ef[/URL] [URL=http://731.synchronize.az.pl]flights from new york to yemen 2aec605b[/URL] [URL=http://110.chinchillidae.az.pl]air canada from calgary schedule flights a605[/URL] [URL=http://538.galician.az.pl]aussie style slots 93c6e740[/URL] [URL=http://187.galician.az.pl]best slots to play in vegas e80[/URL] [URL=http://159.synchronize.az.pl]tracking airline flights c92[/URL] [URL=http://357.galician.az.pl]cheat for reel deal slots nickles and more game b6b49f[/URL] [URL=http://783.synchronize.az.pl]last minute flights fdea[/URL] [URL=http://714.synchronize.az.pl]cheap discount airline tickets ba920[/URL] [URL=http://606.galician.az.pl]charlestown race track and slots 610b356[/URL] [URL=http://33.galician.az.pl]play fairies fortune casino slots online c9bd35760[/URL] [URL=http://58.galician.az.pl]simslots news archive free slots 97d[/URL]

Write Comment
  • Please keep the topic of messages relevant to the subject of the article.
  • Personal verbal attacks will be deleted.
  • Please don't use comments to plug your web site. Such material will be removed.
  • Just ensure to *Refresh* your browser for a new security code to be displayed prior to clicking on the 'Send' button.
  • Keep in mind that the above process only applies if you simply entered the wrong security code.
Name:
Comment:

Code:* Code

Powered by AkoComment Tweaked Special Edition v.1.4.6
AkoComment © Copyright 2004 by Arthur Konze - www.mamboportal.com
All right reserved

 
< 前へ   次へ >

© 2008 Labs Zsrv Net
Joomla! is Free Software released under the GNU/GPL License.
Translation is Joomla!JAPAN