Handwritten Document Recognition for Indic Languages
Guide to OCR for Indic Scripts: Document Recognition and Retrieval (Advances in Pattern Recognition)

  • Binding: Hardcover / Pages: 340 p. / Illustrations: 161 illus.
  • Language: ENG
  • Product code: 9781848003293
  • DDC classification: 651

Basic Description

Describes OCR systems that cover 8 different scripts – Bangla, Devanagari, Gurmukhi, Gujarati, Kannada, Malayalam, Tamil, and Urdu (Perso-Arabic).

Full Description

The original motivations for developing optical character recognition technologies were modest: to convert printed text on flat physical media to digital form, producing machine-readable digital content. By doing this, words that had been inert and bound to physical material would be brought into the digital realm and thus gain new and powerful functionalities and analytical possibilities. First-generation digital OCR researchers in the 1970s quickly realized that by limiting their ambitions primarily to contemporary documents printed in standard font type from the modern Roman alphabet (and of these, mostly English language materials), they were constraining the possibilities for future research and technologies considerably. Domain researchers also saw that the trajectory of OCR technologies if left unchanged would exclude a large portion of the human record. Digital conversion of documents and manuscripts in other alphabets, scripts, and cursive styles was of critical importance.
Embedded in non-Roman alphabet source documents, including ancient manuscripts, papyri scrolls, clay tablets, and other inscribed artifacts was not only a wealth of scholarly information but also new opportunities and challenges for advancing OCR, imaging sciences, and other computational research areas. The limiting circumstances at the time included the rudimentary capability (and high cost) of computational resources and lack of network-accessible digital content. Since then computational technology has advanced at a very rapid pace and networking infrastructure has proliferated. Over time, this exponential decrease in the cost of computation, memory, and communications bandwidth combined with the exponential increase in Internet-accessible digital content has transformed education, scholarship, and research. Large numbers of researchers, scholars, and students use and depend upon Internet-based content and computational resources. The chapters in this book describe a critically important area of investigation: addressing conversion of Indic script into machine-readable form. Rough estimates have it that currently more than a billion people use Indic scripts. Collectively, Indic historic and cultural documents contain a vast richness of human knowledge and experience.
The state-of-the-art research described in this book demonstrates the multiple values associated with these activities. Technically, the problems associated with Indic script recognition are very difficult and will contribute to and inform related script recognition efforts. The work also has enormous consequence for enriching and enabling the study of Indic cultural heritage materials and the historic record of its people. This in turn broadens the intellectual context for domain scholars focusing on other societies, ancient and modern. Digital character recognition has brought about another milestone in collective communication by bringing inert, fixed-in-place text into an interactive digital realm. In doing so, the information has gained additional functionalities which expand our abilities to connect, combine, contextualize, share, and collaboratively pursue knowledge making. High-quality Internet content continues to grow in an explosive fashion. In the new global cyberenvironment, the functionalities and applications of digital information continue to transform knowledge into new understandings of human experience and the world in which we live. The possibilities for the future are limited only by available research resources and capabilities and the imagination and creativity of those who use them.
Arlington, Virginia Stephen M.

Contents

Section: Recognition of Indic scripts.- Building Data Sets for Indian Language OCR Research.- On OCR of Major Indian Scripts: Bangla and Devanagari.- A Complete Machine-Printed Gurmukhi OCR System.- Progress in Gujarati Document Processing and Character Recognition.- Design of a Bilingual Kannada-English OCR.- Recognition of Malayalam Documents.- A Complete OCR System for Tamil Magazine Documents.- Experiments on Urdu Text Recognition.- The BBN Byblos Hindi OCR System.- Generalization of Hindi OCR Using Adaptive Segmentation and Font Files.- Online Handwriting Recognition for Indic Scripts.- Section: Retrieval of Indic documents.- Enhancing Access to Primary Cultural Heritage Materials of India.- Digital Image Enhancement of Indic Historical Manuscripts.- GFG-Based Compression and Retrieval of Document Images in Indian Scripts.- Word Spotting for Indic Documents to Facilitate Retrieval.- Indian Language Information Retrieval.
