May 202012
 

This blog has discussed several options for performing OCR on Chinese texts, but the options all required a desktop or laptop computer (Google Docs, Adobe Acrobat, Sciweavers i2OCR). In this post, we’ll look at several options for OCRing Chinese on iOS devices.

The contenders:

All of these require starting from an image file, not a PDF. You can take a photo with your device’s camera or import one from the Photos app. My tests used two different photos taken with an iPhone 4S and iPad 3, so your results may vary.

Round one

A page from an adaptation of 西游记 Journey to the West. This book has a rounded font style that I thought might be tough to OCR.

A bit of the text from Round 1 showing the font

A bit of the text from Round 1 showing the font

ABBYY TextGrabber: TextGrabber made about 13 character recognition errors in about 200 characters. Given its misreading of zhōng as a circled 1 ①, the roundness of the font may have been problematic.

LRDict: LRDict had even more problems with this text, repeatedly misreading common words like de, yǒu, hěn, hé, tiān, gè, kāi, dà, zài, and rì, among others. OCR is seldom perfect, but this result was unacceptable, with over a third of the characters wrong.

Pleco: Pleco’s errors were almost all with punctuation. It consistently failed to recognize the Chinese period ( ) and serial comma ( ), as well as quotation marks. Otherwise it split into two characters (and ) several times (although it recognized successfully elsewhere), but made no other errors with actual hanzi.

Round one winner: Pleco

Round two

A page from 第三只眼睛 The Third Eye, the first title in level 3 of the Chinese Breeze series of graded readers (more about the series in this post). This text was tricky because it includes footnote numbers, but if you’re scanning academic texts, this might be a real issue.

ABBYY TextGrabber: TextGrabber did much better in this round. Only one hanzi was misread (perhaps because it was at the edge of the page) and all the punctuation was correct. Otherwise TextGrabber only had a problem with the number 40, which was read as a 4 and something else (either a D or a Chinese period).

LRDict: LRDict also did better, but was still the worst of the three. Five hanzi were misread in the majority of the passage, but the last couple of lines got increasingly garbled. There were also problems with the numbers and punctuation.

Pleco: Pleco misread four hanzi, but three of them were shì (the fourth error was giving rù for rén). Periods were not a problem this time, but quotation marks were still always wrong. The number 40 was misidentified as various Chinese characters, although 2 and 39 were fine.

Round two winner: ABBYY TextGrabber

Tiebreaker round

Catullus 13. This began life as a PDF scanned (on a Ricoh Aficio MP 5000) from a Chinese edition of the poems of the Roman poet Catullus. The PDF was then saved as jpeg image (remember that these apps cannot OCR PDFs directly). The quality of this image is clearly worse than that of the pictures taken with the latest iPhone/iPad cameras.

ABBYY TextGrabber: 12 hanzi were misread and a string of three was missed entirely.

LRDict got disqualified from this last round because after OCRing, I was unable to select the whole text for copying/pasting into another app. It was out of the running already anyway.

Pleco: Once again, Pleco had some problems with punctuation, but it only missed one hanzi; it didn’t see an instance of yī.

The overall winner: Pleco!

Just considering the accuracy of recognizing Chinese characters, Pleco was the most reliable. If the developers work on its handling of punctuation and numerals, there would almost be no contest.

The ultimate decision as to which app to choose may well depend on why you are scanning the text. Do you want to have it machine-translated? Do you you want to read it with the help of a dictionary? Do you want to extract words and add them to a flashcard app? With the final goal in mind, you may want to consider the other features of the apps in question.

ABBYY Textgrabber offers translation of the whole text into a number of languages (powered by Google Translate). An action button gives you one-tap options to copy, e-mail, post to Twitter and Facebook, send to Evernote, and more.

LRDict was noticeably slower than the others. A bigger problem was the fact that the “move and scale” options did not allow me to get the entire photo into the app’s squarish OCR area (why not match the dimensions of an iPhone photo?) Even the last lines that fit in the OCR area came out very garbled. And, as mentioned above, I could not always access all of the OCRed text.

If these problems somehow don’t affect you, LRDict has some features that could be useful, if they worked properly. Tapping on the image (not the OCRed text) brings up a version of the text that is color-coded by tone and has ruby pinyin. Here you can tap a character to look it up in the LRDict dictionaries (cc-cedict for English). For some reason, however, this view would only display the first part of a text.

Pleco’s OCR offers benefits by integrating with other features of the app, namely its dictionaries and flashcards. These will be discussed further in an upcoming post.



May 142012
 
Chrome Language Immersion browser plug-in

The web is, of course, a limitless source of authentic reading material for students of Chinese (not to mention other languages). But only quite advanced students can read most unadapted web pages. So there has been a fair amount of excitement online (Lifehacker, Engadget, EdSurge newsletter) about a plug-in for the Google Chrome browser called Language Immersion for Chrome that aims to turn any webpage into level-appropriate reading material for language students. Well, that may be overstating its goals; perhaps it would be more accurate to say that the plug-in lets you work some language study into your regular web browsing. Once installed, you start from a page in English (for example) and at the click of a button, the plug-in translates some words and phrases into the language of your choice (including either simplified or traditional Chinese). The number of words/phrases translated into the target language is determined by a setting matched to … [read more]

May 062012
 
iOS app: trainchinese Pinyin Trainer

Pinyin Trainer from trainchinese tests your ability to distinguish pinyin initials, tones, and/or sequences of two tones. You can choose to be tested on any one of these or a combination of all three. There seems to be no option, however, for testing pinyin finals. So if you need work distinguishing gān from gāng, for example, this app won’t help. The format is multiple choice, ranging from 2-4 choices for each syllable; there is also an “artist mode” if you want to sketch tone marks. The app keeps a running tally of right/wrong responses. These are not just numbers, but rather a list of the words tested and what your response was. This is great for finding patterns of errors, for example, if you have trouble distinguishing x from sh, or second tones from third. On an iPad in landscape orientation (as in the image above), this list is constantly … [read more]

Apr 282012
 
Free iPad app: AllSet Learning Pinyin

About a week ago, I tweeted about a new app for learning pinyin from AllSet Learning. AllSet Learning Pinyin is essentially a classic chart with initials down the left and finals across the top. The whole chart does not fit within the iPad screen; you swipe to scroll up, down, and across. You can also use standard gestures to zoom in and out. Subtle shading divides the chart into zones that make it easier to see where you are. All in all, it’s very readable, with a clean, san serif font and generous spacing. Tone marks are printed on each syllable; you can choose from any one of the four tones with the tap of a button. This not only affects the display, but determines the pronunciation of the syllable when you tap to hear it (playing audio can be disabled in the settings). One thing that I would like … [read more]

Apr 222012
 
Harvard's "lion turtle"

A sure sign that spring has sprung in Harvard Yard (apart from the overpowering smell of mulch) is the reappearance of the Harvard University bixi from its winter wrapping. The term bìxì (赑屃 / 贔屭) refers to a dragon with the body of a tortoise. Stone bixi often serve as a base for a tablet or stele. The 17-foot tall, 20+ ton marble sculpture was given to the university on the occasion of its tercentennial in 1936 by alumni who were members of Harvard Clubs in China. Although the bixi dates from the Qing dynasty, it was re-inscribed for Harvard. The annual winter wrapping, however, only began in the late ’90s and the inscription is severely eroded. The following transcription is taken from the Wikimedia page devoted to the bixi. 文化为国家之命脉国家之所以兴也繇于文化而文化之所以盛也实繇于学深识远 见之士知立国之本必亟以兴学为先创始也艰自是光大而扩充之而其文化之宏往 往收效于数百年间而勿替是说也徵之于美国哈佛大学滋益信矣哈佛约翰先生于 三百年前由英之美讲学于波士顿市嗣在剑桥设立大学即以哈佛名之规制崇闳学 课美备因而人才辈出为世界有名之学府与美国之国运争荣哈佛先生之深识远见 其有造于国家之文化也大矣我国为东方文化古国然世运推移日新月异志学之士 复负笈海外以求深造近三十年来就学于哈佛大学学成归国服务于国家社会者先 后几达千人可云极盛今届母校成立三百年纪念之期同人等感念沾漑启迪之功不 能无所表献自兹以往当见两国文化愈益沟通必更光大扩充之使国家之兴盛得随 学问之进境以增隆斯则同人等之所馨香以祝而永永纪念不忘者尔 西历一九三六年九月哈佛大学中国留学生全体同人敬立 A partial translation appears on the bixi’s Wikipedia page. … [read more]