Recognition-Based Digitalization of Korean Historical Archives

Min Soo Kim; Sungho Ryu; Kyu Tae Cho; Taik Heon Rhee; Hyun Il Choi; Jin Hyung Kim

doi:10.1007/978-3-540-31871-2_24

Source

Lecture Notes in Computer Science > Information Retrieval Technology > Enabling Technology > 281-288

Abstract

We present a recognition-based digitization method for building a digital library from a large collection of historical archives. Because most of the archives are manually transcribed in ancient Chinese characters, their digitization presents unique academic and pragmatic challenges. By integrating layout analysis and recognition into a single probabilistic framework, our system achieved a 95.1% character recognition rate on the test data set, despite the obsolete characters and unique variants used in the archives. Combined with an intuitive verification and correction interface, the system freed operators from repetitive typing tasks and significantly improved overall throughput.

Identifiers

series ISSN: 0302-9743
series e-ISSN: 1611-3349
book ISBN: 978-3-540-25065-4
book e-ISBN: 978-3-540-31871-2
DOI: 10.1007/978-3-540-31871-2_24

Authors

Min Soo Kim

Korea Advanced Institute of Science and Technology, AIPR Lab., CS Div., Daejeon, Republic of Korea

Sungho Ryu

Korea Advanced Institute of Science and Technology, AIPR Lab., CS Div., Daejeon, Republic of Korea

Kyu Tae Cho

Korea Advanced Institute of Science and Technology, AIPR Lab., CS Div., Daejeon, Republic of Korea

Taik Heon Rhee

Korea Advanced Institute of Science and Technology, AIPR Lab., CS Div., Daejeon, Republic of Korea

Additional information

Data set: Springer

Publisher

Springer Berlin Heidelberg

chapter
