In 2005, Tesseract was obtained by the Nevada Institute of Information Technology in the United States, and it turned to Google to improve Tesseract, eliminate bugs, and optimize work. Tesseract has been released as an open-source project in Google Project, and its latest version 3.0 already supports Chinese OCR and provides a command line tool. It is mainly used to identify the text of scanned documents/pictures, including contracts, invoices, etc., which can easily reduce the work that requires manpower.