GOCR

GOCR
Developer(s)	Jörg Schulenburg
Initial release	December 2000; 25 years ago
Stable release	0.51 / August 5, 2017; 8 years ago
Written in	C
Operating system	Linux, Windows, OS/2
Type	Optical character recognition
License	GNU General Public License
Website	www-e.uni-magdeburg.de/jschulen/ocr/ jocr.sourceforge.net(may out of date)

GOCR (or JOCR) is a free optical character recognition program, initially written by Jörg Schulenburg. It can be used to convert or scan image files (portable pixmap or PCX) into text files.^[1]

Features

GOCR claims it can handle single-column sans-serif fonts of 20–60 pixels in height. It reports trouble with serif fonts, overlapping characters, handwritten text, heterogeneous fonts, noisy images, large angles of skew, and text in anything other than a Latin alphabet.^[2]

GOCR can also translate barcodes.^[2]

User interface

GOCR can be used as a stand-alone command-line application, or as a back-end to other programs. It comes with a gocr.tcl graphic interface. GOCR can be also used as an OCR engine in OCRFeeder.^[3]

Development

Version 0.3.0 was released in December 2000, 0.3.5 in February 2002, and 0.37 in August 2002.

Between version 0.40 (March 2005) and 0.43 (December 2006), the recognition engine was gradually replaced with a vector version.

Version 0.48 was released in August 2009.^[1]

Version 0.49 was released in September 2010.^[1]

Version 0.50 was released in March 2013.^[1]

Version 0.51 was released in August 2017.^[4]

Nomenclature

The application was originally named GOCR which stands for GNU Optical Character Recognition. When it came time to register the project on SourceForge the name GOCR was already taken so the project was registered as JOCR (Jörg's Optical Character Recognition).^[1]^[2]

As a result of this situation the project and application are known as both GOCR and JOCR. Schulenburg admits that this is problematic.^[1]

Formats

Acceptable image formats are:^[2]

PNM
PBM
PGM
PPM
PCX (some)
TGA

Other formats are automatically converted using netpbm-progs, gzip and bzip2 via the use of a unix pipe. These images types include:^[2]

pnm.gz
pnm.bz2
PNG
JPG
TIFF
GIF
BMP

References

↑ ^1.0 ^1.1 ^1.2 ^1.3 ^1.4 ^1.5 Schulenburg, Joerg (March 2007). "GOCR". http://jocr.sourceforge.net/. Retrieved 2008-06-25.
↑ ^2.0 ^2.1 ^2.2 ^2.3 ^2.4 SfR Fresh (n.d.). "Member "gocr-0.45/README" of archive gocr-0.45.tar.gz". Archived from the original on 2008-09-08. https://web.archive.org/web/20080908131653/http://www.sfr-fresh.com/unix/privat/gocr-0.45.tar.gz%3Aa/gocr-0.45/README. Retrieved 2008-06-25.
↑ "GNOME GIT source code repository". http://git.gnome.org/browse/ocrfeeder/tree/README. Retrieved 3 February 2010.
↑ "GOCR Main Page". http://www-e.uni-magdeburg.de/jschulen/ocr/. Retrieved 28 September 2017.

External links

GOCR Main Page
GOCR on SourceForge.net (may be out of date)

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/GOCR. Read more

[GOCRhomepage-1] 1.0 ^1.1 ^1.2 ^1.3 ^1.4 ^1.5 Schulenburg, Joerg (March 2007). "GOCR". http://jocr.sourceforge.net/. Retrieved 2008-06-25.

[SfRFresh-2] 2.0 ^2.1 ^2.2 ^2.3 ^2.4 SfR Fresh (n.d.). "Member "gocr-0.45/README" of archive gocr-0.45.tar.gz". Archived from the original on 2008-09-08. https://web.archive.org/web/20080908131653/http://www.sfr-fresh.com/unix/privat/gocr-0.45.tar.gz%3Aa/gocr-0.45/README. Retrieved 2008-06-25.

[3] "GNOME GIT source code repository". http://git.gnome.org/browse/ocrfeeder/tree/README. Retrieved 3 February 2010.

[4] "GOCR Main Page". http://www-e.uni-magdeburg.de/jschulen/ocr/. Retrieved 28 September 2017.

[1]

[2]

[3]

[4]

v t e Optical character recognition software
Free software	CuneiForm GOCR Ocrad OCRFeeder OCRopus Tesseract
Proprietary software	ABBYY FineReader Asprise OCR Microsoft Office Document Imaging OmniPage ReadSoft SmartScore TeleForm VueScan
See also	Comparison of optical character recognition software