Multi-lingual Optical Character Recognition Seminar

Spring 2019 Schedule of Meetings and Talks

Usual meeting time: Friday, 11:00am - 11:50pm. Room: ENR2 S395.

# Date Room Speaker Title and Abstract
6 Friday, 2-12-2019 ENR2 S395 Dylan Murphy
  • Dylan Murphy (University of Arizona) will discuss the OCR software. Open source packages Kraken and Tesseract will be discussed. The talk will cover use and system architecture of these systems, as well as the process of training for new language recognition.
5 Friday, 2-15-2019 ENR2 S395 Sayyed Vazirizade
  • Sayyed Vazirizade (University of Arizona) will review Persian OCR software.
4 Friday, 2-8-2019 ENR2 S395 Ryan Coatney, Yan Han
  • Ryan Coatney (University of Arizona) will continue talking about a paper by Kobus et. all, applying Gaussian processes to modeling 1-dimensional structures (plants), and potential applications to OCR (est. 25 min).
  • Yan Han (University of Arizona) will talk about APIs for embedding text in PDF (est. 25 min).
3 Friday, 2-1-2019 ENR2 S395 Mike Maizels, Ryan Coatney
  • Mike Maizels (Harvard and University of Arkansas) will discuss an arts-related project involving OCR (15 min).
  • Ryan Coatney (University of Arizona) will talk about a paper by Kobus et. all, applying Gaussian processes to modeling 1-dimensional structures (plants), and potential applications to OCR (30 min).
2 Friday, 1-25-2019 ENR2 S395 Marek Rychlik A method for Chinese OCR using Hough and Fourier transforms. I will explain the algorithm published in our GitHub repository. Also, I will briefly describe the selected papers which we can collectively study. The slides of this talk are available.
1 Friday, 1-18-2019 ENR2 S395 Organizational meeting. Agenda will include:
  • Introductions
  • Description of the NEH grant research
  • Resources for Pashto and Chinese

Zoom recordings

They are available on the restricted page of this website. However, you need to ask the organizers for the credentials to access this page.

The organizers