Chris Pollett > Students >
Riti

Deliverable 3: Extract the text in images using pytesseract, OpenCV and machine learning

The purpose of the project is to become familiar with image processing and text extraction.

IMPLEMENTATION DETAILS

Implemented in Python using OpenCV and pytesseract.

DATASET

Gathered the dataset by taking snapshots from Google.

STEPS USED

1. Read the image and store the pixel values in an array.
2. Remove the noise from the image.
3. Convert the image to black and white.
4. Recognize the text using pytesseract, the Python wrapper for the machine-learning-based Tesseract OCR engine.

RUNNING THE CODE

python google-tes-text-ocr.py

GITHUB :
CODE LINK