CS298 Proposal

Enhancing Viewability of Images of Text in PDF in Mobile Devices

Long N Vuong (longnvuong@yahoo.com)

Advisor: Dr. Chris Pollett (cpollett@gmail.com)

Committee Member 1: Dr. Agustin Araya (araya@cs.sjsu.edu)

Committee Member 2: Dr. John Avila (avila@cs.sjsu.edu)

Abstract:

In mobile devices, standard PDF readers such as Adobe Reader for Mobile Devices enable one to view PDF files provided they contain mainly text and at most small images. However, if we have a PDF file whose contents contain larger images, equations and scanned text in a large image, etc. then these readers just display the whole image. This can be especially bad if the image is larger than the screen size itself as one has to scroll both vertically and horizontally to try to understand the document. In this project, we will develop a reader which solves the problem of displaying scanned text in large images and which solves the problem of displaying equations. Our system will be robust, as it only needs to detect white space rather than do fancier techniques like optical character recognition, which might be hard in the case of math equations, handwriting, or nonstandard scripts.

CS297 Results

The first deliverable was the first step to setup our Java development environment and deploy a program onto mobile devices. This deliverable was to produce a simple J2ME demo program to run in mobile device. We successfully deployed the program onto mobile phones and PDAs via a web server.
The second deliverable was to get to know the PDF file structure and specification. The goal of this deliverable was to produce a program that extracted text from PDF file and saved them as JPEG image files. We was replied on iText as a Java library to read the content of PDF file. We extracted text in a PDF file. Then every word of the text was saved as a JPEG image.
The third deliverable was another program to know more about the PDF file structure and also the processing of different image types in Java. This deliverable was to produce a program that extracted images in a PDF file and saved them as JPEG, TIFF, PNG or GIF files. The program asks the user to input a PDF file and select the image file format that the user want all the images to be saved as.
We researched and learned how to process image files and extract words inside the images with surrounding white space.

Proposed Schedule

Week 1: 01/24/07-01/30/07	Write up and submit CS298 Proposal
Week 2: 01/30/07-02/06/07	Read Ch4 & Ch5 "Practical Algorithms For Image Analysis"
Week 3: 02/06/07-02/13/07	Work on Deliverable 1
Week 4: 02/13/07-02/20/07	Work on Deliverable 1
Week 5: 02/20/07-02/27/07	Due Deliverable 1 and Read "core J2ME"
Week 6: 02/27/07-03/06/07	Work on Deliverable 2
Week 7: 03/13/07-03/20/07	Work on Deliverable 2
Week 8: 03/20/07-03/27/07	Due Deliverable 2 and Write Final Report
Week 9: 03/27/07-04/03/07	Write Final Report
Week 10: 04/03/07-04/10/07	Submit Final Report
Week 11: 04/10/07-04/17/07	Revise Final Report and Work on Final Deliverable
Week 12: 04/17/04-04/24/07	Revise Final Report and Work on Final Deliverable
Week 13: 04/24/07-05/01/07	Final Deliverable Due
Week 14: 05/01/07-05/08/07	Prepare for Defense
Week 15: 05/08/07-05/15/07	Defense CS298 Project

Key Deliverables:

Software
- Deliverable 1: Developing a program called the PDF Mobile Helper to extract words inside image files and each word are saved as smaller image files. Then they are stored in a file that will be read by the PDF Mobile Viewer (Deliverable 2) in mobile devices.
- Deliverable 2: Developing a program called the PDF Mobile Viewer to read the file that was generated by the PDF Mobile Helper (Deliverable 1) and render the content to fit the small screen of mobile devices
- Final Deliverable: Revising Deliverable 1 and Deliverable 2 for bugs and GUIs to be ready for the defense.
Report
- Final Report
- Documentation for all deliverables

Innovations and Challenges

Processing binary image files and extracting meaningful segments such as words and pictures from them are challenging.
Extracting words in image files with surrounding white space is innovative.
Understanding are processing PDF file structure are challenging.
Enhancing Viewability of Images of Text in Mobile Devices is innovative.
The knowledge required to implement the project is diverse.

References:

[2000] Practical Algorithms for Image Analysis: Descriptions, Examples, and Code. Michael Seul, Lawrence O'Gorman, Michael J. Sammon. Cambridge University Press. April 15, 2000.

[1998] Fractal Image Encoding and Analysis. Norway, NATO Advanced Study Institute on Fractal Image Encoding and Analysis. Springer. November 1998.

[1994] Document Image Analysis. H. Bunke, P. S. P. Wang, Henry S. Baird. World Scientific Publishing Company. December 1994.

[2005] Acrobat SDK User's Guide. Adobe. http://partners.adobe.com/public/developer/en/acrobat /sdk/pdf/intro_to_sdk/UserGuide.pdf.

[2005] Acrobat and PDF Library API Reference. Adobe. http://partners.adobe.com/public/developer /en/acrobat/sdk/pdf/plugins/APIReference.pdf.

[2001] Core J2ME Technology. John W. Muchow. Prentice Hall PTR. December 21, 2001.

[2003] Enterprise J2ME: Developing Mobile Java Applications. Michael Juntao Yuan. Prentice Hall PTR. October 20, 2003.

[1997] Finding Text In Images. V. Wu, R. Manmatha and E. M. Riseman. http://www.cs.umass.edu /Dienst/UI/2.0/Describe/ncstrl.umassa_cs%2FUM-CS-1997-009.

[2002] Extraction of Text from Images. Pooja Nath. http://www.cse.iitk.ac.in/research/btp2002 /98263.html.