GADP-parser

Description

This project provides a Python script to extract text from images of Georgia driver's permits using Optical Character Recognition (OCR) with Tesseract-OCR. The script enhances the image, extracts text, and saves the text to a CSV file.

Features

Image enhancement using OpenCV
OCR text extraction with Tesseract-OCR
Saves extracted text to a CSV file
Sample image from the Georgia DDS provided for demonstration

Disclaimer

This project is independent of the Georgia Department of Driver Services (DDS) and is not officially endorsed by or affiliated with DDS. The information provided in this project is for educational and informational purposes only. DDS is not responsible for any errors or omissions in the data or for any consequences arising from the use of this information.

Installation

Clone the repository:

git clone https://github.com/yourusername/your-repo.git
cd your-repo

Install dependencies:
```
pip install -r requirements.txt
```

Ensure Tesseract-OCR is installed. Download it from here and update the PATH in the script (if necessary):

git clone https://github.com/yourusername/your-repo.git
cd your-repo

Usage

Place the image of the driver's license in the same directory as the script or provide the path to the image.
Run the script:
```
python 1-write.py
```

The extracted text will be saved to drivers_license_data.csv

your-repo/
│
├── README.md
├── requirements.txt
├── 1-write.py
├── drivers_license.jpg
├── enhanced_image.jpg (generated in the program)
└── drivers-license_data.csv (generated in the program)

Sample Output

The extracted text from the driver's license will be saved in drivers_license_data.csv, with each line representing a separate piece of text extracted from the image.

Contributing

Fork the repository.
Create a feature branch (git checkout -b feature-branch).
Commit your changes (git commit -am 'Add new feature'). -Make frequent and small commits!
Push to the branch (git push origin feature-branch).
Create a new Pull Request. -Once your request is approved you are now a contributor!

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgements

Georgia DDS: Sample image used in this project.
OpenCV - Computer vision library (Apache License 2.0)
Pytesseract - Python wrapper for Tesseract (Apache License 2.0)
Pandas - Data analysis library (BSD 3-Clause License)
Pillow - Image processing library (HPND License)

Additional Files

requirements.txt: List of dependencies:
- opencv-python
- pytesseract
- pandas
- Pillow

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
1-write.py		1-write.py
LICENSE		LICENSE
NOTICE.md		NOTICE.md
README.md		README.md
drivers-license.jpg		drivers-license.jpg
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GADP-parser

Description

Features

Disclaimer

Installation

Usage

Sample Output

Contributing

License

Acknowledgements

Additional Files

About

Releases

Packages

Languages

License

patelvedantp/GADP-parser

Folders and files

Latest commit

History

Repository files navigation

GADP-parser

Description

Features

Disclaimer

Installation

Usage

Sample Output

Contributing

License

Acknowledgements

Additional Files

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages