Convert PDF into an audiobook.
Updated Nov 9, 2020 - Python
This project extracts text from PDF files using various Python libraries. It is designed to be flexible: you can choose among different text extraction libraries, and it accepts either a single PDF file or a directory containing multiple PDF files.
NLP model for extracting Chinese data from documents.
This is my exploration of a variety of Python 🐍 libraries. I have built geospatial data analytics systems from CSV files, and image and video processing tools such as face detection and motion detection. I also built a website with Flask (and three.js) and apps that connect to several types of databases. Created a simple budgeting app that reads, wr…
Extracts details from resumes (CVs) and matches them against job descriptions (JDs) using a pretrained model such as DistilBERT, ranking the resumes by cosine similarity.
Common Python PDF parsing utilities 📑
Interface for extracting information from the web through scraping and summarizing the given data.
Scrapes data tables from a PDF file.
Business objective: the document classification solution should significantly reduce manual effort in HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
The program parses several PDF reports, finds the required information about the batch and vials, compiles a report, and creates an Excel file containing that report.
A sample script that extracts text data from a PDF file, converts it to a pandas DataFrame, and saves it as a CSV file.
PDF to MP3 audiobook converter.
Collects data on socioeconomic indicators for the territorial divisions of the city of Barcelona from the Barcelona City Hall Open Data Service.
This repository contains a Python script for comparing PDF files between a local source folder and a remote server. The script logs results, highlighting identical and non-identical files based on size and page count. It employs "pdfplumber" for PDF handling and "paramiko" for SSH connections.
Scrapes hazardous waste data from a website and PDF file. Cleans and analyzes the data. Prepares the data for mapping.
Creates a knowledge graph from PDFs.
GUI app for parsing specific PDF files (data from the standardized Vehicle Registration smart card, Republic of Serbia) and generating a data file for a specific use case.
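Several of the projects listed above extract text from either a single PDF file or a directory of PDF files. The following is a minimal sketch of that pattern using pdfplumber, not code taken from any of the listed repositories. The function names `collect_pdf_paths`, `extract_text`, and `extract_all` are hypothetical, and the sketch assumes pdfplumber is installed.

```python
from pathlib import Path


def collect_pdf_paths(target):
    """Return a sorted list of PDF paths for a single file or a directory.

    Hypothetical helper: a directory is scanned non-recursively.
    """
    target = Path(target)
    if target.is_dir():
        return sorted(p for p in target.iterdir() if p.suffix.lower() == ".pdf")
    if target.suffix.lower() == ".pdf":
        return [target]
    return []


def extract_text(pdf_path):
    """Extract the text of every page of one PDF with pdfplumber."""
    # Third-party dependency, imported lazily so the path handling
    # above can be used without pdfplumber installed.
    import pdfplumber

    with pdfplumber.open(pdf_path) as pdf:
        # Fall back to "" in case a page yields no text (e.g. scanned images).
        return "\n".join(page.extract_text() or "" for page in pdf.pages)


def extract_all(target):
    """Map each PDF path under `target` to its extracted text."""
    return {str(path): extract_text(path) for path in collect_pdf_paths(target)}
```

Calling `extract_all("reports/")` would process every PDF directly inside a `reports/` directory, while `extract_all("reports/summary.pdf")` would process only that file; both paths are placeholders.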