Search - Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "PyPDF2"

Found 14 matching component(s)

class RegulatoryExtractor

A class for extracting structured metadata from regulatory guideline PDF documents using LLM-based analysis and storing the results in an Excel tracking spreadsheet.

File: /tf/active/vicechatdev/reg_extractor.py

pdf-extraction regulatory-documents llm-extraction ocr data-extraction
function merge_pdfs_v1

Merges multiple PDF files into a single output PDF file with robust error handling and fallback mechanisms.

File: /tf/active/vicechatdev/msg_to_eml.py

pdf merge file-processing document-processing pdf-manipulation
class DocumentExtractor

A document text extraction class that supports multiple file formats including Word, PowerPoint, PDF, and plain text files, with automatic format detection and conversion capabilities.

File: /tf/active/vicechatdev/leexi/document_extractor.py

document-processing text-extraction pdf word powerpoint
class DocumentProcessor_v8

Process different document types for indexing

File: /tf/active/vicechatdev/docchat/document_indexer.py

class documentprocessor
class DocumentProcessor_v1

A document processing class that extracts text from PDF and Word documents using llmsherpa as the primary method with fallback support for PyPDF2, pdfplumber, and python-docx.

File: /tf/active/vicechatdev/contract_validity_analyzer/utils/document_processor_new.py

document-processing text-extraction pdf-processing word-processing llmsherpa
class DocumentProcessor_v2

A document processing class that extracts text from PDF and Word documents using llmsherpa as the primary method with fallback support for PyPDF2, pdfplumber, and python-docx.

File: /tf/active/vicechatdev/contract_validity_analyzer/utils/document_processor_old.py

document-processing text-extraction pdf-processing word-processing llmsherpa
class DocumentProcessor_v7

Lightweight document processor for chat upload functionality

File: /tf/active/vicechatdev/vice_ai/document_processor.py

class documentprocessor
function test_enhanced_pdf_processing

A comprehensive test function that validates PDF processing capabilities, including text extraction, cleaning, chunking, and table detection across multiple PDF processing libraries.

File: /tf/active/vicechatdev/vice_ai/test_enhanced_pdf.py

testing pdf-processing document-processing diagnostic text-extraction
function extract_metadata_pdf

Extracts metadata from PDF files including title, author, creation date, page count, and other document properties using PyPDF2 library.

File: /tf/active/vicechatdev/CDocs/utils/document_processor.py

pdf metadata extraction document-processing file-parsing
class DocumentDetail_v2

Document detail view component

File: /tf/active/vicechatdev/CDocs/ui/document_detail.py

class documentdetail
function extract_text_from_pdf_sample

Extracts text content from the first few pages of a PDF file for content comparison purposes, returning up to 5000 characters.

File: /tf/active/vicechatdev/mailsearch/enhanced_document_comparison.py

pdf text-extraction document-processing content-sampling file-processing
class RemarkableCloudWatcher

Monitors the reMarkable Cloud 'gpt_out' folder for new documents, automatically downloads them, and converts .rm (reMarkable native) files to PDF format.

File: /tf/active/vicechatdev/e-ink-llm/mixed_cloud_processor.py

remarkable cloud-storage file-watcher pdf-conversion document-extraction
class SessionDetector

Detects session information (conversation ID and exchange number) from PDF files using multiple detection methods including metadata, filename, footer, and content analysis.

File: /tf/active/vicechatdev/e-ink-llm/session_detector.py

pdf-processing session-detection conversation-tracking metadata-extraction pattern-matching
class RemarkableReplicaBuilder

Step-by-step replica builder

File: /tf/active/vicechatdev/e-ink-llm/cloudtest/local_replica_v2.py

class remarkablereplicabuilder

Search Examples

validation - Find validation functions
database - Find database-related components
email - Find email processing functions
api - Find API-related components
authentication - Find auth components

Search Components

Search Results for "PyPDF2"

class RegulatoryExtractor

function merge_pdfs_v1

class DocumentExtractor

class DocumentProcessor_v8

class DocumentProcessor_v1

class DocumentProcessor_v2

class DocumentProcessor_v7

function test_enhanced_pdf_processing

function extract_metadata_pdf

class DocumentDetail_v2

function extract_text_from_pdf_sample

class RemarkableCloudWatcher

class SessionDetector

class RemarkableReplicaBuilder

Search Examples