🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "PyPDF2"

Found 14 matching component(s)

  • class RegulatoryExtractor

    A class for extracting structured metadata from regulatory guideline PDF documents using LLM-based analysis and storing the results in an Excel tracking spreadsheet.

    File: /tf/active/vicechatdev/reg_extractor.py

    pdf-extraction regulatory-documents llm-extraction ocr data-extraction
  • function merge_pdfs_v1

    Merges multiple PDF files into a single output PDF file with robust error handling and fallback mechanisms.

    File: /tf/active/vicechatdev/msg_to_eml.py

    pdf merge file-processing document-processing pdf-manipulation
  • class DocumentExtractor

    A document text extraction class that supports multiple file formats including Word, PowerPoint, PDF, and plain text files, with automatic format detection and conversion capabilities.

    File: /tf/active/vicechatdev/leexi/document_extractor.py

    document-processing text-extraction pdf word powerpoint
  • class DocumentProcessor_v8

    Process different document types for indexing

    File: /tf/active/vicechatdev/docchat/document_indexer.py

    class documentprocessor
  • class DocumentProcessor_v1

    A document processing class that extracts text from PDF and Word documents using llmsherpa as the primary method with fallback support for PyPDF2, pdfplumber, and python-docx.

    File: /tf/active/vicechatdev/contract_validity_analyzer/utils/document_processor_new.py

    document-processing text-extraction pdf-processing word-processing llmsherpa
  • class DocumentProcessor_v2

    A document processing class that extracts text from PDF and Word documents using llmsherpa as the primary method with fallback support for PyPDF2, pdfplumber, and python-docx.

    File: /tf/active/vicechatdev/contract_validity_analyzer/utils/document_processor_old.py

    document-processing text-extraction pdf-processing word-processing llmsherpa
  • class DocumentProcessor_v7

    Lightweight document processor for chat upload functionality

    File: /tf/active/vicechatdev/vice_ai/document_processor.py

    class documentprocessor
  • function test_enhanced_pdf_processing

    A comprehensive test function that validates PDF processing capabilities, including text extraction, cleaning, chunking, and table detection across multiple PDF processing libraries.

    File: /tf/active/vicechatdev/vice_ai/test_enhanced_pdf.py

    testing pdf-processing document-processing diagnostic text-extraction
  • function extract_metadata_pdf

    Extracts metadata from PDF files including title, author, creation date, page count, and other document properties using PyPDF2 library.

    File: /tf/active/vicechatdev/CDocs/utils/document_processor.py

    pdf metadata extraction document-processing file-parsing
  • class DocumentDetail_v2

    Document detail view component

    File: /tf/active/vicechatdev/CDocs/ui/document_detail.py

    class documentdetail
  • function extract_text_from_pdf_sample

    Extracts text content from the first few pages of a PDF file for content comparison purposes, returning up to 5000 characters.

    File: /tf/active/vicechatdev/mailsearch/enhanced_document_comparison.py

    pdf text-extraction document-processing content-sampling file-processing
  • class RemarkableCloudWatcher

    Monitors the reMarkable Cloud 'gpt_out' folder for new documents, automatically downloads them, and converts .rm (reMarkable native) files to PDF format.

    File: /tf/active/vicechatdev/e-ink-llm/mixed_cloud_processor.py

    remarkable cloud-storage file-watcher pdf-conversion document-extraction
  • class SessionDetector

    Detects session information (conversation ID and exchange number) from PDF files using multiple detection methods including metadata, filename, footer, and content analysis.

    File: /tf/active/vicechatdev/e-ink-llm/session_detector.py

    pdf-processing session-detection conversation-tracking metadata-extraction pattern-matching
  • class RemarkableReplicaBuilder

    Step-by-step replica builder

    File: /tf/active/vicechatdev/e-ink-llm/cloudtest/local_replica_v2.py

    class remarkablereplicabuilder

Search Examples