🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)
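
As a rough illustration of the Full-Text mode, the sketch below does a case-insensitive substring match over each component's name, description, and tags. The `Component` record and the `full_text_search` function are hypothetical names introduced for this example, and the sample entries are abbreviated from the results on this page. This is not the tool's actual implementation, which may also use stemming, ranking, or an index.

```python
from dataclasses import dataclass, field


@dataclass
class Component:
    """Hypothetical record for one indexed component (not the tool's real schema)."""
    name: str
    description: str
    tags: list[str] = field(default_factory=list)


def full_text_search(components, query):
    """Return components whose name, description, or tags contain the
    query as a case-insensitive substring (assumed matching rule)."""
    needle = query.lower()
    hits = []
    for comp in components:
        # Join every searchable field into one lowercase string.
        haystack = " ".join([comp.name, comp.description, *comp.tags]).lower()
        if needle in haystack:
            hits.append(comp)
    return hits


# Sample data abbreviated from the results listed on this page.
components = [
    Component(
        "QueryParser",
        "Converts LLM-generated query response text into structured dictionaries.",
        ["parser", "LLM", "query-processing"],
    ),
    Component(
        "basic_markdown_to_html",
        "Converts basic Markdown syntax to HTML.",
        ["markdown", "html", "conversion"],
    ),
    Component(
        "simple_markdown_to_html",
        "Converts a subset of Markdown syntax to clean HTML.",
        ["markdown", "html", "parser"],
    ),
]

matches = full_text_search(components, "parser")
print([c.name for c in matches])
```

With a plain substring rule, `basic_markdown_to_html` is skipped because none of its fields contain "parser". Finding it from the same query would need the Semantic mode or a looser matching rule.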

Search Results for "parser"

Found 47 matching component(s)

  • class ReferenceManager_v2

    Manages extraction and formatting of references for LLM chat responses. Handles both file references and BibTeX citations, formatting them according to various academic citation styles.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG copy.py

    class referencemanager
  • function main_v11

    Command-line interface function that orchestrates pattern-based extraction of poultry flock data, including data loading, pattern classification, geocoding, and export functionality.

    File: /tf/active/vicechatdev/pattern_based_extraction.py

    cli command-line-interface data-extraction poultry-data pattern-analysis
  • class ReferenceManager_v3

    Manages extraction and formatting of references for LLM chat responses. Handles both file references and BibTeX citations, formatting them according to various academic citation styles.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG_old.py

    class referencemanager
  • function main_v70

    Entry point function for a FileCloud ACL management test script that parses command-line arguments and initiates ACL testing.

    File: /tf/active/vicechatdev/test_acl_functions.py

    cli command-line testing filecloud acl
  • function main_v24

    Command-line interface function that orchestrates the generation of meeting minutes from a transcript file using either GPT-4o or Gemini LLM models.

    File: /tf/active/vicechatdev/advanced_meeting_minutes_generator.py

    cli command-line meeting-minutes transcript-processing llm
  • class ReferenceManager_v4

    Manages extraction and formatting of references for LLM chat responses. Handles both file references and BibTeX citations, formatting them according to various academic citation styles.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    class referencemanager
  • function eml_to_pdf

    Converts an .eml email file to PDF format, including the email body and all attachments merged into a single PDF document.

    File: /tf/active/vicechatdev/msg_to_eml.py

    email-processing pdf-conversion eml-parser document-conversion attachment-handling
  • function main_v69

    Entry point function that parses command-line arguments and orchestrates the FileCloud email processing workflow to find, download, and convert .msg files.

    File: /tf/active/vicechatdev/msg_to_eml.py

    cli command-line entry-point filecloud email-processing
  • class DocxMerger

    A class named DocxMerger

    File: /tf/active/vicechatdev/word_merge.py

    class docxmerger
  • function main_v31

    Entry point function that initializes and serves the CDocs Panel web application with configurable port and debug mode options.

    File: /tf/active/vicechatdev/cdocs_panel_app.py

    entry-point web-application panel server cli
  • class DocumentExtractor

    A document text extraction class that supports multiple file formats including Word, PowerPoint, PDF, and plain text files, with automatic format detection and conversion capabilities.

    File: /tf/active/vicechatdev/leexi/document_extractor.py

    document-processing text-extraction pdf word powerpoint
  • function main_v3

    Command-line interface function that orchestrates the generation of enhanced meeting minutes from transcript files and PowerPoint presentations using various LLM models (GPT-4o, Azure GPT-4o, or Gemini).

    File: /tf/active/vicechatdev/leexi/enhanced_meeting_minutes_generator.py

    cli command-line meeting-minutes llm gpt-4
  • function main_v48

    Command-line interface entry point for monitoring SharePoint to FileCloud synchronization logs, providing status analysis, log tailing, and real-time watching capabilities.

    File: /tf/active/vicechatdev/SPFCsync/monitor.py

    cli command-line-interface log-monitoring sharepoint filecloud
  • function main_v16

    Main entry point for a SharePoint to FileCloud synchronization application that handles command-line arguments, connection testing, and orchestrates single or continuous sync operations.

    File: /tf/active/vicechatdev/SPFCsync/main.py

    cli entry-point synchronization sharepoint filecloud
  • function parse_arguments_v1

    Parses command-line arguments for a legal contract extraction tool that processes documents from FileCloud storage.

    File: /tf/active/vicechatdev/contract_validity_analyzer/extractor.py

    cli command-line argument-parsing argparse configuration
  • function parse_arguments

    Parses command-line arguments for a contract validity analysis tool that processes FileCloud documents with configurable options for paths, concurrency, output, and file filtering.

    File: /tf/active/vicechatdev/contract_validity_analyzer/main.py

    cli argument-parsing command-line argparse configuration
  • function main_v105

    Command-line interface function that orchestrates the cleaning of ChromaDB collections by removing duplicates and similar documents, with options to skip collections and customize the cleaning process.

    File: /tf/active/vicechatdev/chromadb-cleanup/main.py

    cli command-line chromadb database-cleaning deduplication
  • function main_v89

    Command-line interface function that orchestrates a ChromaDB collection cleaning pipeline by removing duplicate and similar documents through hashing and similarity screening.

    File: /tf/active/vicechatdev/chromadb-cleanup/main copy.py

    cli command-line data-cleaning deduplication chromadb
  • function test_us_with_thousands

    A unit test function that validates the smart_read_csv function's ability to correctly parse US-formatted CSV files containing numbers with thousand separators (commas) and decimal points.

    File: /tf/active/vicechatdev/vice_ai/test_regional_formats.py

    testing unit-test csv-parsing data-validation number-formatting
  • function process_markdown_content

    Parses markdown-formatted text content and converts it into a structured list of content elements with type annotations and formatting metadata suitable for document export.

    File: /tf/active/vicechatdev/vice_ai/complex_app.py

    markdown parser document-processing text-processing content-conversion
  • function basic_markdown_to_html

    Converts basic Markdown syntax to HTML without using external Markdown libraries, handling headers, lists, code blocks, and inline formatting.

    File: /tf/active/vicechatdev/vice_ai/complex_app.py

    markdown html conversion text-processing formatting
  • function test_markdown_processing

    A test function that validates markdown processing capabilities by testing content parsing, element extraction, and HTML conversion functionality.

    File: /tf/active/vicechatdev/vice_ai/test_markdown.py

    testing markdown html-conversion text-processing unit-test
  • function simple_markdown_to_html

    Converts a subset of Markdown syntax to clean HTML, supporting headers, bold text, unordered lists, and paragraphs.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    markdown html conversion text-formatting parser
  • function html_to_plain_text_with_formatting

    Parses HTML content and converts it to plain text while preserving formatting information, returning a list of text segments with their associated format types (headers, bold, or normal).

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    html-parsing text-extraction document-processing format-preservation html-to-text
  • function html_to_markdown_v1

    Converts HTML markup to Markdown syntax, handling headers, code blocks, text formatting, links, lists, and paragraphs with proper spacing.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    html markdown conversion text-processing formatting
  • function process_markdown_content_v1

    Parses markdown-formatted text content and converts it into a structured list of document elements (headers, paragraphs, lists, tables, code blocks) with their types and formatting preserved in original order.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    markdown parser document-processing text-processing content-conversion
  • function add_formatted_content_to_word

    Converts processed markdown elements into formatted content within a Word document, handling headers, paragraphs, lists, tables, and code blocks with appropriate styling.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    word-document markdown-conversion document-formatting python-docx content-generation
  • function main_v18

    Command-line interface function that orchestrates database migration from a legacy document AI system to a new architecture, with options for verification and sample data creation.

    File: /tf/active/vicechatdev/vice_ai/migration.py

    migration database cli command-line data-migration
  • class ApprovalPanel_v1

    Approval management interface component

    File: /tf/active/vicechatdev/CDocs/ui/approval_panel.py

    class approvalpanel
  • class ReviewPanel_v1

    Review management interface component

    File: /tf/active/vicechatdev/CDocs/ui/review_panel.py

    class reviewpanel
  • function main_v74

    Entry point function that orchestrates vendor enrichment testing by parsing command-line arguments, running setup validation, and executing a single vendor test against a ChromaDB collection.

    File: /tf/active/vicechatdev/find_email/test_enrichment.py

    testing vendor-enrichment command-line argparse chromadb
  • function main_v20

    Command-line interface function that orchestrates the enrichment of vendor data from an Excel file with email and VAT information using ChromaDB and RAG engine.

    File: /tf/active/vicechatdev/find_email/vendor_enrichment.py

    cli command-line data-enrichment vendor-management excel-processing
  • function main_v38

    Command-line entry point that parses arguments and orchestrates the extraction of vendor emails from all vicebio.com mailboxes using Microsoft Graph API.

    File: /tf/active/vicechatdev/find_email/extract_vendor_batch.py

    cli command-line entry-point argparse email-extraction
  • function main_v116

    Command-line interface function that parses arguments and sends a test email through an SMTP forwarder service, displaying connection details and returning an exit code based on success.

    File: /tf/active/vicechatdev/email-forwarder/send_test_email.py

    cli email smtp testing command-line
  • class QueryParser

    A parser class that converts LLM-generated query response text into structured dictionaries containing various search query types, metadata, and parameters.

    File: /tf/active/vicechatdev/QA_updater/core/query_parser.py

    parser LLM query-processing text-parsing structured-data
  • class QAUpdater

    Orchestrates a two-step Q&A document updating process that generates optimal search queries, retrieves information from internal and external sources, and uses an LLM to determine if updates are needed.

    File: /tf/active/vicechatdev/QA_updater/qa_engine/qa_updater.py

    qa-management document-updating llm-orchestration information-retrieval vector-search
  • class QAUpdater_v1

    Orchestrates the Q&A document updating process.

    File: /tf/active/vicechatdev/QA_updater/qa_engine/qa_updater_full.py

    class qaupdater
  • function main_v5

    Main entry point function for an invoice processing system that monitors an inbound directory for PDF invoices, processes them using LLM extraction, generates Excel outputs, and moves processed files to a processed directory.

    File: /tf/active/vicechatdev/invoice_extraction/main.py

    invoice-processing pdf-processing llm-extraction batch-processing file-monitoring
  • class DocumentProcessor_v3

    A comprehensive PDF document processor that handles text extraction, OCR (Optical Character Recognition), layout analysis, table detection, and metadata extraction from PDF files.

    File: /tf/active/vicechatdev/invoice_extraction/core/document_processor.py

    pdf-processing ocr text-extraction document-processing invoice-processing
  • class ReferenceManager_v5

    Manages extraction and formatting of references for LLM chat responses. Handles both file references and BibTeX citations, formatting them according to various academic citation styles.

    File: /tf/active/vicechatdev/datacapture_backup_16072025/OneCo_hybrid_RAG.py

    class referencemanager
  • function main_v112

    Entry point function that parses command-line arguments to fix file timestamps by setting them to the oldest date found, either for a single file or recursively through a directory.

    File: /tf/active/vicechatdev/mailsearch/fix_file_dates.py

    cli command-line argparse file-processing timestamp-management
  • function main_v10

    Command-line interface function that orchestrates PDF document analysis using OCR and LLM processing, with configurable input/output paths and processing limits.

    File: /tf/active/vicechatdev/mailsearch/document_analyzer.py

    cli command-line entry-point pdf-processing ocr
  • function main_v98

    Command-line application that uploads PDF files without WUXI coding from a local directory to a FileCloud server, with support for dry-run mode and customizable file patterns.

    File: /tf/active/vicechatdev/mailsearch/upload_non_wuxi_coded.py

    file-upload filecloud cli batch-processing pdf-management
  • function main_v103

    Asynchronous main entry point for a test suite that validates Mixed Cloud Processor functionality, including authentication, discovery, and dry-run operations for reMarkable and OneDrive integration.

    File: /tf/active/vicechatdev/e-ink-llm/test_mixed_mode.py

    async testing cloud-integration remarkable onedrive
  • function main_v68

    Async entry point for an E-Ink LLM Assistant that processes handwritten/drawn content using AI vision models, supporting local files, reMarkable Cloud, and OneDrive integration.

    File: /tf/active/vicechatdev/e-ink-llm/main.py

    async cli entry-point file-processing ai-vision
  • function test_placeholder_parsing

    A unit test function that validates the placeholder parsing functionality of the HybridResponseHandler class by testing its ability to extract and parse graphic placeholders from formatted text.

    File: /tf/active/vicechatdev/e-ink-llm/test_hybrid_mode.py

    testing unit-test placeholder-parsing text-processing graphics
  • function parse_directory_listing_debug

    A debug version of a directory listing parser that extracts and categorizes file entries with detailed console output for troubleshooting.

    File: /tf/active/vicechatdev/e-ink-llm/cloudtest/debug_rm_parsing.py

    parsing debug directory-listing file-system regex

Search Examples