Commit Graph

12 Commits

Author SHA1 Message Date
acano 752bf9c7d9 Update Elasticsearch index version and modify imports in ingestion and translation scripts
- Changed Elasticsearch index from "avap-docs-test-v3" to "avap-docs-test-v4" in elasticsearch_ingestion.py.
- Removed unused import SystemMessage from langchain_core.messages in translate_mbpp.py.
- Added import for Lark in chunk.py to support new functionality.
2026-03-19 11:30:00 +01:00
acano 868a17523a Merge online into mrh-online-dev 2026-03-19 11:25:36 +01:00
acano fadf813494 Update Elasticsearch index version and enhance document processing
- Changed Elasticsearch index from "avap-docs-test-v3" to "avap-docs-test-v4" in elasticsearch_ingestion.py.
- Added Lark parser for AVAP code processing in chunk.py.
- Enhanced metadata extraction for processed documents, including AST for AVAP files.
- Improved error handling for AVAP code parsing.
2026-03-16 13:21:25 +01:00
pseco acc00adfaa Add AVAP execution and testing scripts
- Implemented parser for executing AVAP files within a Docker container (parser v1.py).
- Created a script to send AVAP code to a local server and handle responses (parser v2.py).
- Introduced a mock MBAP test harness to validate AVAP code against expected outputs (mbap_tester.py).
- Added transformation logic to convert AVAP code into Python-like syntax for testing purposes.
- Enhanced error handling and output formatting in the testing harness.
2026-03-12 15:56:36 +01:00
acano 654ac88da7 feat: Enhance Elasticsearch ingestion with metadata export
- Added `export_documents` function to save processed documents to JSON.
- Extended `ElasticHandshake` to include chunk metadata during ingestion.
- Updated `process_documents` to include extra metadata for each chunk.
- Modified `ingest_documents` to return Elasticsearch responses for further processing.
- Adjusted `elasticsearch_ingestion` command to accept output path for exported JSON.
2026-03-12 12:28:17 +01:00
acano ed25f15542 feat: Enhance Elasticsearch ingestion process with metadata export
- Added output path parameter to elasticsearch_ingestion command for exporting processed documents.
- Implemented ElasticHandshakeWithMetadata class to preserve chunk metadata during ingestion.
- Updated process_documents function to include extra metadata for each chunk.
- Modified ingest_documents function to return Elasticsearch response for each chunk.
- Introduced export_documents function to save processed documents as JSON files.
2026-03-12 12:26:47 +01:00
acano de21bcb5fb Refactor code structure for improved readability and maintainability 2026-03-11 17:48:54 +01:00
pseco d04c149e66 workin on scratches bnf and parsing 2026-03-11 12:28:35 +01:00
pseco 3ac432567b BNF extraction pipeline from avap.md 2026-03-11 11:29:19 +01:00
acano a434d34676 Updated docs 2026-03-06 11:38:06 +01:00
acano 8914acbb95 Add comprehensive documentation for AVAP™ language features
- Introduced sections on Persistence, Connectors, and Native ORM, detailing the avapConnector, ORM commands, and data access abstraction.
- Documented System Utilities and Transformation, covering time management, string manipulation, and security operations.
- Explained Function Architecture and Scopes, including function definition, invocation, and middleware usage.
- Provided a Master Example that integrates various sections to demonstrate practical application.
- Detailed the dynamic nature of AVAP™ as a programming language, including dynamic typing and memory management.
- Established notation conventions and lexical analysis processes for code clarity and structure.
- Outlined data types and structures available in AVAP™, emphasizing their usage in program development.
- Discussed variable management, including local and global variables, and best practices for comments.
- Explained expressions in AVAP™, including types, operators, and practical examples with lists.
2026-03-05 15:54:05 +01:00
acano 1549069f5a feat: Add Elasticsearch ingestion pipeline and document chunking functionality
- Implemented `elasticsearch_ingestion` function to handle document ingestion into Elasticsearch.
- Created `build_chunks_from_folder` function to read and clean text files, generating document chunks.
- Added logging for better traceability during the ingestion process.
- Updated `uv.lock` to include `boto3` as a new dependency.
2026-03-04 18:21:01 +01:00