Commit Graph

19 Commits

Author SHA1 Message Date
acano 5f21544e0b Refactor Elasticsearch ingestion pipeline and add MBPP generation script
- Updated `elasticsearch_ingestion.py` to streamline document processing and ingestion into Elasticsearch.
- Introduced `generate_mbap.py` for generating benchmark problems in AVAP language from a provided LRM.
- Created `prompts.py` to define prompts for converting Python problems to AVAP.
- Enhanced chunk processing in `chunk.py` to support markdown and AVAP documents.
- Added `OllamaEmbeddings` class in `embeddings.py` for handling embeddings with Ollama model.
- Updated dependencies in `uv.lock` to include new packages and versions.
2026-03-11 17:17:44 +01:00
acano 0ed7dfc653 Merge branch 'online' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-11 09:57:14 +01:00
acano 2ad09cc77f feat: Update dependencies and enhance Elasticsearch ingestion pipeline
- Added new dependencies including chonkie and markdown-it-py to requirements.txt.
- Refactored the Elasticsearch ingestion script to read and concatenate documents from specified folders.
- Implemented semantic chunking for documents using the chonkie library.
- Removed the old elasticsearch_ingestion_from_docs.py script as its functionality has been integrated into the main ingestion pipeline.
- Updated README.md to reflect new project structure and environment variables.
- Added a new changelog entry for version 1.4.0 detailing recent changes and enhancements.
2026-03-11 09:50:51 +01:00
rafa-ruiz 35ca56118d feat: add MBPP-style dataset generator and evaluation docs 2026-03-10 13:37:19 -07:00
acano 6692990a38 docs: Update function definition syntax in AVAP™ documentation for clarity 2026-03-06 08:50:54 +01:00
acano a4267e1b60 feat: implement Elasticsearch ingestion pipeline and embedding factories 2026-03-05 16:26:22 +01:00
acano d951868200 refactor: Simplify Elasticsearch ingestion by removing chunk management module and integrating document building directly 2026-03-05 16:23:27 +01:00
acano 8914acbb95 Add comprehensive documentation for AVAP™ language features
- Introduced sections on Persistence, Connectors, and Native ORM, detailing the avapConnector, ORM commands, and data access abstraction.
- Documented System Utilities and Transformation, covering time management, string manipulation, and security operations.
- Explained Function Architecture and Scopes, including function definition, invocation, and middleware usage.
- Provided a Master Example that integrates various sections to demonstrate practical application.
- Detailed the dynamic nature of AVAP™ as a programming language, including dynamic typing and memory management.
- Established notation conventions and lexical analysis processes for code clarity and structure.
- Outlined data types and structures available in AVAP™, emphasizing their usage in program development.
- Discussed variable management, including local and global variables, and best practices for comments.
- Explained expressions in AVAP™, including types, operators, and practical examples with lists.
2026-03-05 15:54:05 +01:00
acano 31206e8fce refactor: Update project structure in README to enhance clarity and organization 2026-03-05 15:16:50 +01:00
acano 1549069f5a feat: Add Elasticsearch ingestion pipeline and document chunking functionality
- Implemented `elasticsearch_ingestion` function to handle document ingestion into Elasticsearch.
- Created `build_chunks_from_folder` function to read and clean text files, generating document chunks.
- Added logging for better traceability during the ingestion process.
- Updated `uv.lock` to include `boto3` as a new dependency.
2026-03-04 18:21:01 +01:00
Rafael Ruiz 86f426ae48
Fix formatting of secret values warning in README 2026-03-03 13:35:27 -08:00
rafa-ruiz aa9865f5d0 docs: add contribution standards, PR template and env vars policy 2026-03-03 13:19:42 -08:00
rafa-ruiz c862ea9194 docs: formalize repository standards and Docker build policies in README 2026-02-16 20:50:19 -08:00
pseco 7979061a36 Merge branch 'mrh-online-proposal' of github.com:BRUNIX-AI/assistance-engine into mrh-online-proposal 2026-02-16 14:59:23 +01:00
izapata 7cdaf5a0c5 feat: update README and add start-tunnels.sh script for infrastructure setup 2026-02-16 14:50:55 +01:00
pseco ab48d27ddb adding kubectl 2026-02-16 14:14:34 +01:00
rafa-ruiz 8d1de81e47 update 2026-02-11 00:41:39 -08:00
rafa-ruiz 4cf171c8a2 update 2026-02-11 00:13:13 -08:00
rafa-ruiz be65aa977b Docs 2026-02-09 20:36:58 -08:00