Commit Graph

15 Commits

Author SHA1 Message Date
acano 5f21544e0b Refactor Elasticsearch ingestion pipeline and add MBPP generation script
- Updated `elasticsearch_ingestion.py` to streamline document processing and ingestion into Elasticsearch.
- Introduced `generate_mbap.py` for generating benchmark problems in AVAP language from a provided LRM.
- Created `prompts.py` to define prompts for converting Python problems to AVAP.
- Enhanced chunk processing in `chunk.py` to support markdown and AVAP documents.
- Added an `OllamaEmbeddings` class in `embeddings.py` for handling embeddings with an Ollama model.
- Updated dependencies in `uv.lock` to include new packages and versions.
2026-03-11 17:17:44 +01:00
acano 0ed7dfc653 Merge branch 'online' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-11 09:57:14 +01:00
acano 2ad09cc77f feat: Update dependencies and enhance Elasticsearch ingestion pipeline
- Added new dependencies including chonkie and markdown-it-py to requirements.txt.
- Refactored the Elasticsearch ingestion script to read and concatenate documents from specified folders.
- Implemented semantic chunking for documents using the chonkie library.
- Removed the old elasticsearch_ingestion_from_docs.py script as its functionality has been integrated into the main ingestion pipeline.
- Updated README.md to reflect new project structure and environment variables.
- Added a new changelog entry for version 1.4.0 detailing recent changes and enhancements.
2026-03-11 09:50:51 +01:00
rafa-ruiz 35ca56118d feat: add MBPP-style dataset generator and evaluation docs 2026-03-10 13:37:19 -07:00
acano a4267e1b60 feat: implement Elasticsearch ingestion pipeline and embedding factories 2026-03-05 16:26:22 +01:00
acano d951868200 refactor: Simplify Elasticsearch ingestion by removing chunk management module and integrating document building directly 2026-03-05 16:23:27 +01:00
acano 8914acbb95 Add comprehensive documentation for AVAP™ language features
- Introduced sections on Persistence, Connectors, and Native ORM, detailing the avapConnector, ORM commands, and data access abstraction.
- Documented System Utilities and Transformation, covering time management, string manipulation, and security operations.
- Explained Function Architecture and Scopes, including function definition, invocation, and middleware usage.
- Provided a Master Example that integrates various sections to demonstrate practical application.
- Detailed the dynamic nature of AVAP™ as a programming language, including dynamic typing and memory management.
- Established notation conventions and lexical analysis processes for code clarity and structure.
- Outlined data types and structures available in AVAP™, emphasizing their usage in program development.
- Discussed variable management, including local and global variables, and best practices for comments.
- Explained expressions in AVAP™, including types, operators, and practical examples with lists.
2026-03-05 15:54:05 +01:00
acano 97c5ea7ce5 Merge branch 'online' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-05 12:13:31 +01:00
acano 83ca902a59 fix: Correct file path for chunk management module in changelog 2026-03-05 11:44:52 +01:00
acano 51f42c52b3 refactor: Remove unused uuid import from chunks.py and update changelog for refactoring changes 2026-03-05 11:27:27 +01:00
pseco 34c13dceca chore: Update changelog for version 1.2.0 to include new factory modules, orchestration, and ingestion pipeline 2026-03-05 11:15:40 +01:00
pseco 183c04829c Update changelog for version 1.2.0: add new modules, refactor server integration, and enhance dependency management 2026-03-05 11:00:30 +01:00
rafa-ruiz aa9865f5d0 docs: add contribution standards, PR template and env vars policy 2026-03-03 13:19:42 -08:00
rafa-ruiz c862ea9194 docs: formalize repository standards and Docker build policies in README 2026-02-16 20:50:19 -08:00
rafa-ruiz be65aa977b Docs 2026-02-09 20:36:58 -08:00