Commit Graph

74 Commits

Author SHA1 Message Date
pseco 8762eddef0 working on lark 2026-03-13 11:01:56 +01:00
pseco acc00adfaa Add AVAP execution and testing scripts
- Implemented parser for executing AVAP files within a Docker container (parser v1.py).
- Created a script to send AVAP code to a local server and handle responses (parser v2.py).
- Introduced a mock MBAP test harness to validate AVAP code against expected outputs (mbap_tester.py).
- Added transformation logic to convert AVAP code into Python-like syntax for testing purposes.
- Enhanced error handling and output formatting in the testing harness.
2026-03-12 15:56:36 +01:00
acano 0421a315eb Set default value of delete_es_index to False in elasticsearch_ingestion function 2026-03-11 17:39:25 +01:00
acano 5f21544e0b Refactor Elasticsearch ingestion pipeline and add MBPP generation script
- Updated `elasticsearch_ingestion.py` to streamline document processing and ingestion into Elasticsearch.
- Introduced `generate_mbap.py` for generating benchmark problems in AVAP language from a provided LRM.
- Created `prompts.py` to define prompts for converting Python problems to AVAP.
- Enhanced chunk processing in `chunk.py` to support markdown and AVAP documents.
- Added `OllamaEmbeddings` class in `embeddings.py` for handling embeddings with Ollama model.
- Updated dependencies in `uv.lock` to include new packages and versions.
2026-03-11 17:17:44 +01:00
pseco d04c149e66 workin on scratches bnf and parsing 2026-03-11 12:28:35 +01:00
pseco 3ac432567b BNF extraction pipeline from avap.md 2026-03-11 11:29:19 +01:00
pseco cd3922abbd modified config.py 2026-03-11 10:41:28 +01:00
acano 2ad09cc77f feat: Update dependencies and enhance Elasticsearch ingestion pipeline
- Added new dependencies including chonkie and markdown-it-py to requirements.txt.
- Refactored the Elasticsearch ingestion script to read and concatenate documents from specified folders.
- Implemented semantic chunking for documents using the chonkie library.
- Removed the old elasticsearch_ingestion_from_docs.py script as its functionality has been integrated into the main ingestion pipeline.
- Updated README.md to reflect new project structure and environment variables.
- Added a new changelog entry for version 1.4.0 detailing recent changes and enhancements.
2026-03-11 09:50:51 +01:00
pseco f5b2df94d2 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-10 14:40:32 +01:00
pseco 4c56dc29c4 Add initial Jupyter notebook for document ingestion using Ollama embeddings
- Implemented code to utilize OllamaEmbeddings for embedding documents.
- Included example usage with sample text inputs.
- Demonstrated response handling from the Ollama LLM.
- Noted deprecation warning for the Ollama class in LangChain.
2026-03-10 14:40:27 +01:00
acano bf3c7f36d8 feat(chunk): enhance file reading and processing logic
- Updated `read_files` function to return a list of dictionaries containing 'content' and 'title' keys.
- Added logic to handle concatenation of file contents and improved handling of file prefixes.
- Introduced `get_chunk_docs` function to chunk document contents using `SemanticChunker`.
- Added `convert_chunks_to_document` function to convert chunked content into `Document` objects.
- Integrated logging for chunking process.
- Updated dependencies in `uv.lock` to include `chonkie` and other related packages.
2026-03-10 14:36:09 +01:00
pseco f6bfba5561 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-09 15:04:23 +01:00
pseco 4afba7d89d working on scrappy 2026-03-09 15:00:07 +01:00
acano 6d856ba691 Add chunk.py for processing and replacing JavaScript references with Avap
- Implemented `replace_javascript_with_avap` function to handle text replacement.
- Created `read_concat_files` function to read and concatenate files with a specified prefix, replacing JavaScript markers.
- Added functionality to read files from a specified directory and process their contents.
2026-03-09 13:21:18 +01:00
pseco 423061f76d Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-09 09:35:53 +01:00
pseco 11e6ef71b1 working on agent pseco scratches 2026-03-09 09:35:49 +01:00
acano a434d34676 Updated docs 2026-03-06 11:38:06 +01:00
pseco 010270bf22 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-05 11:00:35 +01:00
pseco 183c04829c Update changelog for version 1.2.0: add new modules, refactor server integration, and enhance dependency management 2026-03-05 11:00:30 +01:00
acano d9d754bc6f Implement feature X to enhance user experience and optimize performance 2026-03-05 10:57:38 +01:00
acano 1549069f5a feat: Add Elasticsearch ingestion pipeline and document chunking functionality
- Implemented `elasticsearch_ingestion` function to handle document ingestion into Elasticsearch.
- Created `build_chunks_from_folder` function to read and clean text files, generating document chunks.
- Added logging for better traceability during the ingestion process.
- Updated `uv.lock` to include `boto3` as a new dependency.
2026-03-04 18:21:01 +01:00
pseco f15266f345 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-04 13:58:43 +01:00
pseco 9079674114 working on retrieve from ES 2026-03-04 13:58:38 +01:00
acano dcc07495e5 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-03 17:49:36 +01:00
acano 0538f3b5ce Refactor code structure for improved readability and maintainability 2026-03-03 17:49:27 +01:00
pseco 89316a9f6b Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-03 15:07:58 +01:00
pseco 63c5fc976f working on Dual Index 2026-03-03 15:07:53 +01:00
acano ff08d9a426 Refactor code structure for improved readability and maintainability 2026-03-03 14:38:55 +01:00
pseco 9575af3ff0 working on dual index 2026-03-03 12:01:03 +01:00
pseco 8297ae204c added lark to notebook 2026-03-03 09:39:09 +01:00
acano 5a666079a4 Refactor langgraph_agent_simple notebook execution counts and handle Langfuse client errors
- Set execution counts to null for initial cells in langgraph_agent_simple.ipynb
- Update execution counts for subsequent cells to maintain order
- Change output stream name from stdout to stderr for error handling
- Capture and log detailed error messages for failed Langfuse client authentication

Update uv.lock to manage accelerate dependency

- Remove accelerate from main dependencies
- Add accelerate to dev dependencies with version specification
- Adjust requires-dist section to reflect changes in dependency management
2026-03-02 14:07:29 +01:00
acano 48d280440c Refactor code structure for improved readability and maintainability 2026-02-27 14:45:33 +01:00
acano 10246a3046 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-02-27 08:41:10 +01:00
acano a70cd29b67 Refactor code structure for improved readability and maintainability 2026-02-27 08:39:39 +01:00
pseco e01e424fac workin on llm_factory 2026-02-26 18:02:46 +01:00
pseco 77751ee8ac working on langgraph agent v2 2026-02-26 11:35:00 +01:00
pseco 4a2db004c0 adding ragas to dev 2026-02-26 09:45:42 +01:00
pseco dfdf94f604 working on ragas 2026-02-25 17:28:35 +01:00
pseco 12eef38f33 count tokens files 2026-02-25 17:17:20 +01:00
pseco 71cb79985c added config and count tokens 2026-02-25 14:59:57 +01:00
pseco b01a76e71d evaluation on acano 2026-02-24 17:03:33 +01:00
pseco f6a907911d unstage changes 2026-02-24 16:59:57 +01:00
pseco cdd5f45ae1 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-02-24 16:59:38 +01:00
pseco d6ac7aa1ca working on evaluating embeddings 2026-02-24 16:53:37 +01:00
acano 4b5352d93c Refactor code structure for improved readability and maintainability 2026-02-24 16:24:32 +01:00
acano d4d7d9d2a1 Implement feature X to enhance user experience and optimize performance 2026-02-24 15:48:06 +01:00
acano 0d6c08e341 Add BEIR analysis notebook for CosQA and update dependencies
- Created a new Jupyter notebook for analyzing BEIR dataset with CosQA using Ollama embeddings.
- Implemented a custom embedding class to integrate LangChain's OllamaEmbeddings with BEIR.
- Added data loading and evaluation logic for the CosQA dataset.
- Updated `uv.lock` to remove unnecessary dependencies (`mteb` and `polars`) and incremented revision number.
2026-02-24 15:27:59 +01:00
pseco ff438ea6c4 update makefile 2026-02-24 14:52:48 +01:00
pseco 9b6726c232 working on evaluatin embeddings 2026-02-24 14:35:48 +01:00
pseco 8e852c5417 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-02-24 12:09:56 +01:00