assistance-engine

Commit Graph

Author	SHA1	Message	Date
pseco	04fa15ff1e	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-27 14:14:05 +01:00
pseco	0cf2fc3aa7	Remove detailed print statements from fill rate analysis and retain only essential output	2026-03-27 14:14:00 +01:00
acano	8df0b59f65	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-27 14:13:17 +01:00
pseco	344230c2cf	Refactor code structure for improved readability and maintainability	2026-03-27 14:09:18 +01:00
acano	d074ce32cc	Merge branch 'online' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-27 14:08:18 +01:00
pseco	1ee5f21c7c	Add BEIR analysis notebooks and evaluation pipeline for embedding models - Created `n00 Beir Analysis_cosqa.ipynb` for analyzing CoSQA dataset with BEIR. - Created `n00 first Analysis.ipynb` for initial analysis with embeddings. - Implemented `evaluate_embeddings_pipeline.py` to evaluate embedding models across CodexGlue, CoSQA, and SciFact benchmarks. - Added adapters for Ollama and HuggingFace embeddings to ensure compatibility with BEIR. - Enhanced error handling and data normalization in embedding processes. - Included functionality to load datasets from local cache or download if not present.	2026-03-26 09:37:37 +01:00
pseco	0d2cdd2190	Refactor AVAP dataset generation prompts and add synthetic data generation notebook - Introduced a new notebook for generating synthetic datasets for AVAP, including loading AVAP and MBPP data, and creating prompts for LLM interactions.	2026-03-25 17:07:00 +01:00
acano	21bc6fc3f0	feat: add embedding evaluation results and task processing notebook	2026-03-25 10:40:49 +01:00
acano	868a17523a	Merge online into mrh-online-dev	2026-03-19 11:25:36 +01:00
pseco	8878ca51e4	working on examples verification and testing on avap language server	2026-03-17 11:46:25 +01:00
pseco	80cdbcc38e	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-17 11:02:18 +01:00
pseco	c7adab24a6	working on synthetic dataset	2026-03-17 11:02:06 +01:00
acano	f343e0027b	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-16 13:21:38 +01:00
acano	fadf813494	Update Elasticsearch index version and enhance document processing - Changed Elasticsearch index from "avap-docs-test-v3" to "avap-docs-test-v4" in elasticsearch_ingestion.py. - Added Lark parser for AVAP code processing in chunk.py. - Enhanced metadata extraction for processed documents, including AST for AVAP files. - Improved error handling for AVAP code parsing.	2026-03-16 13:21:25 +01:00
pseco	8501988619	working on bnf	2026-03-16 09:57:36 +01:00
pseco	8762eddef0	working on lark	2026-03-13 11:01:56 +01:00
pseco	acc00adfaa	Add AVAP execution and testing scripts - Implemented parser for executing AVAP files within a Docker container (parser v1.py). - Created a script to send AVAP code to a local server and handle responses (parser v2.py). - Introduced a mock MBAP test harness to validate AVAP code against expected outputs (mbap_tester.py). - Added transformation logic to convert AVAP code into Python-like syntax for testing purposes. - Enhanced error handling and output formatting in the testing harness.	2026-03-12 15:56:36 +01:00
acano	90ba49d613	refactor: remove unused test scripts for Elasticsearch integration	2026-03-12 12:32:05 +01:00
acano	0421a315eb	Set default value of delete_es_index to False in elasticsearch_ingestion function	2026-03-11 17:39:25 +01:00
acano	5f21544e0b	Refactor Elasticsearch ingestion pipeline and add MBPP generation script - Updated `elasticsearch_ingestion.py` to streamline document processing and ingestion into Elasticsearch. - Introduced `generate_mbap.py` for generating benchmark problems in AVAP language from a provided LRM. - Created `prompts.py` to define prompts for converting Python problems to AVAP. - Enhanced chunk processing in `chunk.py` to support markdown and AVAP documents. - Added `OllamaEmbeddings` class in `embeddings.py` for handling embeddings with Ollama model. - Updated dependencies in `uv.lock` to include new packages and versions.	2026-03-11 17:17:44 +01:00
pseco	d04c149e66	workin on scratches bnf and parsing	2026-03-11 12:28:35 +01:00
pseco	3ac432567b	BNF extraction pipeline from avap.md	2026-03-11 11:29:19 +01:00
pseco	cd3922abbd	modified config.py	2026-03-11 10:41:28 +01:00
acano	2ad09cc77f	feat: Update dependencies and enhance Elasticsearch ingestion pipeline - Added new dependencies including chonkie and markdown-it-py to requirements.txt. - Refactored the Elasticsearch ingestion script to read and concatenate documents from specified folders. - Implemented semantic chunking for documents using the chonkie library. - Removed the old elasticsearch_ingestion_from_docs.py script as its functionality has been integrated into the main ingestion pipeline. - Updated README.md to reflect new project structure and environment variables. - Added a new changelog entry for version 1.4.0 detailing recent changes and enhancements.	2026-03-11 09:50:51 +01:00
pseco	f5b2df94d2	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-10 14:40:32 +01:00
pseco	4c56dc29c4	Add initial Jupyter notebook for document ingestion using Ollama embeddings - Implemented code to utilize OllamaEmbeddings for embedding documents. - Included example usage with sample text inputs. - Demonstrated response handling from the Ollama LLM. - Noted deprecation warning for the Ollama class in LangChain.	2026-03-10 14:40:27 +01:00
acano	bf3c7f36d8	feat(chunk): enhance file reading and processing logic - Updated `read_files` function to return a list of dictionaries containing 'content' and 'title' keys. - Added logic to handle concatenation of file contents and improved handling of file prefixes. - Introduced `get_chunk_docs` function to chunk document contents using `SemanticChunker`. - Added `convert_chunks_to_document` function to convert chunked content into `Document` objects. - Integrated logging for chunking process. - Updated dependencies in `uv.lock` to include `chonkie` and other related packages.	2026-03-10 14:36:09 +01:00
pseco	f6bfba5561	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-09 15:04:23 +01:00
pseco	4afba7d89d	working on scrappy	2026-03-09 15:00:07 +01:00
acano	6d856ba691	Add chunk.py for processing and replacing JavaScript references with Avap - Implemented `replace_javascript_with_avap` function to handle text replacement. - Created `read_concat_files` function to read and concatenate files with a specified prefix, replacing JavaScript markers. - Added functionality to read files from a specified directory and process their contents.	2026-03-09 13:21:18 +01:00
pseco	423061f76d	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-09 09:35:53 +01:00
pseco	11e6ef71b1	working on agent pseco scratches	2026-03-09 09:35:49 +01:00
acano	a434d34676	Updated docs	2026-03-06 11:38:06 +01:00
pseco	010270bf22	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-05 11:00:35 +01:00
pseco	183c04829c	Update changelog for version 1.2.0: add new modules, refactor server integration, and enhance dependency management	2026-03-05 11:00:30 +01:00
acano	d9d754bc6f	Implement feature X to enhance user experience and optimize performance	2026-03-05 10:57:38 +01:00
acano	1549069f5a	feat: Add Elasticsearch ingestion pipeline and document chunking functionality - Implemented `elasticsearch_ingestion` function to handle document ingestion into Elasticsearch. - Created `build_chunks_from_folder` function to read and clean text files, generating document chunks. - Added logging for better traceability during the ingestion process. - Updated `uv.lock` to include `boto3` as a new dependency.	2026-03-04 18:21:01 +01:00
pseco	f15266f345	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-04 13:58:43 +01:00
pseco	9079674114	working on retrieve from ES	2026-03-04 13:58:38 +01:00
acano	dcc07495e5	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-03 17:49:36 +01:00
acano	0538f3b5ce	Refactor code structure for improved readability and maintainability	2026-03-03 17:49:27 +01:00
pseco	89316a9f6b	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-03-03 15:07:58 +01:00
pseco	63c5fc976f	working on Dual Index	2026-03-03 15:07:53 +01:00
acano	ff08d9a426	Refactor code structure for improved readability and maintainability	2026-03-03 14:38:55 +01:00
pseco	9575af3ff0	working on dual index	2026-03-03 12:01:03 +01:00
pseco	8297ae204c	added lark to notebook	2026-03-03 09:39:09 +01:00
acano	5a666079a4	Refactor langgraph_agent_simple notebook execution counts and handle Langfuse client errors - Set execution counts to null for initial cells in langgraph_agent_simple.ipynb - Update execution counts for subsequent cells to maintain order - Change output stream name from stdout to stderr for error handling - Capture and log detailed error messages for failed Langfuse client authentication Update uv.lock to manage accelerate dependency - Remove accelerate from main dependencies - Add accelerate to dev dependencies with version specification - Adjust requires-dist section to reflect changes in dependency management	2026-03-02 14:07:29 +01:00
acano	48d280440c	Refactor code structure for improved readability and maintainability	2026-02-27 14:45:33 +01:00
acano	10246a3046	Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev	2026-02-27 08:41:10 +01:00
acano	a70cd29b67	Refactor code structure for improved readability and maintainability	2026-02-27 08:39:39 +01:00

1 2

90 Commits