Commit Graph

236 Commits

Author SHA1 Message Date
pseco febf955a62 Add new JSON output files for candidate F reward statistics and MBPP tasks
- Created `candidate_F_reward_10_coverage_stats.json` with coverage statistics including total cells, filled cells, fill rate, and node type frequency.
- Added `mbpp_avap.json` containing 14 tasks with descriptions, code implementations, test inputs, and expected test results for various endpoints and functionalities.
2026-03-26 17:18:45 +01:00
acano d50f33c707 Merge branch 'online' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-26 09:37:57 +01:00
pseco 1ee5f21c7c Add BEIR analysis notebooks and evaluation pipeline for embedding models
- Created `n00 Beir Analysis_cosqa.ipynb` for analyzing CoSQA dataset with BEIR.
- Created `n00 first Analysis.ipynb` for initial analysis with embeddings.
- Implemented `evaluate_embeddings_pipeline.py` to evaluate embedding models across CodexGlue, CoSQA, and SciFact benchmarks.
- Added adapters for Ollama and HuggingFace embeddings to ensure compatibility with BEIR.
- Enhanced error handling and data normalization in embedding processes.
- Included functionality to load datasets from local cache or download if not present.
2026-03-26 09:37:37 +01:00
rafa-ruiz ccd9073a52 feat(dataset): add ADR-0006 and scaffold reward algorithm pipeline 2026-03-25 22:19:19 -07:00
pseco 0d2cdd2190 Refactor AVAP dataset generation prompts and add synthetic data generation notebook
- Introduced a new notebook for generating synthetic datasets for AVAP, including loading AVAP and MBPP data, and creating prompts for LLM interactions.
2026-03-25 17:07:00 +01:00
pseco d7f895804c Refactor code structure for improved readability and maintainability 2026-03-25 10:53:38 +01:00
pseco 71eb85cc89 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-25 10:50:13 +01:00
pseco 0b309bfa69 feat: add evaluation results for bge-m3 and qwen3-0.6B-emb models 2026-03-25 10:46:02 +01:00
acano b2e5d06d96 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-25 10:41:26 +01:00
acano 21bc6fc3f0 feat: add embedding evaluation results and task processing notebook 2026-03-25 10:40:49 +01:00
acano da483c51bb created code_indexing_improvements research 2026-03-25 10:37:53 +01:00
acano fe90548b8b added ast tree metadata 2026-03-25 10:36:18 +01:00
acano dc8230c872 feat: add ANTHROPIC_API_KEY and ANTHROPIC_MODEL to docker-compose environment 2026-03-25 10:30:00 +01:00
acano bd542bb14d Continued ADR-0005 and created ADR-0006 2026-03-25 10:27:41 +01:00
acano 1442a632c9 fixed avap examples (not coherent with official avap bnf rules) 2026-03-25 10:26:47 +01:00
pseco cbce3ae530 Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-25 10:06:11 +01:00
pseco 9b1a0e54d5 updated pipeline to only download files when missing 2026-03-25 10:06:07 +01:00
acano 017a89322a feat: update dependencies in pyproject.toml and uv.lock 2026-03-25 10:01:17 +01:00
Rafael Ruiz f9b2b014bb
Merge pull request #59 from BRUNIX-AI/mrh-online-dev-partial
Added embeddings research
2026-03-24 06:38:59 -07:00
pseco 2a33f8eb06 bge-m3 and qwen3-emb comparison 2026-03-23 15:59:44 +01:00
pseco b574517340 working on ADR0005 2026-03-23 13:17:50 +01:00
pseco 185ea276b7 updated cosqa notebook 2026-03-23 10:36:10 +01:00
acano ec57e52dea Merge branch 'online' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-23 09:12:20 +01:00
rafa-ruiz 59c1748594 feat: editor context injection (PRD-0002) + repository governance 2026-03-20 19:43:48 -07:00
rafa-ruiz 2fbfad41df feat: editor context injection (PRD-0002) + repository governance 2026-03-20 19:25:29 -07:00
acano 14b279b8af Refactor code structure for improved readability and maintainability 2026-03-19 16:40:57 +01:00
acano 27cfbaf257 Refactor code structure for improved readability and maintainability 2026-03-19 16:37:42 +01:00
acano 16acdfb1f3 DOCS: Add research directory description in README 2026-03-19 16:30:44 +01:00
acano b6f9b386ef Add BEIR analysis notebooks for different datasets and models
- Created `n00 Beir Analysis.ipynb` for analyzing BEIR dataset with Ollama embeddings.
- Added `n00 Beir Analysis_cosqa.ipynb` for evaluating the CosQA dataset using similar methods.
- Introduced `n00 first Analysis.ipynb` for initial analysis with Ragas embeddings and semantic similarity evaluation.
- Implemented data loading, processing, and evaluation metrics for each notebook.
- Included functionality to save results to JSON files for further analysis.
2026-03-19 16:27:25 +01:00
acano dd3bde2ec9 Add BEIR analysis notebooks for different datasets and models
- Created `n00 Beir Analysis.ipynb` for analyzing BEIR dataset with Ollama embeddings.
- Added `n00 Beir Analysis_cosqa.ipynb` for evaluating the CosQA dataset using similar embedding techniques.
- Introduced `n00 first Analysis.ipynb` for initial analysis with Ragas embeddings and semantic similarity evaluation.
- Implemented data loading and processing for each notebook, including downloading datasets and saving results.
- Included evaluation metrics such as NDCG, MAP, Recall, and Precision for model performance assessment.
2026-03-19 16:24:34 +01:00
acano 752bf9c7d9 Update Elasticsearch index version and modify imports in ingestion and translation scripts
- Changed Elasticsearch index from "avap-docs-test-v3" to "avap-docs-test-v4" in elasticsearch_ingestion.py.
- Removed unused import SystemMessage from langchain_core.messages in translate_mbpp.py.
- Added import for Lark in chunk.py to support new functionality.
2026-03-19 11:30:00 +01:00
acano 868a17523a Merge online into mrh-online-dev 2026-03-19 11:25:36 +01:00
Rafael Ruiz 3ca8fc450c
Merge pull request #58 from BRUNIX-AI/online-fork
Online fork
2026-03-18 19:13:30 -07:00
Rafael Ruiz 52d381fa22
Merge branch 'main' into online-fork 2026-03-18 19:09:47 -07:00
rafa-ruiz fda47edae0 UPGRADE: New RAG functional 2026-03-18 18:56:01 -07:00
rafa-ruiz 1daac66f89 UPGRADE: New RAG functional 2026-03-18 18:55:48 -07:00
pseco 8878ca51e4 working on examples verification and testing on avap language server 2026-03-17 11:46:25 +01:00
pseco 80cdbcc38e Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-17 11:02:18 +01:00
pseco c7adab24a6 working on synthetic dataset 2026-03-17 11:02:06 +01:00
acano f343e0027b Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-16 13:21:38 +01:00
acano fadf813494 Update Elasticsearch index version and enhance document processing
- Changed Elasticsearch index from "avap-docs-test-v3" to "avap-docs-test-v4" in elasticsearch_ingestion.py.
- Added Lark parser for AVAP code processing in chunk.py.
- Enhanced metadata extraction for processed documents, including AST for AVAP files.
- Improved error handling for AVAP code parsing.
2026-03-16 13:21:25 +01:00
acano ed466b123d feat: Add llama-cpp-python and tenacity to dependencies 2026-03-16 13:19:09 +01:00
pseco 8501988619 working on bnf 2026-03-16 09:57:36 +01:00
acano ab1022d8b6 feat: Implement ElasticHandshakeWithMetadata to preserve chunk metadata in Elasticsearch 2026-03-13 11:02:32 +01:00
pseco 8aa12bd8eb Merge branch 'mrh-online-dev' of github.com:BRUNIX-AI/assistance-engine into mrh-online-dev 2026-03-13 11:02:02 +01:00
pseco 8762eddef0 working on lark 2026-03-13 11:01:56 +01:00
Rafael Ruiz dfcbf43fa2
Merge pull request #55 from BRUNIX-AI/mrh-online-dev-partial
Refactored ingestion pipeline and created AVAP synthethic dataset generation pipeline
2026-03-12 11:22:28 -07:00
acano f986c70845 fix: load environment variables and add elasticsearch_index to Settings class 2026-03-12 17:37:22 +01:00
acano e744d9f0cd feat: Load environment variables and add elasticsearch_index to Settings class 2026-03-12 17:36:28 +01:00
acano 9a435120d5 Merge branch 'online' into mrh-online-dev-partial 2026-03-12 17:09:00 +01:00