assistance-engine

Commit Graph

Author	SHA1	Message	Date
pseco	56349184fb	Add evaluation results for AVAP knowledge models and update evaluation notebook - Created a new JSON file containing evaluation results for the AVAP knowledge models, including scores for faithfulness, answer relevancy, context recall, and context precision. - Updated the evaluation notebook to use a new embedding model and fixed execution counts for code cells.	2026-04-06 11:55:33 +02:00
pseco	00a7cc727d	Refactor code structure and remove redundant code blocks for improved readability and maintainability	2026-04-06 11:20:21 +02:00
pseco	26ffcc54d9	Refactor code structure for improved readability and maintainability	2026-03-31 13:57:15 +02:00
pseco	8f501d3e52	Refactor code structure for improved readability and maintainability	2026-03-31 11:16:03 +02:00
pseco	cd656b08a8	Update default dataset path in validate_synthetic_dataset.py to point to new output location	2026-03-30 10:04:28 +02:00
pseco	0cf2fc3aa7	Remove detailed print statements from fill rate analysis and retain only essential output	2026-03-27 14:14:00 +01:00
pseco	344230c2cf	Refactor code structure for improved readability and maintainability	2026-03-27 14:09:18 +01:00
pseco	1ee5f21c7c	Add BEIR analysis notebooks and evaluation pipeline for embedding models - Created `n00 Beir Analysis_cosqa.ipynb` for analyzing CoSQA dataset with BEIR. - Created `n00 first Analysis.ipynb` for initial analysis with embeddings. - Implemented `evaluate_embeddings_pipeline.py` to evaluate embedding models across CodexGlue, CoSQA, and SciFact benchmarks. - Added adapters for Ollama and HuggingFace embeddings to ensure compatibility with BEIR. - Enhanced error handling and data normalization in embedding processes. - Included functionality to load datasets from local cache or download if not present.	2026-03-26 09:37:37 +01:00
pseco	0d2cdd2190	Refactor AVAP dataset generation prompts and add synthetic data generation notebook - Introduced a new notebook for generating synthetic datasets for AVAP, including loading AVAP and MBPP data, and creating prompts for LLM interactions.	2026-03-25 17:07:00 +01:00
pseco	8878ca51e4	working on examples verification and testing on avap language server	2026-03-17 11:46:25 +01:00
pseco	c7adab24a6	working on synthetic dataset	2026-03-17 11:02:06 +01:00
pseco	4a2db004c0	adding ragas to dev	2026-02-26 09:45:42 +01:00
pseco	dfdf94f604	working on ragas	2026-02-25 17:28:35 +01:00

13 Commits