Commit Graph

12 Commits

Author SHA1 Message Date
pseco 00a7cc727d Refactor code structure and remove redundant code blocks for improved readability and maintainability 2026-04-06 11:20:21 +02:00
pseco 26ffcc54d9 Refactor code structure for improved readability and maintainability 2026-03-31 13:57:15 +02:00
pseco 8f501d3e52 Refactor code structure for improved readability and maintainability 2026-03-31 11:16:03 +02:00
pseco cd656b08a8 Update default dataset path in validate_synthetic_dataset.py to point to new output location 2026-03-30 10:04:28 +02:00
pseco 0cf2fc3aa7 Remove detailed print statements from fill rate analysis and retain only essential output 2026-03-27 14:14:00 +01:00
pseco 344230c2cf Refactor code structure for improved readability and maintainability 2026-03-27 14:09:18 +01:00
pseco 1ee5f21c7c Add BEIR analysis notebooks and evaluation pipeline for embedding models
- Created `n00 Beir Analysis_cosqa.ipynb` for analyzing CoSQA dataset with BEIR.
- Created `n00 first Analysis.ipynb` for initial analysis with embeddings.
- Implemented `evaluate_embeddings_pipeline.py` to evaluate embedding models across CodexGlue, CoSQA, and SciFact benchmarks.
- Added adapters for Ollama and HuggingFace embeddings to ensure compatibility with BEIR.
- Enhanced error handling and data normalization in embedding processes.
- Included functionality to load datasets from local cache or download if not present.
2026-03-26 09:37:37 +01:00
pseco 0d2cdd2190 Refactor AVAP dataset generation prompts and add synthetic data generation notebook
- Introduced a new notebook for generating synthetic datasets for AVAP, including loading AVAP and MBPP data, and creating prompts for LLM interactions.
2026-03-25 17:07:00 +01:00
pseco 8878ca51e4 working on examples verification and testing on avap language server 2026-03-17 11:46:25 +01:00
pseco c7adab24a6 working on synthetic dataset 2026-03-17 11:02:06 +01:00
pseco 4a2db004c0 adding ragas to dev 2026-02-26 09:45:42 +01:00
pseco dfdf94f604 working on ragas 2026-02-25 17:28:35 +01:00