{
  "generated_at": "2026-04-06T10:39:53.540150",
  "embedding_model": "qwen3-0.6B-emb:latest",
  "index_name": "avap-knowledge-v2-qwen",
  "result": {
    "status": "ok",
    "questions_evaluated": 5,
    "elapsed_seconds": 75.3,
    "judge_model": "claude-sonnet-4-20250514",
    "index": "avap-knowledge-v2-qwen",
    "category_filter": "all",
    "scores": {
      "faithfulness": 0.5963,
      "answer_relevancy": 0.8799,
      "context_recall": 0.5615,
      "context_precision": 0.5333
    },
    "global_score": 0.6428,
    "verdict": "ACCEPTABLE",
    "details": [
      {
        "id": "GD-001",
        "category": "RETRIEVAL",
        "question": "What is AVAP and what is it designed for?",
        "answer_preview": "AVAP (Advanced Virtual API Programming) is a Domain-Specific Language (DSL) that is Turing Complete. It was designed arquitectonically to orchestrate secure, concurrent, and deterministic microservices and I/O operations in an efficient manner.\n\nKey points about AVAP:\n- **Turing Completeness**: It c",
        "n_chunks": 5
      },
      {
        "id": "GD-002",
        "category": "RETRIEVAL",
        "question": "How does AVAP handle conditional logic? What commands are used and how are blocks closed?",
        "answer_preview": "AVAP handles conditional logic using the `if()` command, which can be invoked in two modes:\n\n1. **Mode 1 (Structured Comparison):** The syntax is `if(variable, value, comparison_operator)`. This mode evaluates a comparison between the variable and the value provided as strings (e.g., `\"==\"`, `\">\"`, ",
        "n_chunks": 5
      },
      {
        "id": "GD-003",
        "category": "CODE_GENERATION",
        "question": "Write an AVAP script that reads a 'password' parameter, generates a SHA-256 hash of it, and returns the hash.",
        "answer_preview": "```avap\naddParam(\"password\", password)\nencodeSHA256(password, hashed_password)\nreturn hashed_password\n```",
        "n_chunks": 5
      },
      {
        "id": "GD-004",
        "category": "CODE_GENERATION",
        "question": "Show an AVAP script that loops from 1 to 5, builds a JSON object with each iteration index as a key, and returns it.",
        "answer_preview": "```avap\nx = 10\ny = 20\n\njson_obj = {}\n\nfor i in range(1, 6):\n json_obj[i] = {\"index\": i}\n\nreturn json_obj\n```\n\nThis script initializes `x` and `y`, then builds a JSON object where each key is an iteration index from 1 to 5. The function returns this constructed JSON object at the end.",
        "n_chunks": 5
      },
      {
        "id": "GD-005",
        "category": "RETRIEVAL",
        "question": "How does AVAP support external HTTP calls? What commands are available and how is timeout handled?",
        "answer_preview": "AVAP supports external HTTP calls through the `RequestPost` and `RequestGet` functions. These functions allow for POST and GET requests, respectively, to be executed on a specified URL with optional query parameters, headers, body data, and destination file.\n\nTimeout handling in AVAP is managed by t",
        "n_chunks": 5
      }
    ]
  }
}