smallevals — CPU-fast, GPU-blazing fast offline retrieval evaluation for RAG systems with tiny QA models.
-
Updated
Dec 4, 2025 - Python
smallevals — CPU-fast, GPU-blazing fast offline retrieval evaluation for RAG systems with tiny QA models.
CTR/ranking fundamentals practice with feature crossing, Logistic Regression baselines, AUC/LogLoss/nDCG notes and reproducible evaluation scripts.
Algorithm Intern Candidate | Recommendation / Search Retrieval / Ranking | PyTorch + Faiss | Offline Evaluation / Negative Sampling / Badcase Analysis
Offline prototype for intent-aware queue adaptation in music recommendation systems
Offline RAG retrieval-quality harness. Recall@k, nDCG, MRR, chunking diagnostics, regression diffs. No LLM-as-judge required. CI-friendly.
a new method for offline evaluation of recommender systems
Neural Thompson Sampling contextual bandit for personalized Type 2 Diabetes therapy selection — training pipeline, offline policy evaluation (IPS/SNIPS/DM/DR), safety gates, drift monitoring, and LLM-generated clinical explanations.
可复现的中文离线内容推荐应用原型,覆盖合成行为数据、动态兴趣画像、双路召回、个性化排序、多样性重排、推荐解释与离线评估。
Spotify-style music discovery platform with Spotify OAuth, hybrid recommendation, ALS, Word2Vec-style embeddings, explainable recommendations, and offline evaluation.
Production-ready recommender system suite: serving API, pipelines, algorithm SDK, and evaluation tooling.
Add a description, image, and links to the offline-evaluation topic page so that developers can more easily learn about it.
To associate your repository with the offline-evaluation topic, visit your repo's landing page and select "manage topics."