lila/data-pipeline
lila 73fb12ac35 feat: enrich script working, redesigning to sub-stage architecture
- Enrich script functional with timeout, progress tracking, rejection mechanism
- Identified ordering issue: CEFR voting needs validated translations first
- Redesign: round1_gloss → round1_example → round1_translations → round1_cefr
- Update data-pipeline.md with new sub-stage design and roadmap
- Qwen3.5-4B confirmed working with thinking disabled
2026-05-07 13:09:43 +02:00
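
The ordering constraint in the commit message above (CEFR voting has to wait for validated translations) can be illustrated with a minimal sketch. Only the four sub-stage names come from the commit message. The `SubStage` type, `SUB_STAGE_ORDER`, `nextSubStage`, and `canRun` are hypothetical names introduced here for illustration and are not taken from the repository's `pipeline.ts`.

```typescript
// Sub-stage names as listed in the commit message; all other identifiers are hypothetical.
type SubStage =
  | "round1_gloss"
  | "round1_example"
  | "round1_translations"
  | "round1_cefr";

// Fixed execution order: CEFR voting comes last so it can rely on validated translations.
const SUB_STAGE_ORDER: readonly SubStage[] = [
  "round1_gloss",
  "round1_example",
  "round1_translations",
  "round1_cefr",
];

// Returns the first sub-stage that has not completed yet, or null when all are done.
function nextSubStage(completed: ReadonlySet<SubStage>): SubStage | null {
  for (const stage of SUB_STAGE_ORDER) {
    if (!completed.has(stage)) return stage;
  }
  return null;
}

// A sub-stage may run only when every earlier sub-stage has completed.
function canRun(stage: SubStage, completed: ReadonlySet<SubStage>): boolean {
  const index = SUB_STAGE_ORDER.indexOf(stage);
  return SUB_STAGE_ORDER.slice(0, index).every((s) => completed.has(s));
}
```

Under this assumed design, an orchestrator would call `nextSubStage` in a loop and refuse to start `round1_cefr` while `canRun` returns false, which is one way to enforce the ordering issue identified in the commit.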
db feat: enrich script working, redesigning to sub-stage architecture 2026-05-07 13:09:43 +02:00
sample feat: add db schema, init, and vitest config 2026-05-03 17:56:29 +02:00
stage-1-extract/scripts feat: update extractor for all 5 languages, update import for multi-language 2026-05-05 18:46:32 +02:00
stage-2-reverse-link/scripts feat: add stage 2 reverse link sync script 2026-05-05 18:57:55 +02:00
stage-3-enrich feat: enrich script working, redesigning to sub-stage architecture 2026-05-07 13:09:43 +02:00
tests/validation fix: update db import validation tests to account for reverse links 2026-05-05 19:10:19 +02:00
.env.example feat: enrich stage foundation — provider config, env setup, schema fix 2026-05-03 22:44:14 +02:00
audit.ts docs: rewrite data-pipeline.md for Kaikki migration 2026-05-05 17:14:48 +02:00
package.json feat: add stage 2 reverse link sync script 2026-05-05 18:57:55 +02:00
pipeline.ts feat: add stage 3 round 1 enrich script and wire into orchestrator 2026-05-05 19:28:38 +02:00
tsconfig.json feat: add db schema, init, and vitest config 2026-05-03 17:56:29 +02:00
vitest.config.ts feat: add stage 1 and 2 validation tests 2026-05-03 21:36:56 +02:00