lila/scripts/extraction-scripts/italian
lila 3374bd8b20 feat(scripts): add Italian CEFR data pipeline
- Add extractors for Italian sources: it_m3.xls and italian.json
- Add comparison script (compare-italian.py) to report source overlaps and conflicts
- Add merge script (merge-italian-json.py) with priority order ['italian', 'it_m3']
- Output authoritative dataset to datafiles/italian-merged.json
- Update README to document both English and Italian pipelines
2026-04-08 18:32:03 +02:00
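The priority order named in the commit message (['italian', 'it_m3']) suggests a first-source-wins merge. The following is a minimal sketch of that idea, not the contents of merge-italian-json.py: the function name `merge_by_priority`, the word-to-level mapping, and the sample entries are all assumptions made for illustration, since the real schemas of the extracted files are not shown here.

```python
from typing import Dict, List, Tuple

# Hypothetical shape: each source maps a word to a CEFR level such as "A1".
# The actual structure of the extracted Italian sources may differ.
Source = Dict[str, str]


def merge_by_priority(
    sources: Dict[str, Source], priority: List[str]
) -> Tuple[Dict[str, str], Dict[str, Dict[str, str]]]:
    """Merge sources so that names earlier in `priority` win on conflict.

    Returns the merged mapping plus a report of overridden values,
    keyed by word and then by the name of the lower-priority source.
    """
    merged: Dict[str, str] = {}
    conflicts: Dict[str, Dict[str, str]] = {}
    for name in priority:
        for word, level in sources.get(name, {}).items():
            if word not in merged:
                # First source (in priority order) to mention the word wins.
                merged[word] = level
            elif merged[word] != level:
                # A lower-priority source disagrees: keep the earlier value,
                # but record the disagreement for later review.
                conflicts.setdefault(word, {})[name] = level
    return merged, conflicts


if __name__ == "__main__":
    # Illustrative sample data only, not taken from the real datasets.
    sample = {
        "italian": {"casa": "A1", "tuttavia": "B1"},
        "it_m3": {"casa": "A2", "gatto": "A1"},
    }
    merged, conflicts = merge_by_priority(sample, ["italian", "it_m3"])
    print(merged)
    print(conflicts)
```

Under these assumptions, a word found in both sources keeps the level from 'italian', and the differing 'it_m3' value lands in the conflict report instead of being silently dropped.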
extract-it_m3.py        feat(scripts): add Italian CEFR data pipeline  2026-04-08 18:32:03 +02:00
extract-random-json.py  feat(scripts): add Italian CEFR data pipeline  2026-04-08 18:32:03 +02:00