- Add extractors for Italian sources: it_m3.xls and italian.json
- Add comparison script (compare-italian.py) to report source overlaps and conflicts
- Add merge script (merge-italian-json.py) with priority order ['italian', 'it_m3']
- Output authoritative dataset to datafiles/italian-merged.json
- Update README to document both English and Italian pipelines

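The priority-ordered merge named in the commit message could look roughly like the sketch below. It is not the contents of merge-italian-json.py: the function name `merge_by_priority`, the assumption that each source is a flat mapping from a word to a single value, and the sample entries are all hypothetical and only illustrate how an earlier source in the priority list would win over a later one while conflicts are still recorded.

```python
from typing import Any

# Priority order as stated in the commit message; earlier names win.
PRIORITY = ["italian", "it_m3"]


def merge_by_priority(
    sources: dict[str, dict[str, Any]],
    priority: list[str],
) -> tuple[dict[str, Any], dict[str, dict[str, Any]]]:
    """Merge per-source mappings, keeping the highest-priority value per key.

    Returns (merged, conflicts), where conflicts maps each disputed key to
    the lower-priority values that were overridden, keyed by source name.
    """
    merged: dict[str, Any] = {}
    conflicts: dict[str, dict[str, Any]] = {}
    for name in priority:
        # A source missing from `sources` is treated as empty.
        for key, value in sources.get(name, {}).items():
            if key not in merged:
                merged[key] = value
            elif merged[key] != value:
                conflicts.setdefault(key, {})[name] = value
    return merged, conflicts


# Hypothetical sample data, not taken from the real source files.
sources = {
    "it_m3": {"casa": "A1", "cane": "A2"},
    "italian": {"casa": "A2", "gatto": "A1"},
}
merged, conflicts = merge_by_priority(sources, PRIORITY)
print(merged)
print(conflicts)
```

With this sample input, `casa` takes its value from `italian` because that source comes first in the priority list, and the overridden `it_m3` value is kept in `conflicts` so it can be reported rather than silently dropped.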
||
|---|---|---|
| extract-cefrj-csv.py | ||
| extract-en_m3.py | ||
| extract-octanove.py | ||
| extract-random-json.py | ||