- rename scripts/ to data-pipeline/, archive existing scripts - add @lila/pipeline as pnpm workspace package - add stage-1-extract through stage-5-compare folder structure - update SUPPORTED_LANGUAGE_CODES (add es, de, fr) - update SUPPORTED_POS (add adjective, adverb) - add description field to term_glosses - add term_examples table - run and verify db migration - write and verify extract.py (117,659 synsets across 5 languages) - write PIPELINE.md |
||
|---|---|---|
| .. | ||
| stage-1-extract/scripts | ||
| stage-2-annotate/sources/cefr | ||
| COVERAGE.md | ||
| package.json | ||
| tsconfig.json | ||