lila/documentation/notes.md
lila 2b177aad5b feat(db): add incremental upsert seed script for WordNet vocabulary
Implements packages/db/src/seed.ts — reads all JSON files from
scripts/datafiles/, validates filenames against supported language
codes and POS, and upserts synsets into  and
via onConflictDoNothing. Safe to re-run; produces 0 writes on
a duplicate run.
2026-03-30 15:58:01 +02:00

1 KiB

notes

tasks

  • pinning dependencies in package.json files
  • add this to drizzle migrartions file: ALTER TABLE terms ADD CHECK (pos IN ('noun', 'verb', 'adjective', etc));

openwordnet

download libraries via

python -c 'import wn; wn.download("omw-fr")';

libraries:

odenet:1.4 omw-es:1.4 omw-fr:1.4 omw-it:1.4 omw-en:1.4

upgrade wn package:

pip install --upgrade wn

check if wn is available, eg italian:

python -c "import wn; print(len(wn.words(lang='it', lexicon='omw-it:1.4')))"

remove a library:

python -c "import wn; wn.remove('oewn:2024')"﬌ python -c "import wn; wn.remove('oewn:2024')"

list all libraries:

python -c "import wn; print(wn.lexicons())"

drizzle

generate migration file, go to packages/db, then:

pnpm drizzle-kit generate

execute migration, go to packages/db (docker containers need to be running):

DATABASE_URL=postgresql://username:password@localhost:5432/database pnpm drizzle-kit migrate