Sync OpenCC dictionaries, configs, and upstream testcases #34

Open
frankslin wants to merge 1 commit into yichen0831:master from nk2028:master

@frankslin

- Replace bundled OpenCC dictionary sources with the current upstream text dictionaries.
- Replace bundled conversion configs with upstream configs, converting `.ocd2` references to `.txt` so they work with this pure-Python loader.
- Generate missing reverse dictionaries for HK, TW, and JP variants from the updated upstream source dictionaries.
- Remove obsolete split Taiwan phrase dictionaries: `TWPhrasesIT.txt`, `TWPhrasesName.txt`, and `TWPhrasesOther.txt`.
- Add a data-driven unittest that runs the upstream testcase corpus against all expected conversion modes.
- Add upstream testcase data into `opencc/testcases/` so imported upstream assets live under the package tree.
- Teach dictionary loading to ignore upstream comment/blank lines, and allow ASCII hyphenated terms to match dictionary entries.
- Fix the existing unittest import path so tests can be run from the repository root.
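
For reviewers, here is a minimal sketch of three of the mechanics listed above: skipping comment and blank lines while loading a text dictionary, inverting a dictionary to produce a reverse one, and rewriting `.ocd2` references to `.txt` in a config. It is not the code in this PR. The function names (`parse_dictionary`, `reverse_dictionary`, `use_text_dictionaries`) are hypothetical, and the sketch assumes that comment lines start with `#`, that each entry is `key<TAB>value1 value2 ...`, and that configs are nested JSON objects whose string values may name dictionary files.

```python
def parse_dictionary(text):
    """Parse 'key<TAB>value1 value2 ...' lines into {key: [values]}.

    Assumption: blank lines and lines starting with '#' are comments
    and carry no entries.
    """
    entries = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank and comment lines
        key, sep, values = line.partition("\t")
        if not sep:
            continue  # no tab separator: not a valid entry in this sketch
        entries[key] = values.split(" ")
    return entries


def reverse_dictionary(entries):
    """Invert {key: [values]} into {value: [keys]}, keeping first-seen order."""
    reverse = {}
    for key, values in entries.items():
        for value in values:
            keys = reverse.setdefault(value, [])
            if key not in keys:
                keys.append(key)
    return reverse


def use_text_dictionaries(node):
    """Recursively replace a trailing '.ocd2' with '.txt' in string values.

    Assumption: only file references end in '.ocd2'; other fields are
    left untouched.
    """
    if isinstance(node, dict):
        return {k: use_text_dictionaries(v) for k, v in node.items()}
    if isinstance(node, list):
        return [use_text_dictionaries(v) for v in node]
    if isinstance(node, str) and node.endswith(".ocd2"):
        return node[: -len(".ocd2")] + ".txt"
    return node
```

Under these assumptions, `reverse_dictionary(parse_dictionary(source_text))` yields the mapping a generated reverse dictionary would contain, and `use_text_dictionaries(json.load(f))` returns a config the text loader can resolve. How the PR itself orders multiple reverse candidates is not stated in the description, so first-seen order here is only a placeholder choice.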