Skip to content

Identify variants which aren't used in any decompositions / which people are more likely to use #15

@Transfusion

Description

@Transfusion

The premise of this tool is to leverage knowledge of character variants, which includes the source characters of various radicals, whether dictionary-indexed, simplified, or otherwise transformed in a consistent matter.

The right part of 価, the top part of 要, 栗, the top-right part of 湮, etc is semantically a 西, however, there are two identical (to the eye) "覀"s that are shown.

image

The approach taken to handle issue #2 is to convert all IDS decompositions to Unihan characters outside of the CJK Radicals Supplement in the ETL process, hence these characters are retrievable by U+8980 and not by U+2EC3.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions