Skip to content

Identify variants which aren't used in any decompositions / which people are more likely to use #15

@Transfusion

Description

@Transfusion

The premise of this tool is to leverage knowledge of character variants, which includes the source characters of various radicals, whether dictionary-indexed, simplified, or otherwise transformed in a consistent matter.

The right part of 価, the top part of 要, 栗, the top-right part of 湮, etc is semantically a 西, however, there are two identical (to the eye) "覀"s that are shown.

image

The approach taken to handle issue #2 is to convert all IDS decompositions to Unihan characters outside of the CJK Radicals Supplement in the ETL process, hence these characters are retrievable by U+8980 and not by U+2EC3.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions