This is Not My Gender

You may come across Chinese names in various contexts and wonder about their gender. Unlike Western names like Peter and Emily, Chinese names don’t follow fixed gender rules. Native speakers often infer gender from character meanings, but this gets tricky when names are written in Pinyin, which hides both meaning and tone.

Scroll down for the story or click Explorer to dive in.

Gender Inference
in Chinese Names

Unlike Western names, there is no predefined list of names in Chinese. Surnames are inherited, while parents can choose one or two (rarely three) characters for their children’s given names based on factors like meaning, sound, and family tradition.

For example, in Cixin Liu, the author of The Three-Body Problem, "Liu" (刘) is the surname, and "Cixin" is the chosen given name made from the characters "Ci" (慈) and "Xin" (欣). When written in Pinyin, these characters are represented as syllables that together form the name "Cixin."

surname

Liu

given name

two characters

Ci Xin

When it comes to gender, the association of a Chinese given name can be only inferred from the meanings and cultural connotations of its characters. So if we did not already know that Liu Cixin is a well-known male science-fiction author, his given name could easily be perceived as feminine, since its characters more commonly appear in female names.

Gendered and
Ambiguous Characters

Certain Chinese characters carry strong gender associations. For example, characters like (jiàn), meaning strong or healthy, are commonly found in male names, while (měi), meaning beautiful, is often used in female names. On the other hand, characters like (hǎo), meaning good, are gender-neutral or ambiguous, and can be used for both males and females.

This suggests that gender association in Chinese characters exists on a spectrum rather than within a strict binary. Below is the gender distribution of 2,936 characters found in given names at least once per million people, based on how frequently each character is used in male vs. female names.

...

Moreover, there are no fixed rules requiring male names to use “masculine” characters. For example, Lin Dan (林丹), the male Olympic badminton champion, has a given name Dan (丹) that is commonly used in female names.

In the case of two-character names, both characters contribute to the gender association. For example, He Zehui (何泽慧), often called the “Madame Curie of China,” has Ze (泽), slightly more common in male names, and Hui (慧), strongly feminine, which likely makes the name overall perceived as female.

When the characters form a phrase, meaning can override individual associations. Li Xiaolong (李小龙), known as Bruce Lee, has a given name that means “little dragon” (Xiaolong, 小龙). While Xiao (小, “little”) appears slightly more in female names, the phrase still refers to a dragon, which is traditionally masculine, giving the name a strongly male impression.

All of these cases, along with many others, make gender inference challenging and often of low accuracy, even for native speakers.

Challenges in
Pinyin Names

Various characters can share the same Pinyin. For example, refers to 37 commonly used characters. Some of these, such as 毅 and 薏, are strongly associated with opposite genders.

When inferring gender from Pinyin alone, native speakers can only estimate based on the overall distribution or popularity of characters that share that pronunciation. In other words, if most characters pronounced yì are masculine, or if the most frequently used ones are, then the name is more likely to be perceived as male, even though some feminine variants exist.

In Western contexts, Pinyin names are often written without tone markers, compressing thousands of Chinese characters into just 375 syllables. However, in a tonal language like Chinese, the same syllable pronounced in different tones can correspond to entirely different sets of characters, like , , , and , each carrying its own gender perception.

When only the syllables (without tone markers) are used, inferring gender becomes even more difficult. In our analysis, gender inferred from syllables retains only about 25% of the gender pattern present in the original characters.

Explore the full complexity of gender ambiguity in Chinese names and how different characters influence gender perception.

Go to explorer