orthographic change - The Ohio State University
orthographic change - The Ohio State University
orthographic change - The Ohio State University
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
ORTHOGRAPHIC CHANGE:<br />
YUE (CANTONESE) CHINESE DIALECT CHARACTERS<br />
IN THE<br />
NINETEENTH AND TWENTIETH CENTURIES<br />
A <strong>The</strong>sis<br />
Presented in Partial Fulfillment of the Requirements for<br />
the Degree Master of Arts in the<br />
Graduate School of <strong>The</strong> <strong>Ohio</strong> <strong>State</strong> <strong>University</strong><br />
By<br />
Thomas Chan, B.A.<br />
* * * * *<br />
<strong>The</strong> <strong>Ohio</strong> <strong>State</strong> <strong>University</strong><br />
2001<br />
Master’s Examination Committee: Approved by<br />
Professor Marjorie K.M. Chan, Adviser _____________________<br />
Professor Jianqi Wang Adviser<br />
Department of East Asian<br />
Languages and Literatures
ABSTRACT<br />
Yue (Cantonese) Chinese dialect characters, which have never been subject to<br />
prescriptive reforms, present a fertile ground for studying <strong>orthographic</strong> <strong>change</strong>. In the<br />
past two hundred years, they have <strong>change</strong>d greatly and still continue to <strong>change</strong>,<br />
providing an opportunity to track many <strong>orthographic</strong> <strong>change</strong>s within a relatively short<br />
timeframe. We find that the <strong>change</strong>s are part of an ongoing optimization process of<br />
refining the written form by changing and replacing characters using more preferred<br />
character construction and usage principles, as well as principle-internal <strong>change</strong>s.<br />
Eight dictionaries and lexicons, ranging from 1856 to 1996, were used to track<br />
a hundred and fifteen words. A modified model of character construction and usage<br />
principles based on the traditional liushu model was used as a framework for<br />
understanding the characters used. This model categorized each character as one of<br />
four types: co-signific, semantic loan, phonetic loan, and signific-phonetic. Although<br />
contemporary written Cantonese is known for its phonetic loan characters marked with<br />
a mouth radical, signific-phonetic characters were found to be the most preferred<br />
character construction and usage principle, representing a stage of development that<br />
virtually all characters are progressing towards. It was followed by a tie between the<br />
co-signific and semantic loan principles, while phonetic loans were the least preferred.<br />
ii
ACKNOWLEDGMENTS<br />
I wish to thank my adviser, Professor Marjorie K.M. Chan, for her enthusiastic<br />
interest and support in working with early Cantonese materials, and for continual<br />
guidance and encouragement during the writing of this work.<br />
I thank the <strong>Ohio</strong> <strong>State</strong> <strong>University</strong> library and the libraries in the CIC network<br />
and their staff for making available to me old books which have made this work<br />
possible. I also thank Professor Li Guoqing, Chinese Studies Librarian at the <strong>Ohio</strong><br />
<strong>State</strong> <strong>University</strong> library, for his vigilance in preserving the irreplaceable materials in<br />
the library collection.<br />
I thank my thesis committee members, Professor Marjorie K.M. Chan and<br />
Professor Jianqi Wang for their insights and patience during the writing of this work.<br />
I also thank Debbie Knicely, our Department Graduate Secretary, for her vital<br />
assistance with the associated paperwork.<br />
I thank my parents for teaching me the Cantonese language and for their<br />
understanding and support of my academic studies. I also thank my fiancée, Chandra<br />
Reyer, and the Reyer family for providing support and encouragement during the<br />
writing of this work.<br />
iii
Finally, I thank Professor Robert S. Bauer and Professor Kwan-hin Cheung for<br />
making a pre-publication copy of their forthcoming monograph available to me. I also<br />
thank everyone that I have ever had a discussion with about Cantonese dialect<br />
characters for sharing and encouraging my interest.<br />
iv
VITA<br />
August 4, 1976 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Born<br />
New York, NY<br />
1998 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . B.A. Linguistics,<br />
Cornell <strong>University</strong><br />
1999-2001 . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . Graduate Research Associate,<br />
<strong>The</strong> <strong>Ohio</strong> <strong>State</strong> <strong>University</strong><br />
FIELDS OF STUDY<br />
Major Field: East Asian Languages and Literatures<br />
v
TABLE OF CONTENTS<br />
vi<br />
Page<br />
Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii<br />
Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii<br />
Vita . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v<br />
List of Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix<br />
List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii<br />
Chapters:<br />
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1<br />
1.1 Varieties of Spoken Chinese . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1<br />
1.2 Varieties of Written Chinese . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3<br />
1.3 Types of Chinese Characters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8<br />
1.4 Rationale for Studying Cantonese Dialect Characters . . . . . . . . . . . . 10<br />
2. Background Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13<br />
2.1 Phonology of Cantonese . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13<br />
2.2 Romanization Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16<br />
2.3 Phonological Mergers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21<br />
2.3.1 Substitution of the Velar Nasal Initial ng- [ŋ-]<br />
for the Zero Initial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21<br />
2.3.2 Substitution of the Zero Initial<br />
for the Velar Nasal Initial ng- [ŋ-] . . . . . . . . . . . . . . . . . . . . . 22<br />
2.3.3 Substitution of the Liquid Initial l- [l-]<br />
for the Nasal Initial n- [n-] . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.4 Phonological Distinctions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23<br />
2.4.1 Distinction Between the Vowels -o- [-ɔ-] and -a- [-ɐ-]<br />
Before Labial Finals -m [-m] and -p [-p] . . . . . . . . . . . . . . . . 24<br />
2.4.2 Distinction Between the Dental and Palatal Sibilant Initials<br />
[ts-]/[ts h -]/[s-] and [tʃ-]/[tʃ h -]/[ʃ-] . . . . . . . . . . . . . . . . . . . . . . . 25<br />
2.5 Description of Rare Characters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27<br />
2.6 Unicode Codepoint . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29<br />
3. Project . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32<br />
3.1 Requirements for Sources of Cantonese Dialect Characters . . . . . . . . 33<br />
3.2 Overview of Sources Used . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34<br />
3.3 Characters Selected for Study . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38<br />
3.4 Traditional Model of Character Construction and Usage Principles . 39<br />
3.5 Modified Model of Character Construction and Usage Principles . . . 41<br />
4. Co-Signific Characters, Semantic Loans, and Indeterminate Cases . . . . . . . 45<br />
4.1 Co-Signific Characters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45<br />
4.2 Semantic Loans . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47<br />
4.3 Indeterminate Cases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50<br />
5. Phonetic Loans . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57<br />
5.1 Unmarked Phonetic Loans . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58<br />
5.2 Marked Phonetic Loans . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62<br />
5.3 Unmarked Phonetic Loans Superseded by Marked Phonetic Loans . . 67<br />
5.4 Optimization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73<br />
5.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86<br />
6. Signific-Phonetic Characters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96<br />
6.1 Optimization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103<br />
6.2 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108<br />
7. Hierarchy of Character Construction and Usage Principles . . . . . . . . . . . . . 112<br />
7.1 Hegemony of Signific-Phonetic Characters . . . . . . . . . . . . . . . . . . . . 112<br />
7.1.1 Signific-Phonetic Characters and Co-Signific Characters . . . 113<br />
7.1.2 Signific-Phonetic Characters Superseding Phonetic Loans . . 114<br />
7.1.3 Signific-Phonetic Characters and Semantic Loans . . . . . . . . 123<br />
7.2 Co-Signific Characters Superseding Phonetic Loans . . . . . . . . . . . . . 126<br />
7.3 Semantic Loans Superseding Phonetic Loans . . . . . . . . . . . . . . . . . . . 128<br />
vii
7.4 Indeterminate Cases Being Superseded . . . . . . . . . . . . . . . . . . . . . . . 129<br />
7.4.1 Signific-Phonetic Characters Superseding<br />
Indeterminate Cases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129<br />
7.4.2 Semantic Loans Superseding Indeterminate Cases . . . . . . . . 131<br />
7.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132<br />
8. Concluding Remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140<br />
Appendices:<br />
A Characters by Unicode Codepoint . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145<br />
B Characters by Syllable . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153<br />
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163<br />
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169<br />
viii
LIST OF TABLES<br />
Table Page<br />
2.1 Syllable Structure of Cantonese . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13<br />
2.2 Cantonese Initials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14<br />
2.3 Cantonese Finals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15<br />
2.4 Cantonese Tones . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16<br />
2.5 Romanization of Initials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18<br />
2.6 Romanization of Finals, Part I . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19<br />
2.7 Romanization of Finals, Part II . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20<br />
2.8 Romanization of Tones . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21<br />
2.9 Vowels -o- [-ɔ-] and -a- [-ɐ-] Before Labial Finals -m [-m] and -p [-p] . . 24<br />
2.10 Dental and Palatal Sibilant Initials [ts-]/[ts h -]/[s-] and [tʃ-]/[tʃ h -]/[ʃ-] . . . . . 26<br />
2.11 Ideographic Description Characters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28<br />
2.12 Unicode Versions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30<br />
2.13 Unicode Blocks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30<br />
3.1 Comparison of Principles in the Traditional and Modified Models . . . . . 42<br />
4.1 Co-Signific Characters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46<br />
4.2 Optimization of a Co-Signific Character . . . . . . . . . . . . . . . . . . . . . . . . . . 47<br />
4.3 Semantic Loans (History) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49<br />
4.4 Semantic Loans (Basis) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50<br />
4.5 Indeterminate Cases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51<br />
5.1 Completely Homophonous Unmarked Phonetic Loans (History) . . . . . . . 59<br />
5.2 Completely Homophonous Unmarked Phonetic Loans (Basis) . . . . . . . . 60<br />
5.3 Semi-Homophonous Unmarked Phonetic Loans (History) . . . . . . . . . . . . 61<br />
5.4 Semi-Homophonous Unmarked Phonetic Loans (Basis) . . . . . . . . . . . . . 62<br />
5.5 Marked Phonetic Loans Differing in the Initial or Final (History) . . . . . . 64<br />
5.6 Marked Phonetic Loans Differing in the Initial or Final (Basis) . . . . . . . . 65<br />
5.7 Marked Phonetic Loans Differing in the Tone (History) . . . . . . . . . . . . . . 66<br />
5.8 Marked Phonetic Loans Differing in the Tone (Basis) . . . . . . . . . . . . . . . 66<br />
5.9 Indeterminate Case Reanalyzed as a Phonetic Loan . . . . . . . . . . . . . . . . . 67<br />
5.10 Unmarked Phonetic Loans Superseded by Marked Phonetic Loans,<br />
Part I (History) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68<br />
ix
5.11 Unmarked Phonetic Loans Superseded by Marked Phonetic Loans,<br />
Part I (Basis) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69<br />
5.12 Unmarked Phonetic Loans Superseded by Marked Phonetic Loans,<br />
Part II (History) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71<br />
5.13 Unmarked Phonetic Loans Superseded by Marked Phonetic Loans,<br />
Part II (Basis) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72<br />
5.14 Optimization of the Phonetic in Phonetic Loans (History) . . . . . . . . . . . . 74<br />
5.15 Optimization of the Phonetic in Phonetic Loans (Basis) . . . . . . . . . . . . . . 74<br />
5.16 Optimization of the Phonetic in Phonetic Loans<br />
Facilitated by Phonological Mergers (History) . . . . . . . . . . . . . . . . . . . . . 76<br />
5.17 Optimization of the Phonetic in Phonetic Loans<br />
Facilitated by Phonological Mergers (Basis) . . . . . . . . . . . . . . . . . . . . . . . 76<br />
5.18 Optimization of the Phonetic in Phonetic Loans<br />
for Other Reasons (History) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78<br />
5.19 Optimization of the Phonetic in Phonetic Loans<br />
for Other Reasons (Basis) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79<br />
5.20 Erroneous Optimization of the Phonetic in a Phonetic Loan (History) . . . 81<br />
5.21 Erroneous Optimization of the Phonetic in a Phonetic Loan (Basis) . . . . 82<br />
5.22 Optimization of Phonetics in Polysyllabic Phonetic Loans (History) . . . . 84<br />
5.23 Optimization of Phonetics in Polysyllabic Phonetic Loans (Basis) . . . . . . 85<br />
6.1 Signific-Phonetic Characters<br />
with Completely Homophonous Phonetics (History) . . . . . . . . . . . . . . . . 97<br />
6.2 Signific-Phonetic Characters<br />
with Completely Homophonous Phonetics (Basis) . . . . . . . . . . . . . . . . . . 97<br />
6.3 Signific-Phonetic Characters<br />
with Phonetics Differing in the Tone (History) . . . . . . . . . . . . . . . . . . . . . 98<br />
6.4 Signific-Phonetic Characters<br />
with Phonetics Differing in the Tone (Basis) . . . . . . . . . . . . . . . . . . . . . . 99<br />
6.5 Signific-Phonetic Characters<br />
with Phonetics Differing in the Initial (History) . . . . . . . . . . . . . . . . . . . . 100<br />
6.6 Signific-Phonetic Characters<br />
with Phonetics Differing in the Initial (Basis) . . . . . . . . . . . . . . . . . . . . . . 100<br />
6.7 Signific-Phonetic Characters<br />
with Phonetics Differing in the Initial and Tone (History) . . . . . . . . . . . . 101<br />
6.8 Signific-Phonetic Characters<br />
with Phonetics Differing in the Initial and Tone (Basis) . . . . . . . . . . . . . . 102<br />
6.9 Signific-Phonetic Characters<br />
with Phonetics Differing in the Final and Tone (History) . . . . . . . . . . . . . 103<br />
6.10 Signific-Phonetic Characters<br />
with Phonetics Differing in the Final and Tone (Basis) . . . . . . . . . . . . . . 103<br />
6.11 Optimization of the Phonetic in Signific-Phonetic Characters (History) . 104<br />
6.12 Optimization of the Phonetic in Signific-Phonetic Characters (Basis) . . . 105<br />
x
6.13 Optimization of a Phonetic in a Signific-Phonetic Character<br />
Due to a Change in Pronunciation (History) . . . . . . . . . . . . . . . . . . . . . . . 106<br />
6.14 Optimization of a Phonetic in a Signific-Phonetic Character<br />
Due to a Change in Pronunciation (Basis) . . . . . . . . . . . . . . . . . . . . . . . . . 106<br />
6.15 Optimization of Signific-Phonetic Characters<br />
for Other Reasons (History) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107<br />
6.16 Optimization of Signific-Phonetic Characters<br />
for Other Reasons (Basis) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108<br />
7.1 Distribution of Character Construction and Usage Principles . . . . . . . . . . 113<br />
7.2 Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining the Same Phonetic, Part I (History) . . . . . . . . . . . . . . . . . . 115<br />
7.3 Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining the Same Phonetic, Part I (Basis) . . . . . . . . . . . . . . . . . . . . 117<br />
7.4 Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining the Same Phonetic, Part II (History) . . . . . . . . . . . . . . . . . . 118<br />
7.5 Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining the Same Phonetic, Part II (Basis) . . . . . . . . . . . . . . . . . . . 118<br />
7.6 Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining an Abbreviated Form of the Phonetic (History) . . . . . . . . . 119<br />
7.7 Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining an Abbreviated Form of the Phonetic (Basis) . . . . . . . . . . 120<br />
7.8 Signific-Phonetic Characters Superseding Phonetic Loans<br />
as a Result of Semantic Specialization (History) . . . . . . . . . . . . . . . . . . . . 121<br />
7.9 Signific-Phonetic Characters Superseding Phonetic Loans<br />
as a Result of Semantic Specialization (Basis) . . . . . . . . . . . . . . . . . . . . . 122<br />
7.10 Signific-Phonetic Character Superseding a Phonetic Loan<br />
and Optimization of the Phonetic (History) . . . . . . . . . . . . . . . . . . . . . . . . 123<br />
7.11 Signific-Phonetic Character Superseding a Phonetic Loan<br />
and Optimization of the Phonetic (Basis) . . . . . . . . . . . . . . . . . . . . . . . . . 123<br />
7.12 Signific-Phonetic Character Superseding a Semantic Loan (History) . . . . 124<br />
7.13 Signific-Phonetic Character Superseding a Semantic Loan (Basis) . . . . . 124<br />
7.14 Semantic Loans Superseding Signific-Phonetic Characters (History) . . . 126<br />
7.15 Semantic Loans Superseding Signific-Phonetic Characters (Basis) . . . . . 126<br />
7.16 Co-Signific Character Superseding a Phonetic Loan (History) . . . . . . . . . 127<br />
7.17 Co-Signific Character Superseding a Phonetic Loan (Basis) . . . . . . . . . . 127<br />
7.18 Semantic Loans Superseding Phonetic Loans (History) . . . . . . . . . . . . . . 128<br />
7.19 Semantic Loans Superseding Phonetic Loans (Basis) . . . . . . . . . . . . . . . . 129<br />
7.20 Signific-Phonetic Characters Superseding Indeterminate Cases (History) 130<br />
7.21 Signific-Phonetic Characters Superseding Indeterminate Cases (Basis) . . 131<br />
7.22 Semantic Loan Superseding an Indeterminate Case . . . . . . . . . . . . . . . . . 132<br />
xi
LIST OF FIGURES<br />
Figure Page<br />
7.1 Hierarchy of Character Construction and Usage Principles . . . . . . . . . . . 135<br />
xii
CHAPTER 1<br />
INTRODUCTION<br />
This work seeks to understand the <strong>orthographic</strong> <strong>change</strong>s in Cantonese dialect<br />
characters by introducing a methodology of tracing the written forms used to write a<br />
word in sources where the pronunciation and meaning are reliably indicated. <strong>The</strong>se<br />
sources are eight bilingual dictionaries and lexicons, mostly authored by and for a<br />
foreign audience, spanning a century and a half from 1856 to 1996. Using a modified<br />
model based on the traditional liushu 六書 model of character construction and usage<br />
principles, the <strong>change</strong>s in the written forms of a data set of a hundred and fifteen<br />
words have been analyzed as a transition from one principle to another, or as<br />
principle-internal optimizations. In this way, the various principles may be ranked by<br />
how preferred they are, as well arriving at an understanding of why and when various<br />
smaller <strong>change</strong>s have taken place.<br />
This chapter provides definitions and explanations to aid in understanding the<br />
title of this work, “Orthographic Change: Yue (Cantonese) Chinese Dialect Characters<br />
in the Nineteenth and Twentieth Centuries”.<br />
1.1 Varieties of Spoken Chinese<br />
Chinese is conventionally divided into at least seven groups, following Li<br />
Fang-kuei’s 李方桂 classification in the 1937 edition of <strong>The</strong> Chinese Yearbook, which<br />
1
was later reprinted with revisions in the first issue of the Journal of Chinese<br />
Linguistics in 1973, and Yuan Jiahua’s (1960) classification in his Hanyu fangyan<br />
gaiyao 漢語方言概要 in 1960 1 . <strong>The</strong>se groups are, in approximate geographic order<br />
from north to south: 1) Mandarin, 2) Wu 吳, 3) Gan 贛, 4) Xiang 湘, 4), Min 閩, 5)<br />
Kejia (Hakka) 客家, and 6) Yue 粵.<br />
<strong>The</strong> Yue group, according to the Summer Institute of Linguistics’ (SIL)<br />
Ethnologue: Languages of the World, 14th ed. (2001), is currently spoken by fifty-two<br />
million in China, comprising 4.5% of the population, and seventy-one million<br />
worldwide. Yue is spoken in the provinces of Guangdong 廣東 and Guangxi 廣西,<br />
and the speech of the provincial capital of Guangdong, Guangzhou 廣州 (Canton), is<br />
regarded as the standard variety. For purposes of discussion, the term “Cantonese”<br />
will be used narrowly to refer to the speech of Guangzhou and nearby related areas,<br />
such as Hong Kong, rather than broadly as a synonym of “Yue”.<br />
<strong>The</strong> Mandarin group, in contrast, is currently spoken by 867 million in China,<br />
comprising 70% of the population, and 874 million worldwide. Mandarin is spoken<br />
north of the Yangtze river in the northern half of China as well as in southwestern<br />
China, and is considered the standard variety of Chinese, upon which modern standard<br />
written Chinese is based. For purposes of discussion, the term “Mandarin” will be<br />
used narrowly to refer to the Putonghua 普通話 ‘common language’ and Guoyu 國語<br />
‘national language’ koines, as well as the speech of Beijing 北京 that they are based<br />
on, rather than broadly as a term for the entire Mandarin group.<br />
2
1.2 Varieties of Written Chinese<br />
Written Chinese is typically divided into two groups, wenyan 文言 and baihua<br />
白話. Wenyan, or classical Chinese, was a written language that had no spoken<br />
analogue, and was the undisputed literary standard for prestige writing until the 1920s.<br />
In contrast, baihua is a cover term for non-wenyan writing, which were written forms<br />
of the vernacular, used for less prestigious writing such as popular literature. Modern<br />
forms of baihua developed in the first half of the twentieth century into a written<br />
language based primarily on Mandarin, although not without influences from non-<br />
Mandarin varieties of Chinese 2 . By the mid-twentieth century, baihua had effectively<br />
taken over wenyan’s literary functions. However, there were also written forms of the<br />
vernacular that were not based on Mandarin, including those for Cantonese 3 .<br />
However, there were often hybrid forms of writing that also incorporated some<br />
features of wenyan and Mandarin.<br />
An illustration of the differences between written Mandarin and Cantonese is<br />
given by Williams (1909 [1874]: xxxv-xlvii) in the mid-late nineteenth century<br />
vernacular renderings of an excerpt from a section on filial piety in the Shengyu<br />
Guangxun 聖諭廣訓, which was written in classical Chinese. <strong>The</strong> passage is rendered<br />
in the written vernacular of seven localities: Beijing 北京 and Hankou 漢口,<br />
representing Mandarin dialects; Shanghai 上海 and Ningbo 寧波, representing Wu<br />
dialects; Fuzhou 福州 and Shantou 汕頭, representing Min dialects, and Guangzhou<br />
廣州, a Yue dialect. <strong>The</strong> classical Chinese original (W) is shown below, along with<br />
the Beijing (B) and Guangzhou (G) vernacular renderings to represent written<br />
3
Mandarin and Cantonese, respectively, as well as Williams’ English translation (E) of<br />
the original. <strong>The</strong> sentences have been rearranged to juxtapose equivalent sentences for<br />
comparison, and supplemented with modern punctuation. As with all translations,<br />
there is variation in how the original is rendered, but nevertheless, vocabulary and<br />
characters peculiar to Cantonese can still be discerned from the Guangzhou vernacular<br />
version.<br />
W: 夫孝者,天之經,地之義,民之行也。<br />
B: 那孝是什麼?就是天上的常道,地上的定理,人間所應當<br />
奉行的呵。<br />
G: 個的孝道,乃係天嘅常經,地嘅定理,人嘅總行呀。<br />
E: Now filial piety is a statute of heaven, a principle of earth, and<br />
an obligation of mankind.<br />
<strong>The</strong> Cantonese passage opens with go 2 di 1 個的 ‘those’, similar in function to<br />
Mandarin nàxiē 那些 ‘those’. go 2 di 1 個的 ‘those’ is written with the characters for<br />
go 3 個 ‘one’ and dik 1 的 ‘genitive particle’ from which they developed, but nowadays,<br />
it is written as 嗰啲. go 2 di 1 個的 itself can be broken down into go 2 個 ‘that’ and di 1<br />
的 ‘ones’, which are similar in function to Mandarin nà 那 and xiē 些. <strong>The</strong> first<br />
sentence also contains the copula hai 6 係 ‘to be’, which is now bookish in Mandarin.<br />
It also contains the genitive particle ge 3 嘅, similar in function to Mandarin de 的, and<br />
ends with a sentence-final particle, a 3 呀.<br />
W: 人不知孝父母 ,獨不思父母愛子之心乎?<br />
B: 人若不曉得孝順父母 ,先不用講別的獨不想一想父母疼愛<br />
兒子的心腸麼?<br />
G: 世人唔知孝敬父母 ,獨唔想吓父母愛仔個點心咩?<br />
E: Do you, who are void of filial piety, ever reflect on the natural<br />
affection of parents for their children?<br />
4
<strong>The</strong> second sentence contains the negative m 4 唔 ‘not’, similar in function to<br />
Mandarin bù 不, as well as ha 2 吓 ‘a moment’, used in the construct seung 2 ha 2 想吓<br />
‘to think over’, parallel to xiǎngyīxiǎng 想一想 in the Beijing vernacular version. <strong>The</strong><br />
sentence also includes jai 2 仔 ‘child’, and ends with a sentence-final particle, me 1 咩,<br />
which expresses doubt.<br />
W: 方其未離懷抱,饑不能自哺,寒不能自衣。<br />
B: 當他沒有離開父母懷抱的時候,餓了自己不能吃,冷了自<br />
己不能穿。<br />
G: 當佢未曾離開襟懷保抱個時,肚餓唔噲自己揾食,身冷唔<br />
噲自己揾著。<br />
E: Even before you left the maternal bosom, if hungry, you could<br />
not have fed yourselves; or if cold, you could not have put on<br />
your own clothes.<br />
<strong>The</strong> third sentence contains the third person personal pronoun keui 5 佢 ‘he,<br />
she, it’, which developed from the ancient pronoun keui 4 渠 both in pronunciation and<br />
graphic form, and is similar to Mandarin tā 他 when the latter is used without<br />
distinguishing animacy nor gender. <strong>The</strong> sentence also contains mei 6 chang 4 未曾 ‘not<br />
yet’, similar to one usage of Mandarin mei 2 沒, as well as wui 5 噲 ‘to be able’, which<br />
is distinguished from wui 6 會 ‘to meet’. Nowadays both wui 5 ‘to be able’ and wui 6 ‘to<br />
meet’ are written as 會, just as in Mandarin where the two are homophonous. <strong>The</strong><br />
sentence also includes wan 2 揾 ‘to find’ and jeuk 3 著 ‘to wear’, as well as sik 6 食 ‘to<br />
eat’, which is no longer a verb in Mandarin.<br />
W: 為父母者,審音聲,察形色,笑則為之喜,啼則為之憂。<br />
B: 作父母的,揣度他的聲音,察看他的氣色,他若嬉笑就為<br />
他歡喜,他若啼哭就為他愁煩。<br />
G: 做父母嘅,聽佢聲音,睇佢形像,而色笑就替佢歡喜,喊<br />
就替佢贔屭。<br />
5
E: A father or a mother judge by the voice, or look at the features<br />
of their children, whose smiles make them joyful, or whose<br />
weeping excites their grief.<br />
W: 行動則跬步不離,疾痛則寢食俱廢,以養以教。<br />
B: 他一行走就連半步也不肯離開他,有病痛就連睡覺吃飯也<br />
都廢掉,從他小時候就拿衣食養活他,拿詩書教訓他。<br />
G: 初學行就寸步唔敢離開,有病痛就唔瞓得唔食得,一自養<br />
一自教。<br />
E: When trying to walk, they leave not their steps; and when sick<br />
or in pain, they can neither sleep nor eat in comfort, in order<br />
that they may nurture and teach them.<br />
W: 至於成人,復為授家室,謀生理百計,經營心力俱瘁。<br />
M: 直到他長大成人的時候,又給他娶媳婦,謀事業千方百<br />
計,替他打算把機氣力都用得勞困。<br />
C: 至到長大成人個時,又替佢娶妻子,謀生意百樣計較,打<br />
算心共力都疚倦咯。<br />
E: When [their children] reach man’s estate, they see to their<br />
marriage, and scheme for their livelihood by a hundred plans, in<br />
which they can weary their minds and spend their strength.<br />
<strong>The</strong> fourth sentence contains tai 2 睇 ‘to see’ and haam 3 喊 ‘to cry’, and ends<br />
with bai 3 ai 3 贔屭 ‘grief’. <strong>The</strong> following sentence contains fan 3 瞓 ‘to sleep’, which<br />
has been identified with kwan 3 睏 ‘sleepy’ and kwan 3 困 ‘weary’, as well as dak 1 得<br />
‘able’, used in the constructs m 4 fan 3 dak 1 唔瞓得 ‘not able to sleep’ and m 4 sik 6 dak 1<br />
唔食得 ‘not able to eat’, while the sixth sentence ends with a sentence-final particle,<br />
lok 3 咯.<br />
W: 父母之德,寶同昊天罔極!<br />
B: 這樣看來父母的恩典,寶在如同那廣大的天無窮無盡了!<br />
G: 父母嘅思德,眞係同埋至大嘅天咁!<br />
E: Parental virtue is truly as limitless as high heaven!<br />
W: 人子欲報親恩于萬一,自當內盡其心,外竭其力。<br />
B: 為人子的若思想父母的恩典要在萬分裏頭報答一分自然應<br />
當,裏面盡心志,外面端盡力量。<br />
G: 無窮盡�口駕做人仔嘅想報答父母恩典萬份之一,就應該<br />
裏頭盡自己嘅心,外便盡自己嘅力。<br />
6
E: A man who desires to recompense one in a myriad of the loving<br />
acts of his parents, must really devote to them his whole heart at<br />
home, and exert all his strength abroad.<br />
W: 謹身節用,以勤服勞。<br />
B: 又要保守身體省儉用度,為得勢可以勤勤謹謹的服事。<br />
G: 謹眞個身减省使用,嚟服事佢。<br />
E: He must care well for his body and be frugal in his expenses, in<br />
order that he may diligently labor for them.<br />
<strong>The</strong> seventh sentence contains tung 4 maai 4 同埋 ‘with’ and gam 3 咁 ‘so<br />
(quantity)’, which are similar in function to Mandarin gēn 跟 and zhème 這麼/nàme<br />
那麼, respectively. <strong>The</strong> sentence after it contains the sentence-final ga 3 �口駕,<br />
which is a contraction of ge 3 a 3 嘅呀, as well as ngoi 6 bin 6 外便 ‘outside’, where 便<br />
perhaps is really bin 1 邊 ‘side’, while the ninth sentence contains laai 4 嚟 ‘to come; in<br />
order to’, the colloquial reading of the same word as loi 4 來, which has been given its<br />
own character.<br />
W: 以隆孝養,毋博奕飲酒,毋好勇鬪很,毋好貨財私妻子。<br />
B: 他可以豐豐盛盛的奉養,他不可以賭錢下棋喝酒鬧事,<br />
不可好勇逞强忿怒鬥,不可貪愛錢財偏疼妻子。<br />
G: 佢買的好飲食孝敬佢,唔好賭博飲酒,唔好恃勇力打鬭,<br />
唔好食財物厚待妻子。<br />
E: To enable him to fully and filially nurture them, he must neither<br />
gamble nor get drunk, he must neither love to quarrel, nor desire<br />
to hoard wealth for the use of his wife and children.<br />
W: 縱使儀文未備而誠愨。<br />
B: 果能這樣即或外面對的禮節稍有不足,却是內裏的眞誠。<br />
G: 即使外便禮文唔得齊備,但係眞寶嘅心。<br />
E: Though his manners and accomplishments may be defective,<br />
yet his heart must, at any rate, be thoroughly sincere.<br />
W: 有餘推而廣之。<br />
B: 已經有餘孝的根本算是立住了,從此在推開了。<br />
G: 有餘剩噉樣推潤開嚟。<br />
E: Let us enlarge a little on this principle.<br />
7
<strong>The</strong> tenth sentence contains m 4 hou 2 唔好 ‘do not’, similar to function<br />
Mandarin bié 別, while the following sentence contains m 4 dak 1 唔得 ‘not sufficient’<br />
and daan 6 hai 4 但係 ‘but’; the latter similar in function to Mandarin dànshì 但是. <strong>The</strong><br />
twelth sentence contains gam 2 yeung 6 噉樣 ‘like so’, similar in function to Mandarin<br />
zhèyàng 這樣 and nàyàng 那樣.<br />
W: 曾子所謂:「 居處不莊非孝,事君不忠非孝,蒞官不敬非<br />
孝,朋友不信非孝,戰陣無勇非孝。」<br />
B: 往寬廣裏講這孝道就如曾子所說的:「平日在家裏住着若<br />
不端方穩重的就算不得孝,事奉君王若不誠寔盡心得也算<br />
不得孝,臨民作官若不小心愼重的也算不得孝,交朋友若<br />
沒有信實的也算不得孝,出兵打仗若不能奮勇爭先的也算<br />
不得孝。」<br />
G: 好似曾子所講:「坐立唔端正唔係孝,服事人君唔盡心唔<br />
係孝,做官唔謹愼唔係孝,交朋友唔信實唔係孝,打仗唔<br />
出力唔係孝。」<br />
E: Tsăngtsz’ speaks thus respecting it:—“It is unfilial to move and<br />
act without dignity; it is unfilial to serve one’s prince disloyally;<br />
it is unfilial to fill an office without reverential care; it is unfilial<br />
to act insincerely towards a friend; [and finally], to turn a<br />
coward in battle is unfilial.”<br />
W: 皆孝子分內之事也。<br />
B: 這所說的都是孝子本分以內的事呵。<br />
G: 一的都係孝子本分嘅事呀。<br />
E: All these things are involved in the duty of a filial son.<br />
Finally, the next to last sentence contains hou 2 chi 5 好似 ‘like’, which is rarer<br />
than hǎoxiàng 好像 in Mandarin, and m 4 hai 6 唔係 ‘is not’, similar in function to<br />
Mandarin bùshì 不是.<br />
1.3 Types of Chinese Characters<br />
As demonstrated in the previous section, written Cantonese employs characters<br />
and their usages which may be divided into six categories: 1) ones which are identical<br />
with Mandarin and require no further explanation; 2) ones which existed in an<br />
8
common ancestor to Cantonese and Mandarin and are preserved in Cantonese, but are<br />
extinct or exist in a restricted or further developed form in Mandarin, such as hai 6 係<br />
‘to be’ and sik 6 食 ‘to eat’; 3) ones which existed in a common ancestor and are extinct<br />
or restricted in Mandarin, but are preserved in Cantonese with further development,<br />
such as fan 3 瞓 ‘to sleep’ (< 睏 and 困) and keui 5 佢 ‘he, she, it’ (< 渠); 5) ones which<br />
exist in both Cantonese and Mandarin, but with further development in Cantonese,<br />
such as go 2 di 1 個的 ‘those’ (< 個 + 的) and wui 5 噲 ‘to be able’ (< 會); and 6) ones<br />
which are peculiar to Cantonese, such as the genitive particle ge 3 嘅.<br />
However, it is not a simple matter to identify Cantonese dialect characters, nor<br />
do all scholars agree on what constitutes one (Yue 1972; Lau 1977; Bauer 1988; Rao<br />
1996; Cheung and Bauer forthcoming 4 ), provided they even bother to explain their<br />
criteria, while most avoid the issue altogether.<br />
A further complication is that the same character is occasionally independently<br />
created in different regions and time periods, sometimes using different construction<br />
principles. For example, xìn 伩 ‘letter’, an unofficial graphic simplification of 信 (c.f.,<br />
zhè 這 � 这 ‘this’), which briefly enjoyed official endorsement in Singapore in 1974<br />
until Singapore’s script reform was revised to fully align with mainland China’s 1964<br />
scheme in 1976 (Chou 1986: 56). It also briefly enjoyed official endorsement in<br />
mainland China in 1977 (ZWGW 1977: 4) until that additional script reform scheme<br />
was abolished in 1986 with the republishing of the 1964 scheme. However, the same<br />
character was also created in Cantonese-speaking regions as one way to write the<br />
syllable man 1 in sai 3 man 1 jai 2 細伩仔 ‘little kid’ (Meyer 1947: #1774; O’Melia 1959:<br />
9
4: 138; Yue 1972: 213), constructed as a phonetic loan of man 4 文 ‘literature’, which<br />
is indicated by the yan 4 亻(人) ‘person’ radical on the left.<br />
In this work, the definition of a Cantonese dialect character given by Rao, et al.<br />
(1996: 377-380) in the appendix to their Guangzhouhua fangyan cidian<br />
廣州話方言詞典 dictionary and the list of characters provided there has been adopted<br />
as a working definition:<br />
一、廣州話常用的方言字;<br />
二、借來表示廣州話特殊音義的字,其中在群眾中比較通行或字<br />
形比較生疏的;<br />
三、《新華字典》沒有收進去的古字。<br />
That is, 1) dialect characters frequently used in Cantonese, 2) characters borrowed to<br />
represent Cantonese-specific words, and 3) ancient characters that are not included in<br />
the Xinhua zidian 新華字典 dictionary used in mainland China.<br />
1.4 Rationale for Studying Cantonese Dialect Characters<br />
<strong>The</strong>re are many possible reasons for studying Cantonese dialect characters, but<br />
the most important is that Cantonese dialect characters have never been subject to the<br />
kind of prescriptivism that have afflicted other characters, such as the reforms of the<br />
xiaozhuan 小篆 ‘lesser seal’ script over two thousand years ago which is traditionally<br />
ascribed to Li Si 李斯 (Karlgren 1923: 2), the influence of Xu Shen’s 許慎 Shuowen<br />
jiezi 說文解字 (AD 100) and the Kangxi zidian 康熙字典 (1716) dictionaries, or<br />
mainland China’s 1964 simplification scheme. For example, Cantonese dialect<br />
characters often fail to be included in comprehensive dictionaries. <strong>The</strong> eight-volume<br />
Hanyu da zidian 漢語大字典 dictionary (HYDZD 1986), with its coverage of 54,678<br />
characters, is among modern dictionaries second only to the Zhonghua zihai 中華字海<br />
10
(Leng and Wei 1994), which covers more characters (85,568), albeit at the expense of<br />
extensive definitions and usage quotes. As such, the Hanyu da zidian remains a<br />
standard reference work that would be among the first consulted. However, its<br />
coverage of Cantonese dialect characters is either incomplete or non-existent. <strong>The</strong><br />
situation is not improved by using dictionaries and reference works specializing in<br />
Cantonese, as different works may give different written forms. Thus, this presents an<br />
opportunity to study the development of characters in a laissez-faire environment.<br />
Another important reason for studying Cantonese dialect characters is that they<br />
have <strong>change</strong>d greatly within at least the two hundred years and still continue to<br />
<strong>change</strong>, unlike other characters which have basically remained un<strong>change</strong>d since the<br />
time of Li Si and Xu Shen. This allows for the tracking of <strong>orthographic</strong> <strong>change</strong>s<br />
within a relatively narrow and controllable timeframe, and with primary sources that<br />
can be reliably dated, rather than undated or pre-modern re-copied editions of non-<br />
extant originals. Furthermore, there is simply little or no research on the development<br />
of relatively modern characters, popular or scholarly, especially for Cantonese dialect<br />
characters, in contrast to the studies of characters that date far back in antiquity<br />
(Karlgren 1923; Wieger 1927).<br />
11
Endnotes<br />
1 Yuan Jiahua. 1960. Hanyu fangyan gaiyao 漢語方言概要. Beijing: Wenzi gaige.<br />
2 See section 2.2 “Indigenous Innovations Since 1918” (190-217) of the appendix in<br />
Gunn (1991: 185-294), especially section 2.13 “Distinctive Features of Regional<br />
Grammars” (203-216), for specific examples.<br />
3 See Snow (1991) for a full treatment and history.<br />
4<br />
I thank Professor Cheung and Professor Bauer with providing me with a July 26,<br />
2001 pre-publication draft.<br />
12
CHAPTER 2<br />
BACKGROUND INFORMATION<br />
This chapter provides background information on the phonology of Cantonese<br />
and phonological features relevant to the discussion, as well as notations employed in<br />
this work.<br />
2.1 Phonology of Cantonese<br />
<strong>The</strong> syllable structure in Cantonese, sans tone, can be described by Yue’s<br />
(1972: 87-88) formula (C1, G1)V(C2, G2), where C is a consonant, G a glide, and V a<br />
vowel. This yields nine possible combinations, not including syllabic nasals such as<br />
m 4 /m 21 / 唔 ‘not’ and ng 4 /ŋ 24 / 五 ‘five’ (table 2.1).<br />
Syllable<br />
Structure<br />
Examples<br />
V a 3 /a 33 / 呀 ‘sentence-final particle’<br />
C1V ga 3 /ka 33 / 假 ‘vacation’, na 2 /na 35 / 乸 ‘female’<br />
G1V ya 6 /ja 22 / 廿 ‘twenty’, wa 6 /wa 22 / 華 ‘Chinese’<br />
VC2 aat 3 /at 33 / 壓 ‘to crush’, aan 3 /an 33 / 晏 ‘late’<br />
VG2 aai 3 /aj 33 / 嗌 ‘to yell’, aau 3 /aw 33 / 拗 ‘to argue’<br />
C1VC2 gaak 3 /kak 3 / 革 ‘to reform’, gaam 3 /kam 33 / 監 ‘to force’<br />
C1VG2 gaai 3 /kaj 33 / 介 ‘to lie between’, gaau 3 /kaw 33 / 教 ‘to teach’<br />
G1VC2 yaak 3 /jak 3 / 喫 ‘to eat’, waan 1 /wan 55 / 灣 ‘bay’<br />
G1VG2 yaai 2 /jaj 35 / 踹 ‘to step on’, waai 1 /waj 55 / 歪 ‘crooked’<br />
Table 2.1: Syllable Structure of Cantonese<br />
13
However, Chinese syllables are divided into three parts by the traditional<br />
phonological model and romanization systems: the initial, the final, and the tone. <strong>The</strong><br />
“initial” refers to the initial consonant, as well as zero initials and glides serving as the<br />
initial, while the “final” refers to the rest of the syllable except for the tone, which is<br />
treated separately. Syllabic nasals, such as [m] and [ŋ], are treated as finals. Since it<br />
is unnecessary to subdivide the “final” into smaller units when discussing Chinese<br />
characters, the initial-final-tone model will be used in this work.<br />
Cantonese consists of nineteen initials plus a zero initial, and fifty-three finals<br />
plus three additional finals em [ɛm], ep [ɛp], and et [ɛt] which are only used in a few<br />
colloquial syllables. <strong>The</strong> initials are listed below (table 2.2) preceded by the<br />
equivalent Yale romanization, while the finals are listed (table 2.3) as a combination<br />
of the nuclear vowel listed on the x-axis and the following glide, nasal, or stop on the<br />
y-axis, with the equivalent Yale romanization at their intersection.<br />
Unaspirated Aspirated Nasals Fricatives Liquids Glides<br />
Labials b [p] p [p h ] m [m] f [f]<br />
Dentals d [t] t [t h ] n [n] l [l]<br />
Alveolars j [tʃ] ch [tʃ h ] s [s]<br />
Velars g [k] k [k h ] ng [ŋ] h [h]<br />
Labiovelars gw [kw] kw [kw h ]<br />
Glides w [w]<br />
y [j]<br />
Table 2.2: Cantonese Initials<br />
14
[a] [ɐ] [ɛ] [e] [œ] [ø] [i] [ɪ] [ɔ] [o] [u] [ʊ] [y] ∅<br />
∅ a e eu i o u yu<br />
[w] aau au iu ou<br />
[j] aai ai ei oi ui<br />
[ɥ] eui<br />
[m] aam am (em) im m<br />
[n] aan an eun in on un yun<br />
[ŋ] aang ang eng eung ing ong ung ng<br />
[p] aap ap (ep) ip<br />
[t] aat at (ed) eut it ot ut yut<br />
[k] aak ak ek euk ik ok uk<br />
Table 2.3: Cantonese Finals<br />
In the traditional phonological model, tones are divided into four categories,<br />
ping 平 ‘level’, shang 上 ‘rising’, qu 去 ‘going’, and ru 入 ‘entering’. Cantonese has<br />
a tone in each category in both the upper yin 陰 register and the lower yang 陽<br />
register, except for the ru 入 tone category where the yinru 陰入 tone has split,<br />
engendering a zhongru 中入 tone, for a total of nine tones. <strong>The</strong> tones are listed below<br />
(table 2.4) with their Chinese names, tone contours in numerical and Chao tone letter<br />
notation, and tone number in Yale romanization. Since the tone contours of the three<br />
ru 入 category tones, yinru 陰入 (5 ˥), zhongru 中入 (3 ˧) , and yangru 陽入 (2 �) can<br />
be identified with those of the yinping 陰平 (55 ˥), yinqu 陰去 (33 ˧), and yangqu<br />
陽去 (22 �) tones, respectively, except that they are shorter in duration, they are not<br />
assigned separate tone numbers in Yale romanization. However, they can be still be<br />
distinguished because they occur only in syllables ending in a stop, -p [-p], -t [-t], and<br />
-k [-k], and vice versa.<br />
15
Yin 陰 yinping 陰平<br />
55 ˥<br />
#1<br />
Yang 陽 yangping 陽平<br />
21 �<br />
#4<br />
2.2 Romanization Systems<br />
Ping 平 Shang 上 Qu 去 Ru 入<br />
yinshang 陰上<br />
35 �<br />
#2<br />
yangshang 陽上<br />
24 �<br />
#5<br />
Table 2.4: Cantonese Tones<br />
16<br />
yinqu 陰去<br />
33 ˧<br />
#3<br />
yangqu 陽去<br />
22 �<br />
#6<br />
yinru 陰入<br />
5 ˥<br />
#1<br />
zhongru 中入<br />
3 ˧<br />
#3<br />
yangru 陽入<br />
2 �<br />
�#6<br />
Unlike Mandarin, for which there is Hanyu Pinyin and Wade-Giles, there is no<br />
standard romanization system for Cantonese. Yue (1972: 77) remarks, “<strong>The</strong>re are<br />
almost as many systems of romanization as there are writers on Cantonese. No two<br />
authors use the same system without modification.” Dictionaries, textbooks, and other<br />
works often include conversion tables from several influential systems used by earlier<br />
authors, e.g, Lau’s (1977: xv-xvii) A Practical Cantonese-English Dictionary includes<br />
conversions from the Barnett-Chao, Meyer-Wempe, and Yale systems; Chao’s (1947:<br />
31-33) textbook, Cantonese Primer (1947: 31-33) includes conversions from the Ball,<br />
Eitel, Jones-Woo, and Meyer-Wempe systems; and Yue’s (1972: 79-83) Phonology of<br />
Cantonese includes conversions from the Official, Chao, and Meyer-Wempe systems.<br />
<strong>The</strong> system adopted here for discussion purposes, with a minor modification, is<br />
the Yale system, which was first introduced in the mid-twentieth century. Instead of<br />
marking tones with a combination of diacritics and an infixed -h- for yang 陽 register
tones, superscripted numbers are used. Since it is often necessary to refer to the<br />
original romanization used in various sources, unidirectional conversion tables from<br />
those systems to the Yale system have been provided for reference. Besides the trivial<br />
<strong>orthographic</strong> differences arising from various schemes for depicting aspiration, vowel<br />
quality, and tones, there are also differences arising from a different phonology<br />
depicted in sources roughly prior to the mid-twentieth century, e.g., Williams’ (1856)<br />
A Tonic Dictionary of the Chinese Language in the Canton Dialect distinguishes<br />
between two sets of sibilants, ts-ts’-s versus ch-ch’-sh, whereas Lau’s (1977) A<br />
Practical Cantonese-English Dictionary does not. <strong>The</strong> conversion tables are arranged<br />
in terms of the Yale system, and phonemically-motivated differences reflected in older<br />
romanization systems which have now become merged are listed together, delimited<br />
by commas. Differences arising from idiosyncratic spelling practices not motivated<br />
by phonemic differences are also listed together, but delimited by slashes, e.g., the<br />
system used in Rao Bingcai 饒秉才, et al.’s (1996) Guangzhouhua fangyan cidian<br />
廣州話方言詞典 uses j-q-x after the high front vowels i and ü in imitation of Pinyin,<br />
but z-c-s elsewhere.<br />
17
Yale Williams Aubazac Meyer<br />
1856 1909 1947 1<br />
Yue Lau Rao<br />
1972 1977 1996<br />
b p p p p b b<br />
p p’ p’ p’ p’ p p<br />
m m m m m m m<br />
f f f f f f f<br />
d t t t t d d<br />
t t’ t’ t’ t’ t t<br />
n n n n n n n<br />
l l l l l l l<br />
j ts, ch ts, tch ts, ch ts j z/j<br />
ch ts’, ch’ ts’, tch’ ts’, ch’ ts’ ch c/q<br />
s s, sh s, sh s, sh s s s/x<br />
g k k k k g g<br />
k k’ k’ k’ k’ k k<br />
ng ng ng ng ŋ ng ng<br />
h h h h h h h<br />
gw kw kou kw kw gw gu<br />
kw kw’ k’ou kw’ kw’ kw ku<br />
w ∅/w ∅/w ∅/w ŭ w w<br />
y ∅/y ∅/y ∅/y ĭ y y<br />
∅ ∅ ∅ ∅ (ʔ) ∅ ∅<br />
Table 2.5: Romanization of Initials<br />
18
Yale Williams Aubazac Meyer Yue Lau Rao<br />
1856 1909 1947 1972 1977 1996<br />
a á a a A: a a<br />
aau áu áo aau A:ŭ aau ao<br />
aai ái ái aai A:ĭ aai ai<br />
aam ám ám aam A:m aam am<br />
aan án án aan A:n aan an<br />
aang áng áng aang A:ŋ aang ang<br />
aap áp áp aap A:p aap ab<br />
aat át át aat A:t aat ad<br />
aak ák ák aak A:k aak ag<br />
au au ao au ɐŭ au eo<br />
ai ai ai ai ɐĭ ai ei<br />
am am, òm am, om am, om ɐm am em<br />
an an an an ɐn an en<br />
ang ang ang ang ɐŋ ang eng<br />
ap ap, òp ap, op ap, op ɐp ap eb<br />
at at at at ɐt at ed<br />
ak ak ak ak ɐk ak eg<br />
e é é e ɛ: e é<br />
ei í i ei eĭ ei éi<br />
eng eng n/a eng ɛ:ŋ eng éng<br />
ek ek èk ek ɛ:k ek ég<br />
eu ù eu oeh œ: euh ê<br />
eui ui, ü eui, u ui øy̆ ui êu<br />
eun un eun un øn un ên<br />
eung éung eung eung œ:ŋ eung êng<br />
eut ut eut ut øt ut êd<br />
euk éuk euk euk œ:k euk êg<br />
Table 2.6: Romanization of Finals, Part I<br />
19
̩<br />
Yale Williams Aubazac Meyer Yue Lau Rao<br />
1856 1909 1947 1972 1977 1996<br />
i í, z’ i, z i, z i: i i<br />
iu iú iou iu i:ŭ iu iu<br />
im ím im im i:m im im<br />
in ín in in i:n in in<br />
ing íng ing ing ɪŋ ing ing<br />
ip íp ip ip i:p ip ib<br />
it ít it it i:t it id<br />
ik ik ek ik ɪk ik ig<br />
o o o oh ɔ: oh o<br />
ou ò ó o oŭ o ou<br />
oi oi oi oi ɔ:ĭ oi oi<br />
on on on on ɔ:n on on<br />
ong ong ong ong ɔ:ŋ ong ong<br />
ot ot ot ot ɔ:k ot od<br />
ok ok ok ok ɔ:k ok og<br />
u ú ou oo u: oo u<br />
ui úi oui ooi u:ĭ ooi ui<br />
un ún oun oon u:n oon un<br />
ung ung oung ung ʊŋ ung ung<br />
ut út out oot u:t oot ud<br />
uk uk ouk uk ʊk uk ug<br />
yu ü u ue y: ue ü<br />
yun ün un uen y:n uen ün<br />
yut üt ut uet y:t uet üd<br />
m ‘m m m m̩ m m<br />
ng ‘ng ng ng ŋ̩ ng ng<br />
Table 2.7: Romanization of Finals, Part II<br />
20
Chinese<br />
Name<br />
yinping<br />
陰平<br />
yinshang<br />
陰上<br />
yinqu<br />
陰去<br />
yangping<br />
陽平<br />
yangshang<br />
陽上<br />
yangqu<br />
陽去<br />
yinru<br />
陰入<br />
zhongru<br />
中入<br />
yangru<br />
陽入<br />
Yale 2 Williams<br />
1856 3<br />
Aubazac<br />
1909<br />
cá a 1<br />
a 1<br />
a 2 c á a 2<br />
a 3<br />
a 4<br />
á ɔ a 3<br />
21<br />
Meyer Yue<br />
1947 1972<br />
a A: 53<br />
á A: 35<br />
à A: 44<br />
cá a1 ā A: 21<br />
a 5 c á a2 ă A: 24<br />
a 6<br />
at 1<br />
at 3<br />
at 6<br />
2.3 Phonological Mergers<br />
á ɔ a3 â A: 33<br />
at ɔ<br />
at 4<br />
at A:t 5<br />
ato àt A:t 4<br />
atɔ at4 ât A:t 3<br />
Table 2.8: Romanization of Tones<br />
Lau<br />
1977<br />
a 1<br />
a 2<br />
a 3<br />
a 4<br />
a 5<br />
a 6<br />
at 1o<br />
Rao<br />
1996<br />
Numerous authors have observed that there are a number of variations in<br />
Cantonese pronunciation resulting in mergers. <strong>The</strong> ones relevant to the discussion are<br />
described below.<br />
2.3.1 Substitution of the Velar Nasal Initial ng- [ŋ-] for the Zero Initial<br />
According to Yue (1972: 89, 121fn12), the substitution of the velar nasal initial<br />
ng- [ŋ] for the zero initial in the speech of some speakers of Cantonese is perhaps due<br />
to the influence of the pronunciation of unspecified neighboring dialects, citing the<br />
example of Cantonese a [A:] versus dialectal [ŋA:] for 亞 ‘second’. Chao (1947: 21)<br />
at 3<br />
at 6<br />
a 1<br />
a 2<br />
a 3<br />
a 4<br />
a 5<br />
a 6<br />
at 1o<br />
at 3<br />
at 6
does not indicate a particular origin for this merger either, but quantifies it as<br />
happening to three-fourths of Cantonese speakers except in “interjections, particles,<br />
and the proper noun prefix Ah [阿], which begin with an open vowel for all types of<br />
speakers”, and in fact recommends this pronunciation, although use of the zero initial<br />
is not discouraged. This substitution was noted as early as the mid-nineteenth century<br />
by Williams (1856: 1), who comments that “words in a or á, are often heard beginning<br />
with ng, as in ngá, ngai, ngat”. He considered it part of a greater set of variations in<br />
pronunciation, which he characterizes as typical of several neighboring Yue dialects:<br />
All words having no initial consonant, are very liable to have a nasal ng<br />
or h prefixed to them, or to have the vowel altered. <strong>The</strong> people in<br />
Hiángshán [Xiangshan 香山, now Zhongshan 中山], Macao, and Sinngán,<br />
<strong>change</strong> many words in this way, so that if one does not see the<br />
character, he will look for it under h or ng. (xx)<br />
2.3.2 Substitution of the Zero Initial for the Velar Nasal Initial ng- [ŋ-]<br />
According to Yue (1972: 89, 121fn12), the substitution of the zero initial for<br />
the velar nasal initial ng- [ŋ] initial in the speech of some Cantonese speakers is<br />
perhaps due to the influence of the pronunciation of neighboring Panyu 番禺, another<br />
Yue dialect, citing the example of Cantonese nga [ŋA:] versus Panyu [A:] for 牙<br />
‘tooth’. Chao (1947: 18) does not posit a particular origin for this merger, but notes<br />
that there is a “minority” who does not have the velar nasal initial, and thus uses the<br />
majority zero initial pronunciation in his teaching. However, unlike the substitution of<br />
the velar nasal initial ng- [ŋ] for the zero initial, this variation is not attested in<br />
Williams (1856).<br />
22
2.3.3 Substitution of the Liquid Initial l- [l-] for the Nasal Initial n- [n-]<br />
According to Yue (1972: 89, 120fn11), the substitution of the liquid initial l-<br />
[l-] for the nasal initial n- [n-] is perhaps due to the influence of the pronunciation of<br />
neighboring Nanhai 南海, another Yue dialect. However, Yue notes that Whitaker<br />
(1952: 31) 4 considered it to be due to the influence of the pronunciation of either<br />
Swatow [Shantou 汕頭] or Hainan 海南, both Min dialects. Chao (1947: 18) does not<br />
posit a particular origin for this merger, but notes that one fourth of Cantonese<br />
speakers do not have the nasal initial. Williams (1856: xxi) does not comment on this<br />
merger, considering it part of a greater set of variations in pronunciation, but considers<br />
it secondary to the non-homorganic substitution with the labial nasal initial m- [m-]:<br />
<strong>The</strong> two initials l and m are frequently so inter<strong>change</strong>d in the mouths of<br />
some people, that one is much puzzled to distinguish them, and even n<br />
is altered too; as lám 南 for nám; mán 欄 for lán; lò 奴 for nò; &c. <strong>The</strong><br />
number of such words is not very great, and while the few who speak<br />
thus cannot discriminate the inital consonant before some vowels, they<br />
never inter<strong>change</strong> them before others.<br />
2.4 Phonological Distinctions<br />
Chao (1947: 18) observed that earlier sources on Cantonese made a number of<br />
distinctions that were only present in neighboring dialects or in older forms of the<br />
language. Yue (1972: 71-72) identifies them as being under the direct or indirect<br />
influence of Zhou Guanshan’s 周冠山 Fenyun cuoyao 分韻撮要 dictionary 5 , such as<br />
Samuel Wells Williams’ Tonic Dictionary of the Canton Dialect (1856), which bears<br />
the name of Zhou’s book in its Chinese title, Ying-Wa fenyun cuoyao 英華分韻撮要.<br />
However, Yue is unsure if these distinctions reflect Zhou’s own Shunde 順德<br />
23
pronunciation, a neighboring Yue dialect, and/or earlier pronunciation. <strong>The</strong><br />
distinctions relevant to the discussion are described below.<br />
2.4.1 Distinction Between the Vowels -o- [-ɔ-] and -a- [-ɐ-]<br />
Before Labial Finals -m [-m] and -p [-p]<br />
A number of earlier sources on Cantonese, including some of the sources used<br />
in this discussion (Williams 1856, Aubazac 1909, Meyer 1947, etc.), make a<br />
distinction between the vowels -o- [-ɔ-] and -a- [-ɐ-] before the labial finals -m [-m]<br />
and -p [-p]. Even the third edition of Bernard F. Meyer and <strong>The</strong>odore F. Wempe’s<br />
<strong>The</strong> Student’s Cantonese-English Dictionary (1947), which was the “most popular in<br />
current use” (Yue 1972: 687) almost three decades later, still included the -om [-ɔm]<br />
and -op [-ɔp] rimes despite not being present in contemporary Cantonese. This<br />
distinction is illustrated by 金 ‘gold’ and 甘 ‘sweet’, which are homophonously gam 1<br />
[kɐm 55 ] in contemporary Cantonese (table 2.9).<br />
Source 金 甘<br />
‘gold’ ‘sweet’<br />
Williams 1856<br />
Aubazac 1909<br />
ckam<br />
kam<br />
ckòm<br />
1<br />
kom 1<br />
Meyer 1947 kam kom<br />
Chao 1947 kam kam<br />
Yue 1972 kɐm 53<br />
kɐm 53<br />
Lau 1977 gam 1o<br />
gam 1<br />
Rao 1996 gem1 gem1 Yale gam 1<br />
gam 1<br />
Table 2.9: Vowels -o- [-ɔ-] and -a- [-ɐ-] Before Labial Finals -m [-m] and -p [-p]<br />
24
2.4.2 Distinction Between the Dental and Palatal Sibilant Initials<br />
[ts-]/[ts h -]/[s-] and [tʃ-]/[tʃ h -]/[ʃ-]<br />
Cantonese has a series of sibilant initials that vary between dental and palatal<br />
articulation depending on the speaker, from the dental articulation [ts-]/[ts h -]/[s-] of<br />
Yue (1972: 88) to the alveopalatal [tʃ-]/[tʃ h -]/[ʃ-] of Rao (1996: 267) to the palato-<br />
alveolar [tɕ-]/[tɕ h -]/[ɕ-] of Chao (1947: 28). According to Yue (1972: 88, 120fn8),<br />
these sibilants are often palatalized before high front vowels, which she herself does<br />
more often with the affricates than the fricative, and she cites D.C. Lau’s observation<br />
that male speakers have a greater tendency to palatalize. A semi-palatalized set, [tʃ-<br />
]/[tʃ h -]/[s-] (as in English “jaw”/“church”/“sand”), is described by Sidney Lau (1977:<br />
ix) without any particular constraints on what vowels they must precede, and this is<br />
also reflected in the Yale romanization system, which uses j-/ch-/s- for the sibilant<br />
initials.<br />
However, a number of earlier sources on Cantonese, including some of the<br />
sources used in this discussion (Williams 1856, Aubazac 1909, Meyer 1947, etc.),<br />
make a distinction between a dental and a palatal series of sibilants, e.g., Williams<br />
(1856: xxii) distinguishes [ts-]/[ts h -] and [s-] (as in English “ratsbane”/“wits” and<br />
“sea”/“yes”) from [tʃ-]/[tʃ h -] and [ʃ-] (as in English “church” and “shut”/“chaise”). This<br />
distinction is still made in Mandarin, where the palatal series roughly correspond to<br />
the retroflex initials in Mandarin, zh- [tʂ-], ch- [tʂ h -], and sh- [ʂ-], and are illustrated by<br />
the below three minimal or near-minimal pairs (table 2.10).<br />
25
Source 宗 中 村 春 笑<br />
‘ancestor’ ‘middle’ ‘village’ ‘spring’ ‘to laugh’<br />
Mandarin zōng zhōng cūn chūn xiào<br />
< *siào<br />
26<br />
少<br />
‘young’<br />
shào<br />
Williams 1856 cts’ung cch’ung cts’ün cch’un siúɔ shiúɔ Aubazac 1909 tsoung 1<br />
tchoung 1<br />
ts’un 1<br />
tch’eun 1<br />
siou 3<br />
shiou 3<br />
Meyer 1947 tsung chung ts’uen ch’un siù shiù<br />
Chao 1947 tzong cong tsön chön siw shiw<br />
Yue 1972 tsʊŋ 53<br />
tsʊŋ 53<br />
ts’øn 53<br />
ts’y:n 53<br />
siŭ 44<br />
siŭ 44<br />
Lau 1977 jung 1<br />
jung 1<br />
chuen 1o<br />
chun 1<br />
siu 3<br />
siu 3<br />
Rao 1996 zung 1<br />
zung 1<br />
qun 1<br />
cên 1<br />
xiu 3<br />
xiu 3<br />
Yale jung 1<br />
jung 1<br />
chyun 1<br />
cheun 1<br />
siu 3<br />
siu 3<br />
Table 2.10: Dental and Palatal Sibilant Initials [ts-]/[ts h -]/[s-] and [tʃ-]/[tʃ h -]/[ʃ-]<br />
On the other hand, Williams (1856) observed that the dental and palatal series<br />
of sibilant initials were not always distinguished, such as dental affricates [ts-]/[ts h -]<br />
often becoming palatalized to [tʃ-]/[tʃ h -], while in neighboring Yue dialects the palatal<br />
fricative [ʃ-] was often depalatalized to [s-]:<br />
<strong>The</strong> initials ch and ts are constantly confounded, and some persons are<br />
absolutely unable to detect the difference, more frequently calling the<br />
words under ts as ch, than contrariwise. All characters with the sounds<br />
tsz’ and ts’z’ are liable to be heard chí and ch’í, with a stronger<br />
breathing than those properly read chí and ch’í. (xx-xxi)<br />
<strong>The</strong> initial sh is called s along the coast; in the districts of Hiángshán<br />
[Xiangshan 香山], Sinning [Xinning 新寧] and Sinngán, this obtains to<br />
a very great extent; shui 水, shü 書, shuk 熟, sháng shing 省城, &c.<br />
&c., being heard sui, sü, suk, and sáng sing, as in the Tiéchiú<br />
[Chaozhou 潮州] and Amoy [Xiamen 夏門] dialects. <strong>The</strong> initial sh is a<br />
complete shibboleth to the people of those districts. West of Canton,<br />
many are found who <strong>change</strong> sz’ into sü, and a large part of the words<br />
beginning with s are <strong>change</strong>d into sh, just the opposite of the usage at<br />
Macao. (xxi)
2.5 Description of Rare Characters<br />
Unlike the characters in general use in modern standard written Chinese, which<br />
enjoy widespread typographic support, there has been less than sufficient support for<br />
rare characters used in specialized contexts. Yin and Rohsenow (1994: 80-82)<br />
identified ten categories of usages of specialized characters: 1) keji 科技, science and<br />
technology; 2) renming 人名, person names; 3) diming 地名, place names; 4) minzu<br />
民族 and zongjiao 宗教, ethnic minorities and religion; 5) hangye 行業, industry; 6)<br />
yiyin 譯音, transliteration; 7) fangyan 方言, dialects; 8) wenyan 文言 and gu hanyu<br />
古漢語, classical Chinese and ancient Chinese; 9) kouyu 口語, colloquial language;<br />
and 10) fei hanyu 非漢語, non-Chinese languages. <strong>The</strong> characters used in written<br />
Cantonese are the same as those used in modern standard written Chinese, but also<br />
include those that fall into Yin and Rohsenow’s fangyan (dialect) category. As written<br />
Cantonese often reflects the spoken language, characters from the kouyu (colloquial<br />
language) category are also used. Characters from the wenyan and gu hanyu (classical<br />
Chinese and ancient Chinese) category are also used, since Cantonese preserves some<br />
words and their characters that have become extinct in Mandarin and modern standard<br />
written Chinese. Occasionally, characters from the yiyin (transliteration) category are<br />
also used, for transliterating English and other foreign words.<br />
Like other groups who use specialized supersets of rare characters, there is<br />
always the issue that a necessary character is not available typographically, and the<br />
open-ended nature of characters precludes there ever being a complete remedy to this<br />
problem, especially for newly-coined characters. To allow for discussion, rare<br />
27
characters are described here using Ideographic Description Characters (IDC), which<br />
were originally introduced in the early 1990s (Unicode Consortium 2000: 268-271,<br />
565-566). IDCs are operators that take two or three following characters as operands,<br />
and describe a character as a combination of two or three component characters in<br />
various arrangements. This combination is called an Ideographic Description<br />
Sequence (IDS), and relies on the component characters being available<br />
typographically. <strong>The</strong> ten IDCs, including examples demonstrating their usage, are<br />
given below (table 2.11).<br />
IDC IDC Name Word Definition Character IDS<br />
� left to right míng bright 明 �日月<br />
� above to below jí lucky 吉 �士口<br />
� left to middle and right jiē street 街 �彳圭亍<br />
� above to middle and below jiù old 舊 �艹隹臼<br />
� full surround guó country 國 �囗或<br />
� surround from above wèn to ask 問 �門口<br />
� surround from below xiōng unlucky 凶 �凵乂<br />
� surround from left jiàng carpenter 匠 �匚斤<br />
� surround from upper left guǎng wide 廣 �广黃<br />
� surround from upper right qì air 氣 �气米<br />
� surround from lower left zhè this 這 �辶言<br />
� overlaid wū witch 巫 �从工<br />
Table 2.11: Ideographic Description Characters<br />
An IDS can also include other IDSs in lieu of component characters, to<br />
describe more complex characters. Each IDS is read from right to left, applying each<br />
operator to the two or three component characters to its right, and this process is<br />
repeated until a final mental image of the entire character is formed, e.g., the long IDS<br />
28
�火��木缶木冖�鬯彡 would assemble into the complex character wat 1 爩 ‘to<br />
smoke’. Examples of complex IDSs showing intermediate stages are given below.<br />
1) chú ‘cupboard’<br />
a) 木�广�壴寸<br />
b) 木�广尌 壴 and 寸 are arranged left to right to create 尌.<br />
c) �木廚 广 surrounds 尌 from the upper left to create 廚.<br />
d) 櫥 木 and 廚 are arranged left to right to create 櫥.<br />
2) fān ‘border’<br />
3) shān ‘fan’<br />
a) �艹�氵�釆田<br />
b) �艹�氵番 釆 and 田 are arranged above to below to create 番.<br />
c) �艹潘 氵 and 番 are arranged left to right to create 潘.<br />
e) 藩 艹 and 潘 are arranged above to below to create 藩.<br />
a) �火�戶�习习<br />
b) �火�戶羽 习 and 习 are arranged left to right to create 羽.<br />
c) �火扇 戶 surrounds 羽 from the upper left corner to create 扇.<br />
d) 煽 火 and 扇 are arranged left to right to create 煽.<br />
4) zhào ‘to shine’<br />
a) ��日月�穴工<br />
b) ��日月空 穴 and 工 are arranged from above to below to create<br />
空.<br />
c) �明空 日 and 月 are arranged from left to right to create 明.<br />
d) 曌 明 and 空 are arranged from above to below to create<br />
曌.<br />
2.6 Unicode Codepoint<br />
Where known, the Unicode codepoint for each character, which is expressed as<br />
“U+” followed by a four or five digit hexadecimal number, has been provided for<br />
reference. It is envisioned that this information can serve as a unique identifier for<br />
cross-referencing other dictionaries or to provide a means for inputting rare Cantonese<br />
29
dialect characters. Since newer versions of Unicode (table 2.12) support more<br />
Chinese characters (table 2.13), one may also assess based on which block a<br />
character’s codepoint falls in whether it is supported on one’s equipment. <strong>The</strong><br />
Unicode codepoint will also facilitate future replacement of characters temporarily<br />
described here by IDSs with a proper representation.<br />
Year Unicode ISO 10646 URO ExtA ExtB Han<br />
Version Version<br />
Characters<br />
1993 Unicode 1.1 ISO 10646-1: 1993 ✓ 20,902<br />
1996 Unicode 2.0 ISO 10646-1: 1993<br />
plus amendments<br />
✓<br />
20,902<br />
1998 Unicode 2.1 ISO 10646-1: 1993<br />
plus amendments<br />
✓<br />
20,902<br />
2000 Unicode 3.0 ISO 10646-1: 2000 ✓ ✓ 27,484<br />
2001 Unicode 3.1 ISO 10646-2: 2001 ✓ ✓ ✓ 70,195<br />
Table 2.12: Unicode Versions<br />
Block Codepoint Range Han<br />
Characters<br />
CJK Unified Ideographs (URO) U+4E00 to U+9FA5 20,902<br />
CJK Unified Ideographs Extension A (ExtA) U+3400 to U+4DB5 6,582<br />
CJK Unified Ideographs Extension B (ExtB) U+20000 to U+2A6D6 42,711<br />
Table 2.13: Unicode Blocks<br />
30
Endnotes<br />
1 O’Melia (1959) also uses the Meyer-Wempe system.<br />
2<br />
This is a modified version of Yale. In actual Yale romanization, these would be: à,<br />
á, a, àh, áh, ah, āt, at, and aht.<br />
3 Williams (1856) does not distinguish between the yinru 陰入 and zhongru 中入<br />
tones.<br />
4 Whitaker, Katherine P.K. 1952. “Characterization of the Cantonese Dialect with<br />
Special Reference to its Modified Tones”. Ph.D. dissertation. London: <strong>University</strong> of<br />
London.<br />
5<br />
See Yue (1972: 84fn5) for a description of the differences between the Fenyun<br />
cuoyao 分韻撮要 and contemporary Cantonese.<br />
31
CHAPTER 3<br />
PROJECT<br />
When using earlier Cantonese sources such as the third edition of Bernard F.<br />
Meyer and <strong>The</strong>odore F. Wempe’s <strong>The</strong> Student’s Cantonese-English Dictionary<br />
(1947), one is struck by the drastically different and sometimes unrecognizable<br />
characters used for some words, such as 倃 for gau 6 ‘lump’, 唨 for jo 2 , the perfective<br />
aspect particle, and 蹘 for mau 1 ‘to squat’. Less than three decades later, while that<br />
dictionary was still the “most popular in current use” (Yue 1972: 687), the same three<br />
words were written with 嚿, 咗, and 踎, respectively. Further investigation reveals<br />
that the different forms given in different sources is not necessarily the idiosyncrasy of<br />
each author, but part of an ongoing optimization process of refining the written form<br />
by changing and replacing characters. Unlike characters which have undergone<br />
prescriptive script reform, these optimizations are driven by a populace who finds the<br />
characters currently used to be insufficient and hence creates superior ones to<br />
supersede them. Even today, among contemporary sources such as newspapers,<br />
advertisements, popular fiction, comics, and personal letters, there is considerable<br />
variation in the written forms used, suggesting that their usage is primarily driven by a<br />
populace without reference to “authoritative” sources such as contemporary<br />
dictionaries, or the research of scholars of the benzikao 本字考 school who seek to<br />
32
discover the now-forgotten etymologically “correct” character attested in writings<br />
from antiquity. This study seeks to understand the motivations behind the <strong>change</strong>s<br />
that eventually weeds out the less preferable written forms.<br />
3.1 Requirements for Sources of Cantonese Dialect Characters<br />
Although older sources such as Robert Morrison’s three-part dictionary and<br />
primer, A Vocabulary of the Canton Dialect (1828), were available, they were<br />
disqualified because they did not meet certain criteria. Since it is impossible to<br />
exhaustively sift through every extant work with Cantonese dialect characters,<br />
dictionaries were taken to be representative of the usage of their era. As Williams<br />
(1856: xiii) testifies,<br />
<strong>The</strong> best course to adopt respecting the colloquial words found in this<br />
dialect, has been a matter of considerable perplexity in the preparation<br />
of this Dictionary. <strong>The</strong>re being so many modes to express them, it was<br />
concluded to follow that plan for each character, which seemed to be<br />
the best understood among the people.<br />
However, he also warns,<br />
<strong>The</strong> student must not however place much dependence on many of the<br />
characters employed to denote these unwritten sounds, for they are not<br />
uniformly represented, and other persons would perhaps choose<br />
different characters. (xiii)<br />
Furthermore, as multiple dialect characters may be used for a particular word<br />
even within a single work, there needs to be a way to expediently identify dialect<br />
characters. For this reason, besides disqualifying non-dictionary works, only<br />
dictionaries that are arranged or indexed by the characters used were considered,<br />
where the number of places where the characters used to write a particular word can<br />
be found is kept to a minimum. Most dictionaries of this type are called zidian 字典,<br />
33
and unless they are arranged inappropriately with no regard for the orthography, most<br />
cidian 詞典, dictionaries of compounds, can also be used. Dictionaries that are<br />
arranged by unhelpful orders were excluded, such as bilingual English-Chinese<br />
dictionaries, which are arranged according to an English translation, the exact wording<br />
differing from dictionary to dictionary.<br />
Additionally, it is also important to have both the pronunciation and definition<br />
available, so that a character can be reliably identified as being used to write a<br />
particular word. Without the pronunciation, homonyms cannot be distinguished, e.g.,<br />
車 can be used to write che 1 ‘car’ as well as geui 1 , a surname, whereas without a<br />
definition, homophones cannot be distinguished, e.g., 卡 can be used to write both ka 1<br />
‘card’ and ka 1 ‘calorie’.<br />
Given the sources that were available and the above criteria, the earliest source<br />
that was not disqualified was Samuel Wells Williams’ A Tonic Dictionary of the<br />
Chinese Language in the Canton Dialect of 1856. Earlier sources such as Robert<br />
Morrison’s A Vocabulary of the Canton Dialect of 1828 includes a Chinese-English<br />
dictionary that makes use of characters, but the pronunciation is ambiguously<br />
indicated by a romanization system that does not mark tones nor aspiration.<br />
3.2 Overview of Sources Used<br />
<strong>The</strong> sources used may be divided into five chronological periods: 1) the mid-<br />
nineteenth century, represented by Williams (1856); 2) the late nineteenth century,<br />
represented by Williams (1909 [1874]); 3) the early twentieth century, represented by<br />
Aubazac (1909); 4) the mid-twentieth century represented by: 3) Meyer (1947) and<br />
34
O’Melia (1959); and 5) the late twentieth century, represented by: Yue (1972); Lau<br />
(1977), and Rao (1996).<br />
Samuel Wells Williams published A Tonic Dictionary of the Chinese<br />
Language in the Canton Dialect in Canton (Guangzhou) in 1856, which contains over<br />
7850 characters, and A Syllabic Dictionary of the Chinese Language Arranged<br />
According to the Wu-Fang Yüan Yin [五方元音] in Beijing in 1874, which contains<br />
12,527 characters. Unlike the earlier dictionary, the latter dictionary was not restricted<br />
to Cantonese, and also covered Mandarin, Fuzhou, and Shanghai usages. Williams<br />
later turned it over to the North China Union College, who rearranged the entries and<br />
published it in 1909 with the title extended to include the phrase “and Alphabetically<br />
Rearranged According to the Romanization of Sir Thomas F. Wade”. However, the<br />
contents are essentially identical to those in the 1874 edition, and this source has been<br />
treated as such.<br />
Williams’ earlier dictionary uses the prose tag “colloquial word”, which refers<br />
to a word or sense of a word that is used only in colloquial language. In many cases,<br />
when the tag “colloquial word” is used and there are no non-colloquial senses of the<br />
word, the character used to write it is a Cantonese dialect character. However, it is<br />
unclear what criteria is used to make this determination, and the results do not always<br />
correspond to what one would consider Cantonese dialect characters, such as mā 媽<br />
under the (Cantonese) “má” section:<br />
c媽 A colloquial word; a nurse; c nái cmá [奶媽], a wet nurse; ckon cmá<br />
[乾媽], a nurse; csho ct’au cma, a tiring woman; chapɔ cmá [執媽], a<br />
midwife; cmá cmá [媽媽], mother, mamma; ckú cmá [姑媽], aunt, aunty.<br />
(1856: 269)<br />
35
On the other hand, Williams’ later dictionary uses the prose tag “in<br />
Cantonese”, which refers to words or senses of words that are used only in Cantonese,<br />
which often correlates with Cantonese dialect characters. Williams’ two dictionaries<br />
also use the prose tag “unauthorized”, which is defined in the later dictionary as<br />
characters which do not appear in the Kangxi zidian 康熙字典 dictionary (1716).<br />
However, this term also applies to characters that have been created in the century and<br />
a half since it was written, which are not necessarily Cantonese dialect characters,<br />
such as shuāi 甩 under the (Mandarin) “shuai” section:<br />
c甩 An unauthorized character, used for 丟 to discard. To throw away,<br />
as worthless; to discard, to reject. 丨脫 throw it away. 丨拉外頭 throw<br />
it outside. 事丨不開 I cannot leave this work. 丨磚打人 to throw a<br />
brick at a man. 丨瓦 to toss tiles up. (1909 [1874]: 718)<br />
Louis Aubazac, described as a “missionnaire au Kouangtong” (missionary in<br />
Guangdong), published Liste des Caractères les Plus Usuels de la Langue<br />
Cantonnaise (List of the most ordinary characters in the Cantonese tongue) in Hong<br />
Kong in 1909, a pronunciation-sorted glossary of over 1800 characters. However, no<br />
additional information is provided in the pamphlet.<br />
Bernard F. Meyer and <strong>The</strong>odore F. Wempe of Maryknoll first published <strong>The</strong><br />
Student’s Cantonese-English Dictionary in 1935 in Hong Kong, which contains about<br />
10,000 characters. <strong>The</strong> edition used here is the third edition, published in New York<br />
in 1947. Although promising tags are defined in the “Explanatory Notes” section such<br />
as “Coll.” (colloquial) and “Ca.” (Cantonese), they do not appear to be used<br />
36
productively, despite the large number of Cantonese dialect characters included in<br />
their dictionary.<br />
Thomas A. O’Melia of Maryknoll first published the four-part First-Year<br />
Cantonese, a textbook, in 1938 in Hong Kong. <strong>The</strong> edition used here is the fourth<br />
edition, published in Hong Kong in 1959. Despite being a textbook, part four,<br />
“Random Idioms and Notes Arranged Alphabetically” contains a small dictionary by<br />
F.C. Dietz, described as the “first Director of the Language School”. However, dialect<br />
characters are not marked.<br />
Oi-Kan Yue Hashimoto published Phonology of Cantonese in 1972, the first<br />
volume of the proposed “Studies in the Yue Dialects” series. Chapter 4, “Syllabary<br />
Arranged According to Cantonese Sounds” (202-398) does not intend to be an<br />
exhaustive listing, but purports to include characters wherever possible, although in<br />
reality some words are not given written forms, even though there are characters for<br />
them in Meyer (1947), which is one of the sources that Yue consulted (203). Yue<br />
explains that “characters particular to Cantonese and colloquial forms for which no<br />
characters are designed” (202) are identified with an English gloss within parentheses,<br />
but onomatopaeic syllables are only given an English gloss and transcription within<br />
brackets. Loanwords are glossed in their source language, and where it exists, a<br />
character within parentheses.<br />
Sidney Lau 劉錫祥, the author of a series of Cantonese textbooks (Elementary<br />
Cantonese, Intermediate Cantonese, and Advanced Cantonese) for the Hong Kong<br />
government, published A Practical Cantonese-English Dictionary in 1977, in lieu of<br />
producing the companion glossary volume for Advanced Cantonese to parallel the<br />
37
ones previously written for the elementary and intermediate levels. It contains over<br />
3,600 characters, and marks some with the tag “CC” (Cantonese Character), although<br />
there is no explanation of what that is intended to mean, as there are also some dialect<br />
characters marked with only the tag “Coll.” (Colloquial), or with both tags, as well as<br />
non-dialect characters marked with the tag “Coll”.<br />
Rao Bingcai 饒秉才, Ouyang Jueya 歐陽覺亞, and Zhou Wuji 周無忌, three<br />
mainland Chinese authors, published their dictionary, Guangzhouhua fangyan cidian<br />
廣州話方言詞典, in Hong Kong in 1996. Although dialect characters are not marked<br />
in the body of the dictionary, there is an appendix (377-380) called “Guangzhouhua<br />
teshu zibiao” 廣州話特殊字表 (Table of Characters Specific to Cantonese) which<br />
lists characters in the dictionary that fall into one of three categories: 1) dialect<br />
characters frequently used in Cantonese, 2) characters borrowed to represent<br />
Cantonese-specific words, and 3) ancient characters that are not included in the<br />
Xinhua zidian 新華字典 dictionary used in mainland China.<br />
3.3 Characters Selected for Study<br />
Since most sources do not adequately mark Cantonese dialect characters as<br />
such, it was decided to adopt the definition and list of characters given by Rao (1996:<br />
377-380) as a data set of Cantonese dialect characters. However, that list was soon<br />
found to include obscure words that were not familiar to contemporary Cantonese<br />
speakers and were not attested in other sources. In the interest of working with<br />
familiar words in contemporary use whose written form could be compared to those<br />
used in other sources, only words in that list that were also found in Lau (1977), Yue<br />
38
(1972), and Meyer (1947) were retained. That left words that were certainly in use in<br />
the past half century, representing one-third of the time period covered by the study,<br />
and used commonly enough to be included in four out of the seven sources used.<br />
However, four words (and their characters) which did not meet that requirement<br />
(cheun 1 ‘animal egg’, gau 6 ‘lump’, hong 6 ‘young hen’, and lau 1 ‘coat’) were also<br />
included as exceptions because they demonstrated an important point.<br />
<strong>The</strong>re were 116 words total, of which 113 were monosyllabic, two disyllabic<br />
(gaat 6 jaat 6 ‘cockroach’ and ngau 6 dau 6 ‘unwell; stupid’), and one trisyllabic<br />
(ham 6 baang 6 laang 6 ‘all’). <strong>The</strong>re were 266 unique characters, of which seven (冚, 冧,<br />
嘥, 奀, 徙, 揼, and �貝子) were used to write two different words, while one<br />
character (泵) was used to write three different words.<br />
3.4 Traditional Model of Character Construction and Usage Principles<br />
Nearly every work that has a discussion of Chinese writing includes an<br />
obligatory explanation of the liushu 六書, the traditional model of the six principles of<br />
constructing and using Chinese characters. <strong>The</strong> standard version of the liushu is given<br />
in juan 卷 15A of Xu Shen’s 許慎 Shuowen jiezi 說文解字 (AD 100), which gives a<br />
terse definition and two examples for each principle. Due to the numerous and<br />
sometimes disputed interpretations of each of the liushu principles, this model will be<br />
presented merely for reference.<br />
Xiangxing 象形 1 , typically translated as ‘pictographs’, is the second of the<br />
liushu principles explained in the Shuowen jiezi 說文解字. Characters constructed<br />
according to the xiangxing principle were originally depictions of concrete objects,<br />
39
such as rì 日 ‘sun’ and yuè 月 ‘moon’, which are no longer as transparent in the<br />
streamlined contemporary orthography.<br />
Zhishi 指事 2 , typically translated as ‘symbols’ or ‘ideographs’, is actually the<br />
first of the liushu principles. Characters constructed according to the zhishi principle<br />
were originally indications of abstract concepts, such as shàng 上 ‘above’ and xià 下<br />
‘below’, where additional marks have been placed above or below a horizontal line.<br />
Huiyi 會意 3 , typically translated as ‘compound ideographs’, is the fourth of the<br />
liushu principles. Characters constructed according to the huiyi principle combine two<br />
or more other characters together to suggest a new meaning, such as wǔ 武 ‘military’<br />
composed of zhǐ ‘to stop’ and gē 戈 ‘dagger-axe’, suggesting the stopping of weapons;<br />
and xìn 信 ‘trust’ composed of rén 亻(人) ‘person’ and yán 言 ‘speech’, suggesting a<br />
person’s words.<br />
Xingsheng 形聲 4 , typically translated as ‘phonetic compounds’, is the third of<br />
the liushu principles. Characters constructed according to the xingsheng principle<br />
combine two characters together, where one signifies its general meaning while the<br />
other is used in rebus fashion for its phonetic value, e.g., jiāng 江 and hé 河, which<br />
both mean ‘river’, are composed of shuǐ 氵(水) ‘water’ for the signific and gōng 工<br />
‘work’ or kě 可 ‘able’ for the phonetic, respectively.<br />
Zhuanzhu 轉注 5 , the fifth of the liushu principles, is an ill-defined principle,<br />
but apparently involves some semantic and graphic connection between two<br />
characters. <strong>The</strong> standard example is kǎo 考 ‘old’ 6 (now, ‘to test’) and lǎo 老 ‘old’ 7 .<br />
40
Jiajie 假借 8 , typically translated as ‘loan characters’, is the last liushu<br />
principle. Characters used according to the jiajie principle involve the rebus use of<br />
another character for its phonetic value, such as the character 令, which is usually used<br />
to write lìng ‘command’, borrowed to write liáng ‘good’, which is now written as 良<br />
(Boltz 1996: 197).<br />
3.5 Modified Model of Character Construction and Usage Principles<br />
Since the traditional liushu model of character construction and usage<br />
principles is often insufficiently defined, instead of imposing an interpretation on it, a<br />
modified model is used in this discussion, consisting of four principles: co-signific<br />
characters, semantic loans, phonetic loans, and signific-phonetic characters. Co-<br />
signific characters may be equated with the huiyi 會意 ‘compound ideographs’<br />
principle in the traditional liushu 六書 model, while phonetic loans and signific-<br />
phonetic characters may be roughly equated with the jiajie 假借 ‘loan characters’ and<br />
xingsheng 形聲 ‘phonetic compounds’ principles, although the actual distinction<br />
between the two principles may not necessarily be the same in both models. Semantic<br />
loans have no discernable analogue in the traditional model, although they may be<br />
aligned with the zhuanzhu 轉注 principle given certain interpretations of the latter, but<br />
this will not be attempted here. In other words, no claims are made about the<br />
inter<strong>change</strong>ability of the two models, and the names of the principles used in the<br />
modified model are not intended and should not be regarded as translations of those in<br />
the traditional model, and vice versa.<br />
41
Traditional Typical Translation Traditional Modified Model<br />
Model<br />
Examples<br />
象形 xiangxing pictographs 日月 no equivalent<br />
指事 zhishi symbols/ideographs 上下 no equivalent<br />
會意 huiyi compound ideographs 武信 co-signific<br />
形聲 xingsheng phonetic compounds 江河 signific-phonetic<br />
轉注 zhuanzhu varies 考老 no equivalent<br />
假借 jiajie loan characters 令長 phonetic loans<br />
no equivalent semantic loans<br />
Table 3.1: Comparison of Principles in the Traditional and Modified Models<br />
In the modified model, there are no equivalents to the xiangxing 象形<br />
‘pictographs’ and zhishi 指事 ‘symbols’/’ideographs’ principles in the traditional<br />
model, since the latter have long ceased to be productive principles 9 , and were not<br />
used to construct any of the Cantonese dialect characters in the data set. Furthermore,<br />
characters which could not be clearly classified into one of the four principles in the<br />
modified model have been placed into a category for indeterminate cases.<br />
In the following three chapters, each of the character construction and usage<br />
principles will be discussed along with examples from the data set, beginning with co-<br />
signific characters and semantic loans in chapter 4, phonetic loans in chapter 5, and<br />
signific-phonetic characters in chapter 6. Characters constructed according to<br />
indeterminate principles, which are relatively few in number, will be discussed at the<br />
end of chapter 4.<br />
It should be noted that in the following chapters, the word “create” will be used<br />
as a cover term to refer to the “creation” of a character in the sense that is first being<br />
constructed or used according to a particular principle to write a word in the data set<br />
42
within the sources used in this study. It does not intend to claim that the character did<br />
not exist earlier, since a phonologically, semantically, and/or graphically similar form<br />
may in some cases be attested in older works 10 .<br />
43
Endnotes<br />
1 象形者,畫成其物,隨體詰詘,日月是也。<br />
2 指事者,視而可識,察而見意,上下是也。<br />
3 會意者,比類合誼,以見指撝,武信是也。<br />
4 形聲者,以事為名,取譬相成,江河是也。<br />
5 轉注者,建類一首,同意相受,考老是也。<br />
6 老也,从老省,丂聲。(juan 8A)<br />
7 考也,七十曰老,从人毛匕,言須髮變白也,凡老之屬皆从老。(juan 8A)<br />
8 假借者,本無其字,依聲託事,令長是也。<br />
9 According to data summarized by DeFrancis (1984: 84), the xiangxing 象形<br />
‘pictographic’ principle had dwindled from the 23% of the Shang 商 dynasty to 4% in<br />
the Shuowen jiezi 說文解字 (AD 100), and later to 3% by the twelfth century; while<br />
the zhishi 指事 ‘simple indicative’ principle, which had never been a large category,<br />
had dwindled from the 2% in the Shang dynasty to 1% in the Shuowen jiezi.<br />
10 I thank Professor Jianqi Wang for this observation. In particular, the character 褸,<br />
which was used as early as the 1940s (Meyer 1947) for the word lau 1 ‘coat’, is attested<br />
in the Kangxi zidian 康熙字典 dictionary (1716: 1123). <strong>The</strong> pronunciation *lau<br />
落侯切, 良侯切丛音樓 is given with the meaning 衣襟 ‘front of a garment’, while the<br />
pronunciation *lyu 力主切 is given with the meaning 衣壞也 ‘threadbare clothes’. It<br />
is also possible that the character 褸 may have been adopted for lau 1 ‘coat’ by<br />
someone who had seen it before, with no claims to cognacy.<br />
44
4.1 Co-Signific Characters<br />
CHAPTER 4<br />
CO-SIGNIFIC CHARACTERS, SEMANTIC LOANS,<br />
AND INDETERMINATE CASES<br />
Co-signific characters, which are the analogue of the huiyi 會意 ‘compound<br />
ideographs’ principle in the traditional liushu 六書 model, are characters which<br />
combine two or more other characters together to suggest a new meaning, such as<br />
laai 1 ‘last (child)’ 1 , which is written with 孻. According to Williams (1909 [1874]:<br />
493), 孻 is composed of significs ji 2 子 ‘child’ and jeun 6 盡 ‘to finish’, a reference to<br />
the last child of an old man, and generalized to mean ‘last child’ and ‘last’.<br />
A co-signific character may explicitly spell out a synonym or a description of<br />
their meaning, rather than vaguely alluding to or suggesting their meaning. <strong>The</strong><br />
relationship between each of the significs is clear, as they can be joined together<br />
linguistically. For example, sū 甦 ‘to revive’ is composed of two co-significs, gèng 更<br />
‘even more’ and shēng 生 ‘life’, which spell out the synonym gēngshēng 更生 ‘to<br />
revive’, while béng 甭 ‘no need’ is composed of two co-significs, bú 不 ‘no’ and yòng<br />
用 ‘need’, which spell out the phrase búyòng 不用 ‘no need’ of which it is a<br />
contraction. Others merely describe their meaning, such as wāi 歪 ‘crooked’, which is<br />
composed of two co-significs bú 不 ‘no’ and zhèng 正 ‘straight’, which spell out the<br />
45
phrase búzhèng 不正 ‘not straight’, while rì 氜 ‘helium’ is composed of two co-<br />
significs, rì 日 ‘sun’ and qì 气(氣) ‘gas’, which spell out the phrase rìqì 日氣 ‘sun<br />
gas’, a reference to where helium was first discovered.<br />
cheun 1 ‘animal egg’ 2 is written with �末�成肉 or 膥, composed of significs<br />
mei 6 未 ‘not yet’, sing 4 成 ‘to become’, and yuk 6 肉 ‘flesh’, which spells out the<br />
descriptive phrase mei 6 sing 4 yuk 6 未成肉 ‘not yet become flesh’, a reference to the<br />
undeveloped state of an egg. �末�成肉 or 膥 differ only in that the former has the<br />
positions of the components rearranged so that the yuk 6 肉 ‘flesh’ signific is less<br />
prominent, occupying only the lower right quarter of the character, rather than the<br />
lower half. Similarly, ngan 1 ‘tiny’ 3 is written with 奀, which is composed of significs<br />
bat 1 不 ‘not’ and daai 6 大 ‘large’, which spells out the descriptive phrase bat 1 daai 6<br />
不大 ‘not large’.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
cheun1 animal<br />
egg<br />
laai1 last<br />
(child)<br />
�末�<br />
成肉<br />
✓ ✓<br />
U+81A5 膥 ✓ ✓ ✓<br />
U+6625 春 ✓<br />
U+5B7B 孻 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
ngan1 tiny U+5940 奀 ✓ ✓ ✓ ✓ ✓ ✓<br />
Table 4.1: Co-Signific Characters<br />
A co-signific character can be optimized, such as me 1 ‘to carry on the back’ 4 ,<br />
which was first written with �貝子, with the bui 3 貝 ‘cowrie’ signific positioned on<br />
46
the left and the ji 2 子 ‘child’ signific on the right. According to Williams (1909<br />
[1874]: 571), 貝 may actually be bui 3 背 ‘back’, a reference to the carrying of a child<br />
on the back, and me 1 ‘to carry on the back’ was written with it up to at least the 1950s<br />
(O’Melia 1959). However, by the 1940s (Meyer 1947), the positions of the significs<br />
had already been rearranged to create 孭, so that the ji 2 子 ‘child’ signific would<br />
occupy the left half of the character, suggesting that it is the preferred positioning for<br />
the signific that indicates the general meaning of the character. Even if bui 3 貝<br />
‘cowrie’ were actually an abbreviated form of bui 3 背 ‘back’, the general meaning of<br />
the character has more to do with matters related to children than money (cowries).<br />
Although Rao (1996) also lists the older �貝子 form, it is otherwise not attested in<br />
sources later than the 1950s, suggesting that it was included just for completeness.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
me1 to carry<br />
on the<br />
back<br />
U+27D2F �貝子 ✓ ✓ ✓ ✓ ✓ ✓<br />
4.2 Semantic Loans<br />
U+5B6D 孭 ✓ ✓ ✓ ✓<br />
Table 4.2: Optimization of a Co-Signific Character<br />
Semantic loans, which have no discernable analogue in the traditional liushu<br />
六書 model, are characters which have been borrowed for their identical or similar<br />
meaning. <strong>The</strong> concept of semantic loans was recognized by Williams (1856) as a<br />
device for writing Cantonese, who says:<br />
47
… characters having nearly the same meaning as the colloquial word,<br />
but of an entirely different sound, are adopted, so that even if the reader<br />
does not know the vulgar sound he will make no mistake as to the<br />
sense. Thus, the words chung 烘 to roast, used for cnung, to scorch, to<br />
scowl; chung 孔 a hole, used for clung; are instances of this mode of<br />
adaption. (xiii).<br />
That is, nung 1 ‘to scorch’ is a semantic loan of hung 4 烘 ‘to toast’, which is near-<br />
synonymous, while lung 1 ‘hole’ is a semantic loan of hung 2 孔 ‘hole’, which is<br />
completely synonymous.<br />
<strong>The</strong> device of semantic loans was frequently employed, such that it is difficult<br />
to discern the actual identity of words written in this manner without the aid of a<br />
parallel transcription. For unexplained reasons, Chalmers, in a note originally<br />
introduced in the fourth edition of his An English and Cantonese Dictionary, quoted<br />
here from the fifth edition (1878), explains:<br />
<strong>The</strong> common characters 唔 ‘m, 嘅 ke`, and 冇 mo’, which are<br />
unauthorized and local, have been in most cases replaced by their<br />
classic equivalents, 不, 之, and 無 while the colloquial sounds have<br />
been retained. (viii)<br />
Apparently, the replacement of the characters for the negative m 4 唔, the genitive<br />
particle ge 3 嘅, and mou 5 冇 ‘to not have’ was not motivated by practical concerns<br />
such as typography, as they do appear in the note itself, as well as throughout the<br />
dictionary, such as:<br />
Neither … nor, 不是——又不是 pat-shi`—yau`-pat-shi`; 唔係——<br />
又唔係 ‘m-hai`—yau`-‘m-hai`, ——都唔係 too-‘m-hai`. (146)<br />
Disagree, 不對 ‘m-tui`, 唔啱 ‘m-ngaam, 相爭 seung-chaang. (61)<br />
In the definition of “neither … nor”, the distinction between the characters for the<br />
negative m 4 唔 and its “classic” synonym, bat 1 不, have been retained. However, this<br />
48
is not the case in the definition of “disagree”, where the negative m 4 is written with<br />
both characters. Fortunately, the transcription in romanization of the first Chinese<br />
definition, ‘m-tui`, shows that it uses m 4 rather than bat 1 , as the characters 不對 alone<br />
would otherwise indicate.<br />
ma 1 孖 ‘twin’ 5 , me 2 歪 ‘crooked’ 6 , and pok 1 泡 ‘blister’ 7 are semantic loans 8 of<br />
the completely synonymous words ji 1 孖 ‘twin’, wai 1 歪 ‘crooked’, and pou 5 泡<br />
‘blister’, respectively. On the other hand, dau 3 竇 ‘den; nest’ 9 , also pronounced dau 6 ,<br />
and lit 3 纈 ‘knot’ 10 are semantic loans of the semi-synonymous words dau 3 竇 ‘hole’<br />
and kit 3 纈 ‘to tie up silk for dyeing’, respectively. Meanwhile, mou 5 冇 ‘to not<br />
have’ 11 is a semantic loan of sorts of its antonym, yau 5 有 ‘to have’, less the two center<br />
strokes.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
dau3 den; nest U+7AC7 竇 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
�口兜 ✓<br />
lit3 knot U+7E88 纈 ✓ ✓ ✓ ✓ ✓ ✓<br />
ma1 twin U+5B56 孖 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
me2 crooked U+6B6A 歪 ✓ ✓ ✓ ✓ ✓ ✓<br />
mou5 to not U+5187 冇 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
have<br />
pok1 blister U+6CE1 泡 ✓ ✓ ✓<br />
U+2688A �月暴 ✓<br />
Table 4.3: Semantic Loans (History)<br />
49
Word Gloss Unicode Char Semantic Loan of Char Gloss<br />
dau3 den; nest U+7AC7 竇 dau3 竇 hole<br />
lit3 knot U+7E88 纈 kit3 纈<br />
ma1 twin U+5B56 孖 ji1 孖 twin<br />
me2 crooked U+6B6A 歪 waai1 歪 crooked<br />
mou5 to not have U+5187 冇 yau5 ㈲<br />
50<br />
to tie up silk for dyeing<br />
to have<br />
pok1 blister U+6CE1 泡 pou5 泡 blister<br />
4.3 Indeterminate Cases<br />
Table 4.4: Semantic Loans (Basis)<br />
A number of cases which could not be clearly classified as a co-signific<br />
character, semantic loan, phonetic loan, or signific-phonetic character given the<br />
available information are covered here.<br />
laam 2 ‘olive’ 12 is written with 欖, which is the standard character for the word.<br />
Rao (1996) also lists 杬, but 杬 does not appear to have ever been used to mean ‘olive’<br />
(HYDZD 2: 1164), and there is no similarity with its phonetic, yun 4 元 ‘first’.<br />
lung 5 ‘trunk’ 13 is written with 槓, which Williams (1909 [1874]: 432) suggests<br />
is altered from lung 4 籠 ‘cage’, without further explanation. Apparently, the gung 3 貢<br />
‘to contribute’ phonetic is being treated as a lung 5 phonetic. Meyer (1947) also lists<br />
篢, as well as Yue (1972), who only lists 篢, but it is also unclear how 篢 is<br />
constructed, although it also uses 貢 as a lung 5 phonetic.<br />
nap 6 ‘sticky’ 14 is written with 湆, which the Shuowen jiezi 說文解字 (AD 100;<br />
HYDZD 3: 1684) suggests is a signific-phonetic character with yam 1 音 ‘sound’ as a
phonetic, but the only similarity is the labial place of articulation of the final<br />
consonant. Other sources such as the Guangyun 廣韻 (AD 1011; HYDZD 3: 1684)<br />
and the Jiyun 集韻 (AD 1067; Kangxi zidian 1716: 636) give a pronunciation that<br />
does ends with a labial stop, but with a velar place of articulation for the initial<br />
consonant rather than a dental one. However, they were all used to write a<br />
semantically different word meaning ‘damp’. Rao (1996) instead lists �氵�囗又,<br />
which is attested in the Kangxi zidian 康熙字典 (1716: 615) as a phonologically<br />
similar but semantically different word, ‘watery’.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
laam2 olive U+6B16 欖 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
U+676C 杬 ✓<br />
lung5 trunk U+69D3 槓 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
U+7BE2 篢 ✓ ✓<br />
nap6 sticky U+6E46 湆 ✓ ✓ ✓ ✓ ✓ ✓<br />
po1 classifier<br />
for<br />
plants<br />
tam5 pit;<br />
cesspool<br />
U+23CB<br />
7<br />
�氵�<br />
囗又<br />
✓<br />
U+6A16 樖 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+68F5 棵 ✓<br />
U+6C39 氹 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
�宀甾 ✓<br />
U+7A9E 窞 ✓<br />
yaak3 to eat U+55AB 喫 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+5403 吃 ✓ ✓<br />
Table 4.5: Indeterminate Cases<br />
51
po 1 , a classifier for plants 15 , is written with 樖, and the Zheng zitong 正字通<br />
(1671; HYDZD 2: 1282) suggests that o 1 柯 ‘stalk’ is the phonetic, where the only<br />
dissimilarity is the initial. However, it was used to write a semantically different word<br />
meaning ‘bamboo twigs rubbing and tapping against each other’ or ‘tree branches<br />
crossed and connected’ (HYDZD 2: 1282). Yue (1972) also lists 棵, the character for<br />
its literary counterpart fo 2 , a classifier for plants.<br />
tam 5 氹 ‘pit; cesspool’ 16 is written with 氹, which according to Williams (1909<br />
[1874]: 860) is composed of a seui 2 水 ‘water’ signific and a ‘one’ signific to indicate<br />
a “hole”, although the latter component appears to be yut 6 乙 ‘second’. Rao (1996)<br />
also lists 窞, a semantic loan of daam 6 窞 ‘pit’. Apparently, there is some confusion<br />
between the man-made ‘pit; cesspool’ and the naturally occurring ‘bog’. According to<br />
Williams (1856: 544), �宀甾, with the pronunciation c tòm (*dom 5 ), which differs in<br />
the aspiration of the initial, d- /t-/ rather than t- /t h -/ and the final, -om /-ɔm/ rather than<br />
-am /-ɐm/, is erroneously listed in the Fenyun 分韻 for tam 5 氹 ‘a tank; a pit’, while<br />
he makes a distinction between tam 5 氹 ‘a cesspool; a pit, a tank’ and dam 6 �宀甾 ‘a<br />
low place, a bog’ (498). Aubazac (1909: 30) is similarly confused, as he lists 氹 with<br />
the tam 5 pronunciation but defines it as “marais” (marsh). Besides tam 5 氹 ‘a pool’,<br />
Meyer (1947) also lists �宀甾 ‘a pit’ with the same tam 5 pronunciation, as well as<br />
t’ŏm (*tom 5 ) and dam 6 . t’ŏm (*tom 5 ) may be analyzed as a variation of tam 5 with the<br />
phonological merger of -om /-ɔm/ with -am /ɐm/, while dam 6 is the pronunciation that<br />
Williams (1856) gave for ‘bog’. <strong>The</strong> confusion may perhaps also involve the visually<br />
52
similar 窞 ‘pit’, for which Huang (1941: 4) gives the pronunciation daam 6 , which<br />
differs in the final from dam 6 , /tam/ rather than /tɐm/, while He (1999: 96) gives tam 5 .<br />
yaak 3 ‘to eat’ 17 is written with 喫, but according to Karlgren (1923: 120), the<br />
function of the gat 1 契 ‘tally’ component is unclear. As early as the mid-nineteenth<br />
century (Williams 1856), it could be substituted with 吃 as a semantic loan of hat 1 吃<br />
‘to eat’ (75), which in turn is either an unmarked phonetic loan of gat 1 吃 ‘to stutter’<br />
(136) which differs in the manner of articulation of the initial, g- /k-/ rather than h- /h-<br />
/, or a signific-phonetic character composed of a hau 2 口 ‘mouth’ signific and a<br />
completely homophonous hat 1 乞 ‘to beg’ phonetic. By the 1940s (Meyer 1947), 吃<br />
had also become borrowed as a semantic loan for hek 3 ‘to eat’.<br />
53
Endnotes<br />
1 laai 1 ‘last (child)’. 孻. U+5B7B. Williams (1856: 219) clái; Williams (1909<br />
[1874]: 493) “in Cantonese”; Aubazac (1909: 13) lái 1 ; Meyer (1947: #1429) laai; Yue<br />
(1972: 235) lA:ĭ 53 “colloquial character”; Lau (1977: #1758) laai 1o ; Rao (1996: 118)<br />
lai 1 .<br />
2 cheun 1 ‘animal egg’. ① �末�成肉. Williams (1856: 37*) cch’un; Rao (1996: 19)<br />
cên 1 . ② 膥. U+81A5. Aubazac (1909: 33) tch’eun 1 ; Meyer (1947: #413) ch’un; Yue<br />
(1972: 311) ts’øn 53 “colloquial character”. ③ 春. U+6625. Rao (1996: 19) cên 1 .<br />
3 ngan 1 ‘tiny’. 奀. U+5940. Williams (1856: 319) cngan “colloquial word”; Aubazac<br />
(1909: 19) ngan 1 ; Meyer (1947: #2038) ngan; Yue (1972: 334) ngɐn 53 “colloquial<br />
character”; Lau (1977: #2333) ngan 1 “CC”, “Coll.”; Rao (1996: 168) ngen 1 .<br />
4 me 1 ‘to carry on the back’. ① �貝子. U+27D2F. Williams (1856: 283) cmé<br />
“colloquial word”; Williams (1909 [1874]: 571) “unauthorized”, “in Cantonese”;<br />
Aubazac (1909: 17) mé 1 ; Meyer (1947: #1801) me; O'Melia (1959: 4: 100) me; Rao<br />
(1996: 146) mé 1 . ② 孭. U+5B6D. Meyer (1947: #1801) me; Yue (1972: 216) mɛ: 53<br />
“colloquial character”; Lau (1977: #2121) me 1 “CC”; Rao (1996: 146) mé 1 .<br />
5 ma 1 ‘twin’. 孖. U+5B56. Williams (1856: 269) cmá; Williams (1909 [1874]: 866)<br />
“in Cantonese”; Aubazac (1909: 16) ma 1 ; Meyer (1947: #1725) ma; O'Melia (1959: 4:<br />
94) ma; Yue (1972: 205) mA: 53 “colloquial character”; Lau (1977: #2034) ma 1 “CC”;<br />
Rao (1996: 142) ma 1 .<br />
6 me 2 ‘crooked’. 歪. U+6B6A. Williams (1856: 283) c mé “colloquial word”;<br />
Aubazac (1909: 17) mé 2 ; Meyer (1947: #1802) mé; Yue (1972: 216) mɛ: 35 “colloquial<br />
character”; Lau (1977: #2122) me 2 “Coll.”; Rao (1996: 146) mé 2 , wai 1 (waai 1 ).<br />
7 pok 1 ‘blister’. ① 泡. U+6CE1. Meyer (1947: #2454) p’òk (pok 3 ); Yue (1972: 226)<br />
p’ɔk 5 “colloquial character”; Lau (1977: #2547) pok 1o “CC”. ② �月暴. U+2688A.<br />
Rao (1996: 181) pog 1 .<br />
8 Alternatively, it is possible that 孖 was independently “re-invented” for ma 1 ‘twin’ as<br />
a co-signific character rather than as a semantic loan of ji 1 孖 ‘twin’. Thanks to<br />
Professor Marjorie Chan for this observation.<br />
9 dau 3 ‘den, nest’. ① 竇. U+7AC7. Williams (1856: 512) tau ɔ (dau 6 ); Williams<br />
(1909 [1874]: 805); Aubazac (1909: 31) tao3 (dau 6 ); Meyer (1947: #3029) taù;<br />
O'Melia (1959: 4: 173) tàu, tâu (tau 6 ); Yue (1972: 244) tɐŭ 44 “colloquial character”;<br />
54
Lau (1977: #507) dau 3 “Coll.”; Lau (1977: #508) dau 6 (dau 6 ); Rao (1996: 39) deo 3 ,<br />
deo 6 (dau 6 ). ② �口兜. Meyer (1947: #3029) taù.<br />
10 lit 3 ‘knot’. 纈. U+7E88. Williams (1856: 244) lítɔ “colloquial word”; Williams<br />
(1909 [1874]: 107); Meyer (1947: #1594) lìt; Yue (1972: 262) li:t 4 “colloquial<br />
character”; Lau (1977: #1907) lit 3 “CC”; Rao (1996: 129) lid 3 , kid 3 (kit 3 ).<br />
11 mou 5 ‘to not have’. 冇. U+5187. Williams (1856: 294) c mò “colloquial word”;<br />
Williams (1909 [1874]: 894) “unauthorized”, “in Cantonese”; Aubazac (1909: 17)<br />
mó2; Meyer (1947: #1848) mŏ; O'Melia (1959: 4: 103) mŏ; Yue (1972: 227) moŭ 24<br />
“colloquial character”; Lau (1977: #2169) mo 5 “CC”; Rao (1996: 153) mou 5 .<br />
12 laam 2 ‘olive’. ① 欖. U+6B16. Williams (1856: 222) c lám; Williams (1909<br />
[1874]: 497); Aubazac (1909: 13) lám2 (laam 5 ); Meyer (1947: #1435) laám; Yue<br />
(1972: 237) lA:m 24 (laam 5 ), lA:m 35 ; Lau (1977: #1765) laam 2 ; Rao (1996: 119) lam 5-2 .<br />
② 杬. U+676C. Rao (1996: 119) lam 5-2 .<br />
13 lung 5 ‘trunk’. ① 槓. U+69D3. Williams (1856: 266) c lung “unauthorized”;<br />
Williams (1909 [1874]: 432) “unauthorized”; Aubazac (1909: 16) loung2; Meyer<br />
(1947: #1717) lŭng; O'Melia (1959: 4: 93) lŭng; Lau (1977: #2027) lung 5 ; Rao (1996:<br />
137) lung 5 , gong 3 (gong 3 ). ② 篢. U+7BE2. Meyer (1947: #1717) lŭng; Yue (1972:<br />
274) lʊŋ 24 .<br />
14 nap 6 ‘sticky’. ① 湆. U+6E46. Williams (1856: 310) napɔ “colloquial word”;<br />
Williams (1909 [1874]: 78) “in Cantonese”; Meyer (1947: #1969) nâp; Yue (1972:<br />
248) nɐp 3 “colloquial character”; Lau (1977: #2273) nap 6 “CC”, “Coll.”. ②<br />
�氵�囗又. U+23CB7. Rao (1996: 159) neb 6 .<br />
15 po 1 ‘classifier for plants’. ① 樖. U+6A16. Williams (1856: 382) cp’o “colloquial<br />
word”; Meyer (1947: #2439) p’oh; O'Melia (1959: 4: 134) p’oh; Yue (1972: 224)<br />
p’ɔ: 53 “colloquial character”; Lau (1977: #2541) poh 1 “CC”; Rao (1996: 181) po 1 . ②<br />
棵. U+68F5. Yue (1972: 224) p’ɔ: 53 .<br />
16 tam 5 ‘pit; cesspool’. ① 氹. U+6C39. Williams (1856: 498) c t’am “colloquial<br />
word”; Williams (1909 [1874]: 860) “unauthorized”; Meyer (1947: #3005) t’ăm;<br />
O'Melia (1959: 4: 172) t’ăm; Yue (1972: 245) t’ɐm 24 “colloquial character”; Lau<br />
(1977: #3036) tam 5 “CC”; Rao (1996: 213) tem 5 . ② �宀甾. Meyer (1947: 3203)<br />
t’ŏm (*tom 5 ), tâm (dam 6 ), t’ăm (tam 5 ). ② 窞. U+7A9E. Rao (1996: 213) tem 5 .<br />
55
17 yaak 3 ‘to eat’. ① 喫. U+55AB. Williams (1856: 674) yákɔ; Aubazac (1909: 44)<br />
yák0; Meyer (1947: #3798) yaàk, hèk (hek 3 ); Yue (1972: 287) ĭA:k 4 “colloquial<br />
character”, “vulgar form”; Lau (1977: #3305) yaak 3 “Coll.”; Rao (1996: 236) yag 3 . ②<br />
吃. U+5403. Williams (1856: 674) yákɔ; Lau (1977: #3305) yaak 3 “Coll.”.<br />
56
CHAPTER 5<br />
PHONETIC LOANS<br />
Phonetic loans, which are the analogue of the jiajie 假借 ‘loan characters’<br />
principle in the traditional liushu 六書 model, are characters which have been<br />
borrowed as rebuses for their phonetic value. However, the traditional model is<br />
insufficiently defined with regards to the concept of marking phonetic loans as such,<br />
and rather than impose an interpretation, we establish a new principle modeled after it.<br />
<strong>The</strong> concept of phonetic loans was recognized by Williams (1856) as a device<br />
for writing Cantonese, to wit:<br />
Sometimes a well-known character of the same tone is selected to<br />
express the sound; and its evidently utter inaptitude in the connection to<br />
express any sense is depended upon to intimate that it is used for a<br />
colloquial word. (xii)<br />
Sometimes, again, a character which comes nearest in tone is taken to<br />
represent the needed sound, and the knowledge of the reader is<br />
expected to inform him that it is employed in a vulgar sense. <strong>The</strong><br />
words cnín 年 milk; clán 欄 a bazaar; and cnái 奶 a lady, are examples<br />
of this practice. (xiii)<br />
That is, nin 1 ‘milk’ is a phonetic loan of nin 4 年 ‘year’, laan 1 ‘marketplace’ is a<br />
phonetic loan of laan 4 欄 ‘fence’, and naai 1 ‘lady’ is a phonetic loan of naai 5 奶<br />
‘milk’. In other words, the character borrowed for a phonetic loan can be completely<br />
or semi-homophonous with respect to tone. However, in actuality, the character<br />
57
orrowed for a phonetic loan can also be less homophonous with respect to its<br />
segments.<br />
<strong>The</strong> productive nature of phonetic loans as a device for writing Cantonese is<br />
also recognized by Williams (1856), who says:<br />
This expedient is frequently employed by partly educated persons in<br />
letters, when they do not know, or cannot remember the proper<br />
characters. (xii)<br />
However, the situation with writing Cantonese is not that the “proper characters” are<br />
unknown or forgotten by undereducated people at the individual level, but that a<br />
“proper character” was never created due to the underdeveloped state of written<br />
Cantonese, or that the “proper character” is known only to scholars of the benzikao<br />
本字考 school who are engaged in researching the “original” etymological character.<br />
For example, 嚟 was apparently created for lai 4 ‘to come’ by people who were<br />
unaware or choose to ignore that lai 4 is the colloquial counterpart to loi 4 來 ‘to come’,<br />
and it is theoretically unnecessary to have a separate character for the colloquial<br />
pronunciation. <strong>The</strong>refore, Williams’ “partly educated” really refers to society as a<br />
whole in regards to writing in Cantonese, in contrast to writing in the then-current<br />
literary standard of classical Chinese.<br />
5.1 Unmarked Phonetic Loans<br />
<strong>The</strong> most basic phonetic loans are unmarked phonetic loans, where a character<br />
is borrowed without modification, such as daat 3 笪 ‘spot’ 1 , gat 1 �吉刂 ‘to stab’ 2 , lo 2<br />
‘to take’ 3 , naat 3 鈉 ‘to burn’ 4 , nam 4 腍 ‘tender’ 5 , nau 1 嬲 ‘angry’ 6 , ngat 1 扤 ‘to cram’ 7 ,<br />
ngok 6 咢 ‘to raise the head’ 8 , ning 1 擰/�扌寕/�扌寍 ‘to carry; to bring’ 9 , and wan 2<br />
58
搵/揾 ‘to find’ 10 , which are phonetic loans of the completely homophonous words<br />
daat 3 笪 ‘bamboo mat’, gat 1 �吉刂 ‘to flay the face’, lo 2 攞 ‘to choose’, naat 3 鈉 ‘to<br />
light’, nam 4 腍 ‘well cooked’, nau 1 嬲 ‘to flirt’, ngat 1 扤 ‘to sway’, ngok 6 咢 ‘to beat a<br />
drum’, ning 1 擰/�扌寕/�扌寍 ‘to pull’, and wan 2 搵/揾 ‘to dip’, respectively.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
daat3 spot U+7B2A 笪 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
gat1 to stab U+34E4 �吉刂 ✓ ✓ ✓ ✓ ✓ ✓<br />
lo2 to take U+651E 攞 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
naat3 to burn U+9209 鈉 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+712B 焫 ✓<br />
nam4 tender U+814D 腍 ✓ ✓ ✓ ✓ ✓ ✓<br />
nau1 angry U+5B32 嬲 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
U+60F1 惱 ✓<br />
ngat1 to cram n/a ∅ ✓<br />
U+6264 扤 ✓ ✓ ✓ ✓ ✓ ✓<br />
ngok6 to raise U+54A2 咢<br />
the head<br />
✓ ✓ ✓ ✓ ✓ ✓<br />
U+294E �岳頁 ✓<br />
ning1 to carry;<br />
to bring<br />
5<br />
U+64F0 擰 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
�扌寕 ✓ ✓<br />
�扌寍 ✓<br />
U+62CE 拎 ✓<br />
wan2 to find U+6435 搵 ✓ ✓ ✓ ✓<br />
U+63FE 揾 ✓ ✓ ✓ ✓ ✓<br />
wan3 to<br />
confine<br />
U+97DE 韞 ✓ ✓<br />
�韋昷 ✓ ✓<br />
U+7E15 縕 ✓<br />
U+7DFC 緼 ✓<br />
Table 5.1: Completely Homophonous Unmarked Phonetic Loans (History)<br />
59
wan 3 ‘to confine’ 11 has two forms, 韞/韋昷 and 縕/緼, which are phonetic<br />
loans of the completely homophonous words wan 3 韞/韋昷 ‘to conceal’ and wan 3<br />
縕/緼 ‘hemp flax’, respectively.<br />
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
daat3 spot U+7B2A 笪 daat3 笪 bamboo mat<br />
gat1 to stab U+34E4 �吉刂 gat1 �吉刂<br />
60<br />
to flay the face<br />
lo2 to take U+651E 攞 lo2 攞 to choose<br />
nam4 tender U+814D 腍 nam4 腍 well cooked<br />
naat3 to burn U+9209 鈉 naat3 鈉 to light<br />
U+712B 焫<br />
nau1 angry U+5B32 嬲 nau1 嬲 to flirt<br />
U+60F1 惱<br />
ngat1 to cram U+6264 扤 ngat1 扤 to sway<br />
ngok6 to raise<br />
the head<br />
ning1 to carry;<br />
to bring<br />
U+54A2 咢 ngok6 咢 to beat a drum<br />
U+294E5 �岳頁<br />
U+64F0 擰 ning1 擰 to pull<br />
�扌寕 ning1 �扌寕<br />
�扌寍 ning1 �扌寍<br />
U+62CE 拎<br />
to pull<br />
to pull<br />
wan2 to find U+6435 搵 wan2 搵 to dip<br />
U+63FE 揾 wan2 揾 to dip<br />
wan3 to confine U+97DE 韞 wan3 韞 to conceal<br />
�韋昷 wan3 �韋昷<br />
to conceal<br />
U+7E15 縕 wan3 縕 hemp flax<br />
U+7DFC 緼 wan3 緼 hemp flax<br />
Table 5.2: Completely Homophonous Unmarked Phonetic Loans (Basis)
However, dim 6 掂/敁 ‘straight’ 12 , ngan 3 奀 ‘to jiggle the feet’ 13 , and ung 2 擁<br />
‘to push’ 14 are unmarked phonetic loans of less than homophonous words. dim 6 掂/敁<br />
‘straight’ is a phonetic loan of dim 1 掂/敁 ‘to weigh in the hand’, which differs in the<br />
tone, yinping 陰平 (tone #1) rather than yangqu 陽去 (tone #6), while ngan 3 奀 ‘to<br />
jiggle the feet’ is a phonetic loan of ngan 1 奀 ‘tiny’, which differs in the tone, yinping<br />
陰平 (tone #1) rather than yinqu 陰去 (tone #3). On the other hand, ung 2 擁 ‘to push’<br />
is a phonetic loan of yung 2 擁 ‘to push’, which differs in the initial,y- /j-/ rather than a<br />
zero initial.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
dim6 straight U+6382 掂 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
U+6541 敁 ✓<br />
U+20DA �口店 ✓<br />
7<br />
m4 not U+5514 唔 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
mat1 what U+4E5C 乜 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
ngan3 to jiggle<br />
the feet<br />
U+5940 奀 ✓ ✓ ✓<br />
U+47F4 �足辰 ✓<br />
ung2 to push U+64C1 擁 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
U+39EC �巩手 ✓<br />
U+22B2<br />
E<br />
�扌戎 ✓<br />
Table 5.3: Semi-Homophonous Unmarked Phonetic Loans (History)<br />
m 4 ‘not’ and mat 1 ‘what’ are also phonetic loans of less than homophonous<br />
words, but they are extenuating cases. m 4 ‘not’ 15 , similar in function to Mandarin bù<br />
不, is a phonetic loan of ng 4 唔, a sound in singing, which differs in the place of<br />
61
articulation of the syllabic nasal, ng /ŋ̩/ rather than m /m̩/. However, there are no<br />
characters with exactly the same syllable as m 4 . Similarly, mat 1 ‘what’ 16 , also<br />
pronounced me 1 as a contraction, is actually a phonetic loan of me 2 乜 ‘to squint’<br />
based on the latter pronunciation, which differs in the tone, yinshang 陰上 (tone #2)<br />
rather than yinping 陰平 (tone #1).<br />
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
dim6 straight U+6382 掂 dim1 掂<br />
U+6541 敁 dim1 敁<br />
U+20DA7 �口店 dim3 店 store<br />
m4 not U+5514 唔 ng4 唔<br />
mat1 what U+4E5C 乜 me2 乜<br />
ngan3 to jiggle<br />
the feet<br />
U+5940 奀 ngan1 奀 tiny<br />
U+47F4 �足辰<br />
ung2 to push U+64C1 擁 yung2 擁<br />
U+39EC �巩手<br />
U+22B2E �扌戎<br />
62<br />
to weigh in the hand<br />
to weigh in the hand<br />
a sound in singing<br />
to squint<br />
to hold in the arms<br />
Table 5.4: Semi-Homophonous Unmarked Phonetic Loans (Basis)<br />
5.2 Marked Phonetic Loans<br />
Phonetic loans are sometimes marked to distinguish them from other usages of<br />
the character, which usually takes the form of a hau 2 口 ‘mouth’ or a yan 4 亻(人)<br />
‘person’ radical added to the left. Williams (1856) recognized this, saying:<br />
Another device to indicate colloquial words is to prefix the character<br />
hau 口 mouth, or yan 人 a man, at the side of some well known<br />
character of the same sound, but not always of the same tone. <strong>The</strong>
words tsoi ɔ 儎 cargo; cká c fo 傢伙, furniture; c mai 咪 do not; ctsoi 啋<br />
pshaw! and c té 嗲 remiss, &c., are examples of this sort. (xii-xiii)<br />
That is, joi 6 儎 ‘cargo’ is a phonetic loan of joi 6 載 ‘to transport’; ga 1 fo 2 傢伙<br />
‘furniture’ is a phonetic loan of ga 1 家 ‘family’ and fo 2 火 ‘fire’, respectively; mai 5 咪<br />
‘do not’ is a phonetic loan of mai 5 米 ‘rice’; choi 1 啋 ‘fie; pshaw’ is a phonetic loan of<br />
choi 2 采 ‘to gather’; and de 2 嗲 ‘lazy’ is a phonetic loan of de 1 爹 ‘father’. Only the<br />
last two examples are not completely homophonous with the character borrowed, but<br />
they have all been marked. However, the use of the ren 亻(人) ‘person’ radical as a<br />
marker is very rare compared to the use of kou 口 ‘mouth’.<br />
<strong>The</strong>re are also less common ways to mark a phonetic loan character, such as<br />
enclosing it in double quotes, e.g., tam 3 ‘to deceive’ can be written as “氹”, a phonetic<br />
loan of tam 5 氹 ‘pit; cesspool’, and ha 1 ‘to bully’ can be written as “蝦”, a phonetic<br />
loan of ha 1 蝦 ‘shrimp’. Another way to mark a phonetic loan character is to alter its<br />
graphic form, but this is very rare, e.g., pīngpāng 乒乓 ‘ping-pong’, where both<br />
characters are phonetic loans of bīng 兵 ‘soldier’, but with a stroke deleted.<br />
<strong>The</strong> character borrowed for a marked phonetic loan can differ in the initial,<br />
such as kat 1 咭 ‘card’ 17 , a loanword of English “card”. kat 1 咭 ‘card’ is a phonetic<br />
loan of gat 1 吉 ‘lucky’, which differs in the aspiration of the initial, g- /k-/ rather than<br />
k- /k h -/. Similarly, yai 5 �口兮 ‘bad’ 18 , also pronounced yai 4 and yai 5 , is a phonetic<br />
loan of hai 4 兮, a classical particle, which differs in the initial, y- /j-/ rather than h- /h-<br />
/. Likewise, lok 3 , a sentence-final particle 19 , is a phonetic loan of gok 3 各 ‘each’.<br />
Although gok 3 各 ‘each’ appears to be a less than optimal phonetic, it can serve as a<br />
63
lok 3 phonetic, such as in the lok 3 洛 of lok 3 yeung 4 洛陽 ‘Luoyang’, lok 3 絡 ‘to join’,<br />
and the lok 3 駱 of lok 3 tuo 4 駱駝 ‘camel’.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
ge3 genitive<br />
particle<br />
U+5605 嘅 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
gip1 bag U+55BC 喼 ✓ ✓ ✓ ✓<br />
kat1 card U+54AD 咭 ✓ ✓ ✓<br />
n/a ✓<br />
lok3 SFP U+54AF 咯 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
yai5 bad U+20BCB �口兮 ✓ ✓ ✓ ✓ ✓<br />
n/a ✓<br />
U+66F3 曳 ✓<br />
Table 5.5: Marked Phonetic Loans<br />
Differing in the Initial or Final (History)<br />
<strong>The</strong> character borrowed for a marked phonetic loan can also differ in the final,<br />
such as ge 3 嘅, a genitive particle 20 , similar in function to Mandarin de 的. ge 3 嘅 is a<br />
phonetic loan of the semi-homophonous word gei 3 既 ‘already’, which differs in the<br />
final, -ei /-ei/ rather than -e /-ɛ/. However, there are no characters with exactly the<br />
same syllable and tone as ge 3 , and this was recognized by Morrison (1828: 2: “kay”),<br />
who gives the unmarked form, saying, “<strong>The</strong> Chinese have no character for this<br />
sound”. Although ke 4 茄 ‘eggplant’ and ke 4 騎 ‘to ride’ do match the final, they differ<br />
in the aspiration of the initial, k- /k h -/ rather than g- /k-/, and the tone, yangping 陽平<br />
(tone #4) rather than yinqu 陰去 (tone #3). This suggests that it is more important for<br />
the phonetic to match the initial and the tone than the final. Similarly, gip 1 喼 ‘bag’ 21 ,<br />
64
considered by some to be a loanword of English “grip”, is a phonetic loan of the semi-<br />
homophonous word gap 1 急 ‘urgent’, which differs in the final, -ap /-ɐp/ rather than -<br />
ip /-ip/.<br />
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
ge3 genitive particle U+5605 嘅 gei3 既 already<br />
gip1 bag U+55BC 喼 gap1 急 urgent<br />
kat1 card U+54AD 咭 gat1 吉 lucky<br />
lok3 SFP U+54AF 咯<br />
yai5 bad U+20BCB �口兮 hai4 兮 particle<br />
U+66F3 曳 yai6 曳<br />
Table 5.6: Marked Phonetic Loans<br />
Differing in the Initial or Final (Basis)<br />
65<br />
to drag<br />
<strong>The</strong> character borrowed for a marked phonetic loan can also differ in the tone,<br />
such as mak 1 ‘mark’ 22 , a loanword of English “mark”, which is written with 嚜 and<br />
less commonly 嘜, both of which are phonetic loans of the semi-homophonous words<br />
mak 6 墨 ‘ink’ and mak 6 麥 ‘wheat’, respectively, and differ in the tone, yangru 陽入<br />
(tone #6) rather than yinru 陰入 (tone #1).<br />
Likewise, miu 2 �口妙 ‘to purse the lips’ 23 is a phonetic loan of the semi-<br />
homophonous word miu 6 妙 ‘wonderful’, which differs in the tone, yangqu 陽去 (tone<br />
#6) rather than yinshang 陰上 (tone #2).
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
mak1 mark U+569C 嚜 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+561C 嘜 ✓ ✓<br />
miu2 to purse<br />
the lips<br />
U+20D15 �口妙 ✓ ✓ ✓ ✓<br />
Table 5.7: Marked Phonetic Loans<br />
Differing in the Tone (History)<br />
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
mak1 mark U+569C 嚜 mak6 墨 ink<br />
U+561C 嘜 mak6 麥 wheat<br />
miu2 to purse the lips U+20D15 �口妙 miu6 妙 wonderful<br />
Table 5.8: Marked Phonetic Loans<br />
Differing in the Tone (Basis)<br />
Although it was not originally a marked phonetic loan, gaat 6 jaat 6<br />
‘cockroach’ 24 has been reanalyzed as one. gaat 6 jaat 6 ‘cockroach’ was first written<br />
with a pair of characters 甴曱, one of which is apparently a copy of the other rotated<br />
180 degrees. O’Melia (1959: 4: 60) lists 甴曱 as a compound under gaap 3 甲 ‘shell’,<br />
suggesting that 甴, the first of the pair, is a semi-homophonous phonetic loan with a<br />
shortened stroke, which was then rotated. However, Williams (1856) gives the<br />
pronunciation of the word as ga 1 jaat 6 , with neither syllable close to gaap 3 甲 ‘shell’<br />
/kap/, unless the first syllable is analyzed as the result of consonant deletion, /*kat tsat/<br />
� /ka tsat/. By the 1970s (Yue 1972), the order of the characters had been reversed to<br />
66
曱甴 25 , the contemporary arrangement, perhaps to better fit a conceptual model similar<br />
to O’Melia’s analysis but without the need for rotation.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
gaat6<br />
jaat6<br />
cockroach U+7534<br />
U+66F1<br />
U+66F1<br />
U+7534<br />
甴曱 ✓ ✓ ✓ ✓<br />
曱甴 ✓ ✓<br />
Table 5.9: Indeterminate Case Reanalyzed as a Phonetic Loan<br />
5.3 Unmarked Phonetic Loans Superseded by Marked Phonetic Loans<br />
<strong>The</strong> usefulness of markers in distinguishing phonetic loans from other usages<br />
of the character has caused numerous unmarked phonetic loans to be superseded by<br />
marked phonetic loans as early as the mid-nineteenth century (Williams 1856), such<br />
as: 1) a 1 吖, a sentence-final particle 26 ; 2) dei 6 哋, a plural marker 27 , similar in<br />
function to Mandarin men 們; 3) gam 2 噉 ‘so (manner)’ 28 , similar in function to<br />
Mandarin zhèyàng 這樣 and nàyàng 那樣; 4) gam 3 咁 ‘so (quantity) 29 ’, similar in<br />
function to Mandarin zhème 這麼 and nàme 那麼; 5) gwa 3 啩, a sentence-final particle<br />
expressing uncertainty 30 , considered by O’Melia (1959: 4: 78) to be a contraction of<br />
gu 2 a 3 估呀; 6) haai 4 嚡 ‘coarse’ 31 ; 7) hai 2 喺 ‘to be at’ 32 , similar in function to<br />
Mandarin zài 在; 8) kwaak 1 �口緙 ‘loop; to loop’ 33 , 9) lai 4 嚟 ‘to come’ 34 , the<br />
colloquial counterpart of loi 4 來 ‘to come’; 10) mai 5 咪 ‘do not’ 35 , similar in function<br />
to Mandarin bié 別; 11) mo 1 嚤 ‘slow’ 36 ; and 12) ngaam 1 啱 ‘correct’ 37 .<br />
67
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
a1 SFP U+4E2B 丫 ✓<br />
U+5416 吖 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
dei6 plural<br />
marker<br />
U+5440 呀 ✓<br />
U+5730 地 ✓<br />
U+54CB 哋 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
gam2 so U+6562 敢 ✓<br />
U+5649 噉 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
gam3 so U+7518 甘 ✓<br />
U+5481 咁 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
gwa3 SFP U+5366 卦 ✓<br />
U+5569 啩 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
haai4 coarse U+978B 鞋 ✓<br />
U+56A1 嚡 ✓ ✓ ✓ ✓ ✓<br />
n/a ∅ ✓<br />
hai2 to be at U+4FC2 係 ✓<br />
U+55BA 喺 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
kwaak1 loop;<br />
to loop<br />
U+7DD<br />
9<br />
U+210C<br />
8<br />
緙 ✓ ✓<br />
�口<br />
緙<br />
✓ ✓ ✓<br />
�口<br />
隙<br />
✓<br />
lai4 to come U+9ECE 黎 ✓<br />
U+569F 嚟 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
mai5 don’t U+7C73 米 ✓<br />
U+54AA 咪 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
mo1 slow U+6469 摩 ✓<br />
U+56A4 嚤 ✓ ✓ ✓ ✓ ✓<br />
n/a ∅ ✓<br />
ngaam1 correct U+5CA9 岩 ✓<br />
U+5571 啱 ✓ ✓ ✓ ✓ ✓ ✓<br />
Table 5.10: Unmarked Phonetic Loans Superseded by Marked Phonetic Loans, Part I<br />
(History)<br />
68
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
a1 SFP U+4E2B 丫 a1 丫 fork<br />
dei6 plural<br />
marker<br />
U+5416 吖 a1 丫 fork<br />
U+5440 呀 a3 呀 SFP<br />
U+5730 地 dei6 地 earth<br />
U+54CB 哋 dei6 地 earth<br />
gam2 so (manner) U+6562 敢 gam2 敢 to dare<br />
U+5649 噉 gam2 敢 to dare<br />
gam3 so (quantity) U+7518 甘 gam1 甘 sweet<br />
U+5481 咁 gam1 甘 sweet<br />
gwa3 SFP U+5366 卦 gwa3 卦 to divine<br />
U+5569 啩 gwa3 卦 to divine<br />
haai4 coarse U+978B 鞋 haai4 鞋 shoe<br />
U+56A1 嚡 haai4 鞋 shoe<br />
hai2 to be at U+4FC2 係 hai6 係<br />
U+55BA 喺 hai6 係<br />
kwaak1 loop; to loop U+7DD9 緙 kaak1 緙<br />
U+210C8 �口緙 kaak1 緙<br />
69<br />
to be<br />
to be<br />
�口隙 gwik 隙 crack<br />
woven threads<br />
woven threads<br />
lai4 to come U+9ECE 黎 lai4 黎 multitude<br />
U+569F 嚟 lai4 黎 multitude<br />
mai5 don’t U+7C73 米 mai5 米 rice<br />
U+54AA 咪 mai5 米 rice<br />
mo1 slow U+6469 摩 mo4 摩 to rub<br />
U+56A4 嚤 mo4 摩 to rub<br />
ngaam1 correct U+5CA9 岩 ngaam4 岩 cliff<br />
U+5571 啱 ngaam4 岩 cliff<br />
Table 5.11: Unmarked Phonetic Loans Superseded by Marked Phonetic Loans, Part I<br />
(Basis)
In a number of cases, the unmarked form appears to post-date the marked<br />
form, but the fact that this happens in two sources by the same author (Williams 1856,<br />
1909 [1874]) suggests that it is merely an omission of the unmarked form from the<br />
earlier source.<br />
However, there are some cases where the unmarked form was not superseded<br />
until later, such as di 1 ‘some’ 38 , considered by Williams (1856: 514) to be a<br />
“colloquial corruption” of 的”, which existed as early as the mid-nineteenth century<br />
(Williams 1856), but is only given a written form in later sources. As early as the late<br />
nineteenth century (Williams 1909 [1874]), it was written with 的, and has been<br />
written with it up to at least the 1940s (Meyer 1947), but by the beginning of the<br />
twentieth century (Aubazac 1909), a hau 2 口 ‘mouth’ radical had already been added<br />
to create 啲. Similarly, ye 5 ‘thing’ 39 , was first written with 野, but by the 1970s (Yue<br />
1972), a hau 2 口 ‘mouth’ radical had been added to create 嘢.<br />
On the other hand, tau 2 ‘to rest’ 40 and yuk 1 ‘to move’ 41 are still unresolved.<br />
tau 2 was first written with 抖, and by the beginning of the twentieth century (Aubazac<br />
1909), a hau 2 口 ‘mouth’ radical had been added to create 唞, but both forms have co-<br />
existed up to the present. Similarly, yuk 1 ‘to move’ was first written with 郁, and by<br />
the 1940s (Meyer 1947), a hau 2 口 ‘mouth’ radical had been added to create 喐, but<br />
both forms have co-existed up to the present.<br />
70
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
bai6 bad U+5F0A 弊 ✓ ✓ ✓ ✓ ✓<br />
U+21681 �敝大 ✓<br />
U+210C �口弊 ✓<br />
7<br />
di1 some n/a ∅ ✓<br />
U+7684 的 ✓ ✓<br />
U+5572 啲 ✓ ✓ ✓ ✓ ✓ ✓<br />
ga3 SFP U+67B6 架 ✓<br />
U+35CE �口架 ✓ ✓ ✓ ✓<br />
U+210C �口駕 ✓<br />
9<br />
go2 that U+500B 個 ✓ ✓ ✓ ✓ ✓<br />
U+55F0 嗰 ✓ ✓ ✓ ✓ ✓<br />
U+7B87 箇 ✓ ✓ ✓<br />
U+4E2A 个 ✓ ✓ ✓ ✓<br />
tau2 to rest U+6296 抖 ✓ ✓ ✓<br />
U+551E 唞 ✓ ✓ ✓ ✓ ✓<br />
U+3A97 �咅攴 ✓<br />
ye5 thing U+91CE 野 ✓ ✓ ✓ ✓<br />
U+5622 嘢 ✓ ✓ ✓<br />
U+57DC 埜 ✓<br />
yuk1 to move U+90C1 郁 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+5590 喐 ✓ ✓ ✓<br />
Table 5.12: Unmarked Phonetic Loans Superseded by Marked Phonetic Loans, Part II<br />
(History)<br />
Although it is it not attested in sources earlier than the 1940s (Meyer 1947),<br />
ga 3 , a sentence-final particle 42 which is a contraction of ge 3 a 3 嘅呀, also fits the<br />
pattern of unmarked phonetic loans being superseded by marked phonetic loans. ga 3<br />
was written with �口架 as early as the 1940s (Meyer 1947), but the fact that an<br />
71
unmarked form, 架, was used up to at least the 1950s (O’Melia) suggests that it<br />
existed earlier.<br />
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
bai6 bad U+5F0A 弊 bai6 弊 bad<br />
U+21681 �敝大 bai6 敝大 bad<br />
U+210C7 �口弊 bai6 弊 bad<br />
di1 some U+7684 的 dik1 的 genitive particle<br />
U+5572 啲 dik1 的 genitive particle<br />
ga3 SFP U+67B6 架 ga3 架 frame<br />
U+35CE �口架 ga3 架 frame<br />
U+210C9 �口駕 ga3 駕 to drive<br />
go2 that U+500B 個 go3 個 one<br />
U+55F0 嗰 go2 個 that<br />
U+7B87 箇 go3 箇 one<br />
U+4E2A 个 go3 个 one<br />
tau2 to rest U+6296 抖 dau3 抖 to rouse<br />
U+551E 唞 dau3 抖 to rouse<br />
U+3A97 �咅攴 tau2 �咅攴 to unwrap<br />
ye5 thing U+91CE 野 ye5 野 wild<br />
U+5622 嘢 ye5 野 wild<br />
U+57DC 埜 ye5 埜 wild<br />
yuk1 to move U+90C1 郁 yuk1 郁 elegant<br />
U+5590 喐 yuk1 郁 elegant<br />
Table 5.13: Unmarked Phonetic Loans Superseded by Marked Phonetic Loans, Part II<br />
(Basis)<br />
Sometimes, the transition from an unmarked phonetic loan to a marked<br />
phonetic loan is the result of semantic specialization, such as go 2 ‘that’ 43 , which<br />
developed from go 3 個 ‘one’ to distinguish cases such as go 2 go 3 嗰個 ‘that one’ from<br />
go 3 go 3 個個 ‘every one’ (Williams 1856: 167), and was written with 箇, 個, and 个 as<br />
inter<strong>change</strong>able variant forms. According to Williams (1909 [1874]: 444), 箇 was<br />
72
“not common”, 个 was “much used”, while no comment is made about 個, the<br />
standard form. This distribution is reflected in the number of sources that list each of<br />
them, as well as the disappearance of 箇 and 个 from sources later than the mid-<br />
twentieth century. By the 1940s (Meyer 1947), a hau 2 口 ‘mouth’ radical had been<br />
added to 個, the standard and remaining form, to create 嗰.<br />
<strong>The</strong> usefulness of marking phonetic loans even extends to bai 6 弊 ‘bad’ 44 and<br />
its variant form �敝大, to which Rao (1996) adds an extraneous hau 2 口 ‘mouth’<br />
radical to create �口弊, without any apparent motivation for doing so.<br />
5.4 Optimization<br />
<strong>The</strong> character borrowed for a phonetic loan is sometimes replaced by one that<br />
is more homophonous in tone, such as ngai 1 ‘to beg’ 45 , which was first written with<br />
unmarked and marked phonetic loans of the semi-homophonous word ngai 6 偽 ‘false’,<br />
which differs in the tone, yangqu 陽去 (tone #6) rather than yinping 陰平 (tone #1).<br />
By the 1940s (Meyer 1947), it was written with �口危, a marked phonetic loan of the<br />
semi-homophonous word ngai 4 危 ‘dangerous’, which differs in the tone register,<br />
yangping 陽平 (tone #4) instead of yinping 陰平 (tone #1). Whereas there was no<br />
direct relationship between the tone of ngai 1 ‘to beg’ and the tone of ngai 6 偽 ‘false’,<br />
the tone of the former and the tone of ngai 4 危 ‘dangerous’ are both ping 平 tones, but<br />
belonging to different registers.<br />
Similarly, gau 6 ‘lump’ 46 was first written with 倃, a marked phonetic loan of<br />
the semi-homophonous word gau 3 咎 ‘fault’, which differs in the tone register, yinqu<br />
陰去 (tone #3) rather than yangqu 陽去 (tone #6). By the 1940s (Meyer 1947), it was<br />
73
written with 嚿, a marked phonetic loan of the completely homophonous word gau 6 舊<br />
‘old’. Furthermore, the marker of phonetic loans had been <strong>change</strong>d from the rarely-<br />
used yan 4 亻(人) ‘person’ radical to the commonly-used hau 2 口 ‘mouth’ radical.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
gau6 lump U+5003 倃 ✓ ✓ ✓ ✓<br />
U+56BF 嚿 ✓ ✓ ✓ ✓<br />
ngai1 to beg U+507D 偽 ✓<br />
U+20F2E �口偽 ✓ ✓ ✓ ✓<br />
tam3 to<br />
deceive<br />
U+20C53 �口危 ✓ ✓ ✓<br />
U+5664 噤 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+20C41 �口氹 ✓ ✓<br />
U+27A3E �言�<br />
�冖八<br />
木<br />
Table 5.14: Optimization of the Phonetic in Phonetic Loans (History)<br />
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
gau6 lump U+5003 倃 gau3 咎 fault<br />
U+56BF 嚿 gau6 舊 old<br />
ngai1 to beg U+507D 偽 ngai6 偽 false<br />
U+20F2E �口偽 ngai6 偽 false<br />
U+20C53 �口危 ngai4 危 dangerous<br />
tam3 to deceive U+5664 噤 gam3 噤 mute<br />
U+20C41 �口氹 tam5 氹<br />
U+27A3E �言��冖八木<br />
74<br />
pit; cesspool<br />
Table 5.15: Optimization of the Phonetic in Phonetic Loans (Basis)<br />
✓
<strong>The</strong> character borrowed for a phonetic loan can also be replaced by one that is<br />
more homophonous with respect to its segments, such as tam 3 ‘to deceive’ 47 , which<br />
was first written with 噤, and has been written with it up to at least the early 1970s<br />
(Yue 1972). 噤 is a unmarked phonetic loan of the word gam 3 噤 ‘mute’, which<br />
differs in the place of articulation of the initial, g- /k-/ rather than t- /t h -/. Although<br />
Meyer (1947: #1018) also lists kam 1 and tam 1 as pronunciations for ‘mute’, of which<br />
the latter which may be analyzed as the basis of a phonetic loan, other sources,<br />
including Williams (1856), do not give such a pronunciation. However, by the 1940s<br />
(Meyer 1947), tam 3 ‘to deceive’ was already written with �口氹, a marked phonetic<br />
loan of the semi-homophonous word tam 5 氹 ‘pit; cesspool’, which differs in the tone,<br />
yangshang 陽上 (tone #5) rather than yinqu 陰去 (tone #3). Although the tone of tam 5<br />
氹 ‘cesspool; pit’ differs from that of tam 3 ‘to deceive’ whereas the tone of gam 3 噤<br />
‘mute’ did not, tam 5 氹 ‘cesspool; pit’ does not differ in the initial, suggesting that it is<br />
more important for the phonetic to match the initial than the tone.<br />
<strong>The</strong> replacement of the character borrowed for a phonetic loan is sometimes<br />
facilitated by a phonological merger which causes a more homophonous character to<br />
become available, such as saai 3 , a quantifying particle indicating completeness 48 ,<br />
similar in function to Mandarin guāng 光. saai 3 was first written with unmarked and<br />
marked phonetic loans of the semi-homophonous word saai 2 徙 ‘to move’, which<br />
differs in the tone, yinshang 陰上 (tone #2) rather than yinqu 陰去 (tone #3). By the<br />
late 1970s (Lau 1977), it was written with unmarked and marked phonetic loans of the<br />
completely homophonous word saai 3 晒 ‘to shine on’, whose initial was formerly /ʃ-/<br />
75
(Williams 1856: 417; O’Melia 1959: 4: 141) but now /s-/ (Yue 1972: 280). This made<br />
it completely homophonous with saai 3 , the completeness quantifying particle, whose<br />
initial had always been /s-/ (Williams 1856: 405).<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
jo2 perfective<br />
aspect<br />
marker<br />
U+963B 阻 ✓ ✓<br />
U+5528 唨 ✓ ✓ ✓ ✓<br />
U+5497 咗 ✓ ✓<br />
saai3 particle U+5F99 徙 ✓<br />
U+5625 嘥 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+6652 晒 ✓<br />
U+55EE 嗮 ✓<br />
Table 5.16: Optimization of the Phonetic in Phonetic Loans<br />
Facilitated by Phonological Mergers (History)<br />
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
jo2 perfective aspect marker U+963B 阻 jo2 阻 to obstruct<br />
U+5528 唨 jo2 阻 to obstruct<br />
U+5497 咗 jo2 左 left<br />
saai3 particle U+5F99 徙 saai2 徙<br />
U+5625 嘥 saai2 徙<br />
U+6652 晒 saai3 晒<br />
U+55EE 嗮 saai3 晒<br />
Table 5.17: Optimization of the Phonetic in Phonetic Loans<br />
Facilitated by Phonological Mergers (Basis)<br />
76<br />
to move<br />
to move<br />
to shine on<br />
to shine on
Similarly, jo 2 , the perfective aspect marker 49 , similar in function to Mandarin<br />
le 了, was first written with unmarked and marked phonetic loans of the completely<br />
homophonous word jo 2 阻 ‘to obstruct’, and was written with the marked form up to at<br />
least the late 1970s (Lau 1977). However, by the early 1970s (Yue 1972), it was<br />
already written with 咗, a marked phonetic loan of the completely homophonous word<br />
jo 2 左 ‘left’, whose initial was formerly /ts-/ (Williams 1856: 582; O’Melia 1959: 205)<br />
but now /tʃ-/ (Yue 1972: 313). This made it completely homophonous with jo 2 , the<br />
perfective aspect marker, whose initial had always been had always been /tʃ-/<br />
(Williams 1856: 25*). Although jo 2 阻 ‘to obstruct’ was already completely<br />
homophonous with jo 2 , the perfective aspect marker, it was <strong>orthographic</strong>ally more<br />
complex than jo 2 左 ‘left’, which had since become completely homophonous.<br />
<strong>The</strong> character borrowed for a phonetic loan can also be replaced for reasons<br />
unrelated to its degree of homophony, such as jo 2 , the perfective aspect marker, and<br />
ngak 1 ‘to trick’ 50 . ngak 1 ‘to trick’ was first written with 阨, an unmarked phonetic<br />
loan of the semi-homophonous word ak 1 阨 ‘obstruction’, which differs in the initial, a<br />
zero initial rather than ng- /ŋ-/. However, the zero initial and ng- /ŋ-/ are commonly<br />
substituted for each other as a phonological merger. ngak 1 ‘to trick’ was written with<br />
阨 up to at least the 1950s (O’Melia 1959), but by the 1940s (Meyer 1947), a hau 2 口<br />
‘mouth’ radical had already been added and the phonetic <strong>change</strong>d to ak 1 厄<br />
‘misfortune’ to create the <strong>orthographic</strong>ally less complex 呃.<br />
Similarly, la 3 , a sentence-final particle 51 , was first written with an unmarked<br />
phonetic loan of the completely homophonous word la 3 罅 ‘crack’, which is also<br />
77
written as �阝虖, 鏬, �土虖, and later marked phonetic loans �口鏬 and 嚹. By<br />
the 1940s, only the marked phonetic loan of the standard form of la 3 罅 ‘crack’<br />
remained. By the late 1970s, it was written with 喇, an unmarked phonetic loan of the<br />
la 3 of la 3 ba 1 喇叭 ‘trumpet’. Although 喇 is an unmarked phonetic loan unlike<br />
�口鏬 and 嚹, it is <strong>orthographic</strong>ally less complex, while resembling the marked<br />
phonetic loans for other sentence-final particles, such as a 1 吖, gwa 3 啩, and ga 3<br />
�口架.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
la3 SFP U+7F45 罅 ✓<br />
U+28EF2 �阝虖 ✓<br />
U+93EC 鏬 ✓<br />
U+3664 �土虖 ✓<br />
�口鏬 ✓ ✓<br />
U+56B9 嚹 ✓ ✓ ✓<br />
U+561E 嘞 ✓<br />
U+5587 喇 ✓ ✓<br />
me1 SFP U+27D2F �貝子 ✓ ✓<br />
U+54A9 咩 ✓ ✓ ✓ ✓ ✓ ✓<br />
ngak1 to trick U+9628 阨 ✓ ✓ ✓ ✓ ✓<br />
U+5443 呃 ✓ ✓ ✓ ✓<br />
U+7732 眲 ✓<br />
Table 5.18: Optimization of the Phonetic in Phonetic Loans<br />
for Other Reasons (History)<br />
Likewise, me 1 , a sentence-final particle expressing doubt 52 , considered by<br />
O’Melia (1959: 4: 101) to be a contraction of mei 6 e 1 未睎, was first written with<br />
78
�貝子, an unmarked phonetic loan of the completely homophonus word me 1 �貝子<br />
‘to carry on the back’. By the beginning of the twentieth century (Aubazac 1909), it<br />
was written with me 1 咩, an unmarked phonetic loan of the completely homophonous<br />
word me 1 咩, the sound of a sheep. Although me 1 �貝子 ‘to carry on the back’ was<br />
already completely homophonous with the sentence-final particle me 1 , unlike me 1 咩,<br />
the sound of a sheep, it does not resemble the marked phonetic loans for other<br />
sentence-final particles, such as a 1 吖, gwa 3 啩, and ga 3 �口架.<br />
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
la3 SFP U+7F45 罅 la3 罅 crack<br />
U+28EF2 �阝虖 la3 �阝虖 crack<br />
U+93EC 鏬 la3 鏬 crack<br />
U+3664 �土虖 la3 �土虖 crack<br />
�口鏬 la3 鏬 crack<br />
U+56B9 嚹 la3 罅 crack<br />
U+561E 嘞 lak3 嘞 SFP<br />
U+5587 喇 la3 喇 trumpet<br />
me1 SFP U+27D2F �貝子 me1 �貝子<br />
79<br />
to carry on the back<br />
U+54A9 咩 me1 咩 sound of a sheep<br />
ngak1 to trick U+9628 阨 ak1 阨 obstruction<br />
U+5443 呃 ak1 厄 misfortune<br />
U+7732 眲<br />
Table 5.19: Optimization of the Phonetic in Phonetic Loans<br />
for Other Reasons (Basis)<br />
However, in a few cases, “optimizations” to the phonetic have actually made it<br />
less homophonous. dap 1 ‘to pound’ 53 was written with 搭 and 撘, unmarked phonetic
loans of the semi-homophonous word daap 3 搭/撘 ‘to join together’, which differs in<br />
the final, -aap /-ap/ rather than -ap /-ɐp/, as well as the tone, yinqu 陰去 (tone #3)<br />
rather than yinping 陰平 (tone #1). Lau (1977) instead lists �口扱, a marked<br />
phonetic loan of kap 1 �口扱 ‘to receive’, which differs in the initial, k- /k h -/ rather<br />
than d- /t-/. Although �口扱 does not differ in the tone, there is no direct relationship<br />
between the initials of dap 1 ‘to pound’ and kap 1 ‘to receive’. This suggests that a less<br />
homophonous phonetic may be countered by marking it as a phonetic loan.<br />
Similarly, long 2 ‘to rinse’ 54 , existed as early as the mid-nineteenth century<br />
(Williams 1856), but is only given a written form in later sources. As early as the late<br />
nineteenth century (Williams 1909 [1874]), it was written with 朗, an unmarked<br />
phonetic loan of the semi-homophonous word long 5 朗 ‘clear’, which differs in the<br />
tone register, yangshang 陽上 (tone #5) rather than yinshang 陰上 (tone #2). By the<br />
beginning of the twentieth century (Aubazac), it was written with �口浪, a marked<br />
phonetic loan of the semi-homophonous word long 6 浪 ‘wave’, which differs in the<br />
tone, yangqu 陽去 (tone #6) rather than yinshang 陰上 (tone #2). Whereas there was<br />
a direct relationship between the tone of long 2 ‘to rinse’ and long 5 朗 ‘clear’, there is<br />
none between the former and long 6 浪 ‘wave’. However, like dap 1 ‘to pound’, a less<br />
homophonous phonetic is offset by marking it as a phonetic loan.<br />
ngap 1 ‘to jabber’ 55 , which is written with 吸 and less commonly 噏, both of<br />
which are unmarked phonetic loans of the semi-homophonous word kap 1 吸/噏 ‘to<br />
inhale’, which differs in the manner of articulation of the homorganic initial<br />
consonant, k- /k h -/ rather than ng- /ŋ-/. Lau (1977) instead lists �口揖, an unmarked<br />
80
phonetic loan of ngap 6 �口揖 ‘to bow’, which differs in the tone, yangqu 陽去 (tone<br />
#6) rather than yinping 陰平 (tone #1). However, Rao (1996) instead lists<br />
�口�日絲, a marked phonetic loan of hin 2 �日絲 ‘to display’, but which can serve<br />
as a -ap 1 final, such as in sap 1 濕 ‘wet’. Whereas ngap 6 �口揖 ‘to bow’ was more<br />
homophonous than kap 1 吸/噏 ‘to inhale’, hin 2 �日絲 ‘to display’ is a less optimal<br />
phonetic.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
dap6 to pound U+642D 搭 ✓ ✓<br />
U+6498 撘 ✓<br />
�口扱 ✓<br />
U+22C55 �扌耷 ✓<br />
long2 to rinse n/a ∅ ✓ ✓<br />
U+6717 朗 ✓<br />
U+20E98 �口浪 ✓ ✓ ✓ ✓<br />
ngap1 to jabber U+5438 吸 ✓ ✓ ✓ ✓ ✓<br />
U+564F 噏 ✓ ✓<br />
�口揖 ✓<br />
U+2103E �口�<br />
日絲<br />
wo5 SFP U+555D 啝 ✓ ✓<br />
U+558E 喎 ✓ ✓ ✓<br />
Table 5.20: Erroneous Optimization of the Phonetic in a Phonetic Loan<br />
(History)<br />
Similarly, wo 5 , a sentence-final particle indicating hearsay 56 , was first written<br />
with 啝, a marked phonetic loan of the semi-homophonous word wo 4 和 ‘peace’,<br />
which differs in the tone, yangping 陽平 (tone #4) rather than yangshang 陽上 (tone<br />
81<br />
✓
#5). However, Meyer (1947) also lists wo 4 as another pronunciation of the sentence-<br />
final particle, which would make it completely homophonous. By the 1970s (Yue<br />
1972), it was written with 喎, a marked phonetic loan of wa 1 咼 ‘crooked mouth’,<br />
which can serve as a wo phonetic, such as in wo 6 禍 ‘calamity’, wo 1 窩 ‘nest’, and wo 1<br />
鍋 ‘pan’. While the pronunciation of the sentence-final particle differed from wo 4 和<br />
‘peace’ at most in the tone, there is no direct relationship with wa 1 咼 ‘crooked<br />
mouth’.<br />
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
dap6 to pound U+642D 搭 daap3 搭 to join together<br />
U+6498 撘 daap3 撘 to join together<br />
�口扱 kap1 扱 to receive<br />
U+22C55 �扌耷<br />
long2 to rinse U+6717 朗 long5 朗 clear<br />
U+20E98 �口浪 long6 浪 wave<br />
ngap1 to jabber U+5438 吸 kap1 to inhale<br />
U+564F 噏 kap1 to inhale<br />
�口揖 ngap6 �口揖 to nod<br />
U+2103E �口�日絲 hin2 �口�日絲 to display<br />
wo5 SFP U+555D 啝 wo4 和 peace<br />
U+558E 喎 wa1 咼 crooked mouth<br />
Table 5.21: Erroneous Optimization of the Phonetic in a Phonetic Loan<br />
(Basis)<br />
<strong>The</strong> variety of optimizations is further illustrated by the polysyllabic words<br />
ngau 6 dau 6 ‘unwell; stupid’ 57 and ham 6 baang 6 laang 6 ‘all’ 58 . ngau 6 dau 6 ‘unwell; stupid’<br />
existed as early as the mid-nineteenth century (Williams 1856), but it is unclear what<br />
82
the written form of the second syllable was intended to be, as Williams (1856) does<br />
not provide characters for compounds. By the late nineteenth century (Williams 1909<br />
[1874]), it was written with �馬馬逗. �馬馬 is presumably an unmarked phonetic<br />
loan of �馬馬 ‘to gallop wildly’, although the pronunciation of the latter is unknown,<br />
while 逗 is an unmarked phonetic loan of the completely homophonous word dau 6 逗<br />
‘to stop’. By the 1940s (Meyer 1947), ngau 6 dau 6 ‘unwell; stupid’ was written with<br />
吽哣, marked phonetic loans of the completely homophonous words ngau 4 牛 ‘cow’<br />
and dau 6 豆 ‘bean’, respectively. ngau 4 牛 ‘cow’ is a clearer basis for a phonetic loan<br />
than �馬馬, while dau 6 豆 ‘bean’ is <strong>orthographic</strong>ally simpler than dau 6 逗 ‘to stop’.<br />
Similarly, ham 6 baang 6 laang 6 ‘all’, which is contracted to ham 6 ba 6 laang 6 or<br />
ham 6 blaang 6 , existed as early as the mid-nineteenth century (Williams 1856), where it<br />
was transcribed as hòm ɔ pa ɔ láng ɔ (ham 6 ba 6 laang 6 ), but it is unclear what the written<br />
form of the second and third syllables was intended to be, as Williams (1856) does not<br />
provide characters for compounds. <strong>The</strong> first syllable, ham 6 , is written with 喊, an<br />
unmarked phonetic loan of the semi-homophonous word haam 3 喊 ‘to call’, which<br />
differs in the final, -aam /-am/ rather than -am /-ɐm/, as well as the tone register, yinqu<br />
陰去 (tone #3) rather than yangqu 楊去 (tone #6). Lau (1977) instead lists �口感, a<br />
marked phonetic loan of the semi-homophonous word gam 2 感 ‘to feel’, which differs<br />
in the place of articulation of the homorganic final, g- /k-/ rather than h- /h-/, as well as<br />
the tone, yinshang 陰上 (tone #2) rather than yinqu 陰去 (tone #3). Although gam 2 感<br />
‘to feel’ does not differ in the final like haam 3 喊 ‘to call’ except for the tone, it has a<br />
different albeit related initial, which can be offset by being marked as a phonetic loan.<br />
83
Alternatively, �口感 may be analyzed as 喊 with an extraneous sam 1 心 ‘heart’<br />
radical added to it. However, Rao (1996) instead lists 冚, an unmarked phonetic loan<br />
of the semi-homophonous word ham 6 ‘to cover’, which differs in the final, -aam /-am/<br />
rather than -am /-ɐm/, but not the tone, unlike haam 3 喊 ‘to call’, as well as being<br />
<strong>orthographic</strong>ally simpler.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
ham6 U+558A 喊 ✓ ✓ ✓<br />
U+20FD1 �口感 ✓<br />
U+519A 冚 ✓<br />
n/a ∅ ✓<br />
baang6 U+20FB4 口棒 ✓ ✓<br />
�口捧 ✓<br />
U+552A 唪 ✓<br />
n/a ∅ ✓<br />
laang6 U+5464 呤 ✓<br />
U+5525 唥 ✓ ✓ ✓<br />
n/a ∅ ✓<br />
ngau6 U+2994B �馬馬 ✓ ✓<br />
U+543D 吽 ✓ ✓ ✓ ✓<br />
dau6 U+9017 逗 ✓<br />
U+54E3 哣 ✓ ✓ ✓ ✓<br />
Table 5.22: Optimization of Phonetics in Polysyllabic Phonetic Loans<br />
(History)<br />
<strong>The</strong> second syllable, baang 6 , is written with 口棒, a marked phonetic loan of<br />
the semi-homophonous word paang 5 棒 ‘staff’, which differs in the aspiration of the<br />
initial, p- /p h -/ rather than b- /p-/, as well as the tone, yangshang 陽上 (tone #5) rather<br />
84
than yangqu 陽去 (tone #6). O’Melia (1959) and Rao (1996) instead list �口捧 and<br />
唪, respectively, unmarked phonetic loans of the semi-homophonous words pung 2 捧<br />
‘to hold in both hands’ and fung 6 奉 ‘to serve’, which are less optimal than paang 5 棒<br />
‘staff’.<br />
Word Gloss Unicode Char Phonetic Loan of Char Gloss<br />
ham6 U+558A 喊 haam3 喊 to call<br />
U+20FD1 �口感 gam2 感 to feel<br />
U+519A 冚 ham6 冚 to cover<br />
baang6 U+20FB4 口棒 paang5 棒 staff<br />
�口捧 pung2 捧 to hold in both hands<br />
U+552A 唪 fung6 奉 to serve<br />
laang6 U+5464 呤 ling6 令 to command<br />
U+5525 唥 laang5 冷 cold<br />
ngau6 U+2994B �馬馬<br />
U+543D 吽 ngau4 牛 cow<br />
dau6 U+9017 逗 dau6 逗 to stop<br />
U+54E3 哣 dau6 豆 bean<br />
Table 5.23: Optimization of Phonetics in Polysyllabic Phonetic Loans<br />
(Basis)<br />
<strong>The</strong> third syllable, laang 6 , was first written with 呤, a marked phonetic loan of<br />
the semi-homophonous word ling 6 令 ‘to command’, which differs in the final, -ing /-<br />
iŋ/ rather than -aang /-aŋ/. By the 1950s (O’Melia 1959), it was written with 唥 59 , a<br />
marked phonetic loan of the semi-homophonous word laang 5 冷 ‘cold’, which differs<br />
in the tone, yangshang 陽上 (tone #5) rather than yangqu 陽去 (tone #6).<br />
85
5.5 Summary<br />
Marked phonetic loans are greatly preferred over unmarked phonetic loans,<br />
and in many cases, the former has already superseded the latter, although there are<br />
some characters that are still in the progress of transitioning to marked phonetic loans.<br />
In all cases, the preferred device for marking them as such is a hau 2 口 ‘mouth’ radical<br />
rather than a yan 4 亻(㆟) ‘person’ radical.<br />
However, unmarked phonetic loans still do exist, and unlike marked phonetic<br />
loans, the borrowed character tends to be completely homophonous or differs only in<br />
the tone. On the other hand, in marked phonetic loans, the borrowed character may<br />
differ in the initial, final, and/or tone, which is offset by the marking. When the<br />
borrowed character is not completely homophonous, it is usually preferred for the<br />
initial and tone to match if not the final, and for the initial to match if not the tone.<br />
Sometimes, the borrowed character may be replaced by one which is more<br />
homophonous or <strong>orthographic</strong>ally less complex, which is in some cases facilitated by<br />
a phonological merger which allows a more homophonous character to become<br />
available.<br />
86
Endnotes<br />
1 daat 3 ‘spot’. 笪. U+7B2A. Williams (1856: 510) t’átɔ (taat 3 ) “colloquial word”;<br />
Williams (1909 [1874]: 742) “in Cantonese”; Meyer (1947: #2967) taàt; O'Melia<br />
(1959: 4: 168) tàat; Yue (1972: 241) tA:t 4 “colloquial character”; Lau (1977: #471)<br />
daat 3 “CC”; Rao (1996: 27) dad 3 .<br />
2 gat 1 ‘to stab’. �吉刂. U+34E4. Williams (1856: 135) katɔ “colloquial word”;<br />
Aubazac (1909: 47) kat 4 ; Meyer (1947: #1056) kat; Yue (1972: 337) kɐt 5 “colloquial<br />
character”; Lau (1977: #860) gat 1o ; Rao (1996: 66) ged 1 .<br />
3 lo 2 ‘to take’. 攞. U+651E. Williams (1856: 248) c lo “colloquial word”; Williams<br />
(1909 [1874]: 536) “in Cantonese”; Aubazac (1909: 15) lo 2 ; Meyer (1947: #1636)<br />
lóh; O'Melia (1959: 4: 90) lóh; Yue (1972: 269) lɔ: 35 “colloquial character”; Lau<br />
(1977: #1948) loh 2 “CC”; Rao (1996: 131) lo 2 .<br />
4 naat 3 ‘to burn’. ① 鈉. U+9209. Williams (1856: 311) nátɔ “colloquial word”;<br />
Williams (1909 [1874]: 587) “in Cantonese”; Aubazac (1909: 19) náto; Meyer (1947:<br />
#1949) naàt; Yue (1972: 241) nA:t 4 “colloquial character”; Lau (1977: #2256) naat 3<br />
“Coll.”. ② 焫. U+712B. Rao (1996: 157) nad 3 .<br />
5 nam 4 ‘tender’. 腍. U+814D. Williams (1856: 307) cnam “colloquial word”;<br />
Williams (1909 [1874]: 409) “in Cantonese”; Meyer (1947: #1957) nām; Yue (1972:<br />
245) nɐm 21 “colloquial character”; Lau (1977: #2265) nam 4 “CC”, “Coll.”; Rao (1996:<br />
160 ) nem 4 .<br />
6 nau 1 ‘angry’. ① 嬲. U+5B32. Williams (1856: 311) cnau “colloquial word”;<br />
Williams (1909 [1874]: 595) “in Cantonese”; Aubazac (1909: 18) nao 1 ; Meyer (1947:<br />
#1971) nau; O'Melia (1959: 4: 108) nau; Yue (1972: 244) nɐŭ 53 “colloquial<br />
character”; Lau (1977: #2275) nau 1 “CC”, “Coll.”; Rao (1996: 160) neo 1 . ② 惱.<br />
U+60F1. Meyer (1947: #1971) nau.<br />
7 ngat 1 ‘to cram’. ① n/a. Williams (1856: 5, 724) atɔ (at 1 ) ngatɔ “colloquial word”.<br />
② 扤. U+6264. Williams (1909 [1874]: 896); Aubazac (1909: 19) ngat 4 ; Meyer<br />
(1947: #2045) ngat; Yue (1972: 337) ŋɐt 5 ; Lau (1977: #2341) ngat1 o “CC”; Rao<br />
(1996: 167) nged 1 .<br />
8 ngok 6 ‘to raise the head’. ① 咢. U+54A2. Williams (1856: 328) ngokɔ “colloquial<br />
word”; Williams (1909 [1874]: 605) “in Cantonese”; Aubazac (1909: 20) ngok4;<br />
Meyer (1947: #2069) ngôk; Yue (1972: 360) ngɔ:k 3 “colloquial character”; Lau (1977:<br />
#2362) ngok 6 “CC”, “Coll.” ② �岳頁. U+294E5. Rao (1996: 171) ngog 6 .<br />
87
9 ning 1 ‘to carry; to bring’. ① 擰. U+64F0. Williams (1856: 332) cning “colloquial<br />
word”; Williams (1909 [1874]: 599) “in Cantonese”; Aubazac (1909: 20) ning 1 ;<br />
Meyer (1947: #2087) ning; O'Melia (1959: 4: 109) ning; Yue (1972: 255) nɪŋ 53<br />
“colloquial character”; Lau (1977: #2383) ning 1 . ② �扌寍. Meyer (1947: #2087)<br />
ning. ③ 拎. U+62CE. Rao (1996: 174) ning 1 , ling 1 (ling 1 ).<br />
10 wan 2 ‘to find’. ① 搵. U+6435. Williams (1909 [1874]: 889) “in Cantonese”;<br />
Meyer (1947: #3750) wán; O'Melia (1959: 4: 222) wán; Lau (1977: #3219) wan 2<br />
“CC”, “Coll.”. ② 揾. U+63FE. Williams (1856: 662) c wan “colloquial word”;<br />
Aubazac (1909: 43) wan 2 ; O'Melia (1959: 4: 222) wán; Yue (1972: 379) ŭɐn 35<br />
“colloquial character”; Rao (1996: 223) wen 2 .<br />
11 wan 3 ‘to confine’. ① 韞. U+97DE. Meyer (1947: #3753) wàn, wán (wan 2 ); Lau<br />
(1977: #3220) wan 3 “Coll.”. ② �韋昷. Williams (1856: 662) wan ɔ “colloquial<br />
word”; Rao (1996: 223) wen 3 . ③ 縕. U+7E15. Meyer (1947: #3753) wàn, wán<br />
(wan 2 ). ④ 緼. U+7DFC. Yue (1972: 379) ŭɐn 44 “colloquial character”.<br />
12 dim 6 ‘straight’. ① 掂. U+6382. Williams (1856: 518) tím ɔ “colloquial word”;<br />
Williams (1909 [1874]: 787) “in Cantonese”; Aubazac (1909: 36) tim3; Meyer (1947:<br />
#3075) tîm; O'Melia (1959: 4: 177) tîm; Yue (1972: 259) ti:m 33 “colloquial word”;<br />
Lau (1977: #539) dim 6 “CC”, “Coll.”. ② 敁. U+6541. Williams (1909 [1874]: 787)<br />
“in Cantonese”. ③ �口店. U+20DA7. Rao (1996: 42) dim 6 .<br />
13 ngan 3 ‘to jiggle the feet’. ① 奀. U+5940. Meyer (1947: #2039) ngàn; Yue (1972:<br />
334) ŋɐn 44 “colloquial character”; Lau (1977: #2334) ngan 3 “CC”, “Coll.”. ②<br />
�足辰. U+47F4. Rao (1996: 169) ngen 3 .<br />
14 ung 2 ‘to push’. ① 擁. U+64C1. Williams (1856: 649) c ung “colloquial word”;<br />
Williams (1909 [1874]: 941); Aubazac (1909: 21) oung 1 (ung 1 ); Meyer (1947: #3687)<br />
úng; O'Melia (1959: 4: 217) úng; Yue (1972: 392) ʔʊŋ 35 “colloquial character”; Lau<br />
(1977: #3144) ung 2 “Coll.”. ② �巩手. U+39EC. Rao (1996: 173) ngung 2 (ngung 2 ),<br />
ung 2 . ③ �扌戎. U+22B2E. Rao (1996: 173) ngung 2 (ngung 2 ), ung 2 .<br />
15 m 4 ‘not’. 唔. U+5514. Williams (1856: 268) c‘m “colloquial word”; Williams<br />
(1909 [1874]: 893) “in Cantonese”; Aubazac (1909: 16) m1; Meyer (1947: #1724) m̄;<br />
O’Melia (1959: 4: 93) m̄; Yue (1972: 398) m̩ 21 “colloquial character”; Lau (1977:<br />
#2032) m 4 “CC”; Rao (1996: 138) m 4 .<br />
16 mat 1 ‘what’. 乜. U+4E5C. Williams (1856: 279) matɔ “colloquial word”;<br />
Williams (1909 [1874]: 571)“in Cantonese”; Aubazac (1909: 17) mat 4 ; Meyer (1947:<br />
88
#1790) mat, mi (mi 1 ); O'Melia (1959: 4: 99) mat, mi (mi 1 ); Yue (1972: 215) mɐt 5<br />
“colloquial character”; Lau (1977: #2105) mat 1 ° “CC”; Lau (1977: #2135) mi 1 ° (mi 1 )<br />
“CC”; Rao (1996: 146) med 1 , mé 1 (me 1 ).<br />
17 kat 1 ‘card’. ① n/a. Yue (1972: 329) k’A:t 5 . ② 咭. U+54AD. Meyer (1947:<br />
#1062) k’at; Lau (1977: #1644) kaat 1o “CC”; Rao (1996: 110) ked 1 .<br />
18 yai 5 ‘bad’. ① �口兮. U+20BCB. Williams (1856: 674) cyai (yai 4 ); Williams<br />
(1909 [1874]: 395) “in Cantonese”; Aubazac (1909: 44) yai1 (yai 4 ); Meyer (1947:<br />
#3800) yaī (yai 4 ); Lau (1977: #3306) yai 4 (yai 4 ) “CC”, Lau (1977: #3307) yai 5 “CC”.<br />
② n/a. Yue (1972: 288) ĭɐĭ 24 “colloquial character”. ③ 曳. U+66F3. Rao (1996:<br />
240) yei 5 , yei 4 (yai 4 ), yei 6 (yai 6 ).<br />
19 lok 3 ‘sentence-final particle’. 咯. U+54AF. Williams (1856: 253) lokɔ “colloquial<br />
final particle”; Williams (1909 [1874]: 536); Aubazac (1909: 15) loko; Meyer (1947:<br />
#1647) lòk; O'Melia (1959: 4: 91) lòk; Yue (1972: 272) lɔk 4 “colloquial character”;<br />
Lau (1977: #1964) lok 3 “CC”; Rao (1996: 132) log 3 .<br />
20 ge 3 ‘genitive particle’. 嘅. U+5605. Williams (1856: 145) ké ɔ ; Williams (1909<br />
[1874]: 425) “in Cantonese”; Aubazac (1909: 9) ké 3 ; Meyer (1947: #1090) kè; O'Melia<br />
(1959: 4: 65) kè; Yue (1972: 339) kɛ: 44 “colloquial character”; Lau (1977: #875) ge 3 ;<br />
Rao (1996: 65) gé 3 .<br />
21 gip 1 ‘bag’. 喼. U+55BC. Meyer (1947: #1174) kìp (gip 3 ); Yue (1972: 349) ki:p 5<br />
“colloquial character”; Lau (1977: #924) gip 1o “CC”, “Coll.”; Rao (1996: 73) gib 1 .<br />
22 mak 1 ‘mark’. ① 嚜. U+569C. Williams (1909 [1874]: 582) “in Cantonese”;<br />
Meyer (1947: #1766) mak; O'Melia (1959: 4: 98) mak; Yue (1972: 215) mɐk 5<br />
“colloquial character”; Lau (1977: #2081) mak 1o ; “CC”, “Coll.”; Rao (1996: 147)<br />
meg 1 . ② 嘜. U+561C. O'Melia (1959: 4: 98) mak; Rao (1996: 147) meg 1 .<br />
23 miu 2 ‘to purse the lips’. �口妙. U+20D15. Meyer (1947: #1834) miú; Yue<br />
(1972: 222) mi:ŭ 35 “colloquial character”; Lau (1977: #2153) miu 2 “CC”, “Coll.”; Rao<br />
(1996: 152) miu 2 .<br />
24 gaat 6 jaat 6 ‘cockroach’. ① 甴曱. U+7534 U+66F1. Williams (1856: 117, 560) cká<br />
tsátɔ (ga 1 jaat 6 ) “colloquial word”; Meyer (1947: #987, 690) kaât tsâat; O'Melia (1959:<br />
4: 60) kâat tsâat; Yue (1972: 329, 286) kA:t 3 tsA:t 3 “colloquial character”. ② 曱甴.<br />
U+66F1 U+7534. Lau (1977: #817) gaat 6 jaat 6 * (gaat 6 jaat 6-2 ) “CC”, “Coll.”; Rao<br />
(1996: 62) gad 6 zad 6 , ged 6 zed 6 (gat 6 jat 6 ).<br />
89
25 In fact, the contemporary arrangement, 曱甴, appears as early as the late nineteenth<br />
century (Chalmers 1878: 40), but this is probably an isolated case, as it does not<br />
appear in later sources in this study until Yue (1972). Thanks to Professor Marjorie<br />
Chan for this observation.<br />
26 a 1 ‘sentence-final particle’. ① 丫. U+4E2B. Williams (1909 [1874]: 899) “in<br />
Cantonese”. ② 吖. U+5416. Williams (1856: 1) cá “colloquial word”; Williams<br />
(1909 [1874]: 899) “in Cantonese”; Meyer (1947: #2) a; O'Melia (1959: 4: 1) a, nga<br />
(nga 1 ); Yue (1972: 370) ʔA: 53 “colloquial character”; Lau (1977: #2) a 1o “CC”; Rao<br />
(1996: 1) a 1 . ③ 呀. U+5440. Aubazac (1909: 1) a 1 .<br />
27 dei 6 ‘plural marker’. ① 地. U+5730. Williams (1909 [1874]: 774) “in<br />
Cantonese”. ② 哋. U+54CB. Williams (1856: 515) tí ɔ “colloquial word”; Williams<br />
(1909 [1874]: 774) “in Cantonese”; Meyer (1947: #3044) teî; O’Melia (1959: 4: #174)<br />
teî; Yue (1972: 254) teĭ 33 “colloquial character”; Lau (1977: #517) dei 1 “CC”; Rao<br />
(1996: 241) déi 6 .<br />
28 gam 2 ‘so (manner)’. ① 敢. U+6562. Williams (1909 [1874]: 427) “in<br />
Cantonese”. ② 噉. U+5649. Williams (1856: 173) c kòm; Aubazac (1909: 10) kom 2 ;<br />
Meyer (1947: #1224) kóm; O'Melia (1959: 4: 72) kóm; Yue (1972: 333) kɐm 35<br />
“colloquial character”; Lau (1977: #839) gam 2 “CC”; Rao (1996: 69) gem 2 .<br />
29 gam 3 ‘so (quantity)’. ① 甘. U+7518. Williams (1909 [1874]: 426) “in<br />
Cantonese”. ② 咁. U+5481. Williams (1856: 173) kòm ɔ ; Williams (1909 [1874]:<br />
426) “in Cantonese”; Aubazac (1909: 10) kom 3 ; Meyer (1947: #1228) kòm; O'Melia<br />
(1959: 4: 73) kòm; Yue (1972: 333) kɐm 44 “colloquial character”; Lau (1977: #840)<br />
gam 3 “CC”; Rao (1996: 70) gem 3 .<br />
30 gwa 3 ‘a sentence-final particle’. ① 卦. U+5366. Williams (1909 [1874]: 461) “in<br />
Cantonese”. ② 啩. U+5569. Williams (1856: 201) kw’á ɔ ; Williams (1909 [1874]:<br />
461) “in Cantonese”; Meyer (1947: #1350) kwà; O'Melia (1959: 4: 78) kwà; Yue<br />
(1972: 370) kwA: 44 “colloquial character”; Lau (1977: #1014) gwa 3 “CC”; Rao (1996:<br />
78) gua 3 .<br />
31 haai 4 ‘coarse’. ① 鞋. U+978B. Williams (1909 [1874]: 318) “in Cantonese”. ②<br />
嚡. U+56A1. Williams (1856: 69) chái; Aubazac (1909: 3) hái1; Meyer (1947: #619)<br />
haaī; Lau (1977: #1088) haai 4 “CC”, “Coll.”; Rao (1996: 87) hai 4 . ③ n/a. Yue (1972:<br />
324) hA:ĭ 21 “colloquial character”.<br />
32 hai 2 ‘to be at’. ① 係. U+4FC2. Williams (1909 [1874]: 301) “in Cantonese”. ②<br />
喺. U+55BA. Williams (1856: 68) c hai; Aubazac (1909: 3) hai 2 ; Meyer (1947: #654)<br />
90
haí; O'Melia (1959: 4: 42) hái; Yue (1972: 331) hɐĭ 35 “colloquial character”; Lau<br />
(1977: #1114) hai 2 “CC”; Rao (1996: 91) hei 2 .<br />
33 kwaak 1 ‘loop; to loop’. ① 緙. U+7DD9. Williams (1856: 207) kw’ákɔ “colloquial<br />
word”; Williams (1909 [1874]: 446) “in Cantonese”. ② �口緙. U+210C8. Meyer<br />
(1947: #1358) kwaàk (gwaak 3 ), kw’aàk (kwaak 3 ); Yue (1972: 375) kw’A:k 5<br />
“colloquial character”; Rao (1996: 114) kuag 3-1 (kwaak 3-1 ), kuag 3 (kwaak 3 ). ③<br />
�口隙. Lau (1977: #1728) kwaat 1o (kwaat 1 ) “CC”, “Coll.”.<br />
34 lai 4 ‘to come’. ① 黎. U+9ECE. Williams (1909 [1874]: 505) “in Cantonese”. ②<br />
嚟. U+569F. Williams (1856: 217) clai “colloquial word”; Williams (1909 [1874]:<br />
505) “in Cantonese”; Aubazac (1909: 13) lai1; Meyer (1947: #1461) laī; O'Melia<br />
(1959: 4: 84) lāi, lēi (lei 4 ); Yue (1972: 243) lɐĭ 21 “colloquial character”; Lau (1977:<br />
#1792) lai 4 “CC”; Rao (1996: 124) lei 4 , léi 4 (lei 4 ).<br />
35 mai 5 ‘do not’. ① 米. U+7C73. Williams (1909 [1874]: 568) “in Cantonese”. ②<br />
咪. U+54AA. Williams (1856: 271) c mai “colloquial word”; Aubazac (1909: 16)<br />
mai2; Meyer (1947: #1764) maĭ; O'Melia (1959: 4: 98) măi; Yue (1972: 211) mɐĭ 24<br />
“colloquial character”; Lau (1977: #2078) mai 5 “CC”; Rao (1996: 148) mei 5 .<br />
36 mo 1 ‘slow’. ① 摩. U+6469. Williams (1909 [1874]: 578) “in Cantonese”. ② 嚤.<br />
U+56A4. Williams (1856: 292) cmo “colloquial word”; Aubazac (1909: 17) mo 1 ;<br />
Meyer (1947: #1864) mo; Lau (1977: #2182) moh 1o “CC”, “Coll.”; Rao (1996: 152)<br />
mo 1 . ③ n/a. Yue (1972: 224) mɔ: 53 .<br />
37 ngaam 1 ‘correct’. ① 岩. U+5CA9. Williams (1909 [1874]: 916) “in Cantonese”.<br />
② 啱. U+5571. Williams (1856: 319) cngám “colloquial word”; Aubazac (1909: 19)<br />
ngám 1 ; Meyer (1947: #2011) ngaam; Yue (1972: 326) ŋA:m 53 “colloquial character”;<br />
Lau (1977: #2310) ngaam 1o “CC”; Rao (1996: 163) ngam 1 .<br />
38 di 1 ‘some’. ① n/a. Williams (1856: 514) ctí “colloquial corruption”. ② 的.<br />
U+7684. Williams (1909 [1874]: 771) “in Cantonese”; Meyer (1947: #3058) ti. ③<br />
啲. U+5572. Aubazac (1909: 36) ti 1 ; Meyer (1947: #3058) ti; O'Melia (1959: 4: 176)<br />
ti, tit (dit 1 ); Yue (1972: 257) ti: 53 “colloquial character”; Lau (1977: #529) di 1o “CC”;<br />
Rao (1996: 40) di 1 , did 1 (dit 1 ).<br />
39 ye 5 ‘thing’. ① 野. U+91CE. Williams (1856: 691) c yé “colloquial word”;<br />
Williams (1909 [1874]: 911) “in Cantonese”; Meyer (1947: #3867) yĕ; O'Melia (1959:<br />
4: 231) yĕ. ② 嘢. U+5622. Yue (1972: 296) yɛ: 24 “colloquial character”; Lau (1977:<br />
#3376) ye 5 “CC”, “Coll.”; Rao (1996: 236) yé 5 . ③. 埜. U+57DC. Williams (1909<br />
[1874]: 911) “in Cantonese”.<br />
91
40 tau 2 ‘to rest’. ① 抖. U+6296. Williams (1856: 513) c t’au “colloquial word”;<br />
Meyer (1947: #3037) t’aú; Lau (1977: #3044) tau 2 “CC”, “Coll.”. ② 唞. U+551E.<br />
Aubazac (1909: 31) t’ao 2 ; Meyer (1947: #3037) t’aú; O'Melia (1959: 4: 173) t’áo; Lau<br />
(1977: #3044) tau 2 “CC”, “Coll.”; Rao (1996: 213) teo 2 . ③ �咅攴. U+3A97. Yue<br />
(1972: 244) t’ɐŭ 35 “colloquial word”.<br />
41 yuk 1 ‘to move’. ① 郁. U+90C1. Williams (1856: 705) yukɔ “colloquial word”;<br />
Williams (1909 [1874]: 949) “in Cantonese”; Aubazac (1909: 45) youk 4 ; Meyer (1947:<br />
#3930) yuk; O'Melia (1959: 4: 235) yuk; Lau (1977: #3575) yuk 1o “Coll.”. ② 喐.<br />
U+5590. Meyer (1947: #3930) yuk; Yue (1972: 319) yʊk 5 “colloquial character”; Rao<br />
(1996: 250) yug 1 .<br />
42 ga 3 ‘sentence-final particle’. ① 架. U+67B6. O'Melia (1959: 4: 56) kà. ②<br />
�口架. U+35CE. Meyer (1947: #939) kà; Yue (1972: 323) kA: 44 “colloquial<br />
character”; Lau (1977: #785) ga 3 “CC”; Rao (1996: 61) ga 3 . ③ �口駕. U+210C9.<br />
Meyer (1947: #939) kà.<br />
43 go 2 ‘that’. ① 個. U+500B. Williams (1856: 167) ko ɔ (go 3 ); Williams (1909<br />
[1874]: 444); Aubazac (1909: 10) ko 3 (go 3 ); Meyer (1947: #1206) kóh; Meyer (1947:<br />
#1207) kòh (go 3 ); O'Melia (1959: 4: 70) kòh (go 3 ). ② 嗰. U+55F0. Meyer (1947:<br />
#1206) kóh; O'Melia (1959: 4: 70) kóh; Yue (1972: 355) kɔ: 35 “colloquial character”;<br />
Lau (1977: #944) goh 2 “CC”; Rao (1996: 75) go 2 . ③ 箇. U+7B87. Williams (1856:<br />
167) ko ɔ (go 3 ), c ko; Williams (1909 [1874]: 444); Meyer (1947: #1207) kòh (go 3 ). ④<br />
个. U+4E2A. Williams (1856: 167) ko ɔ (go 3 ); Williams (1909 [1874]: 444); Aubazac<br />
(1909: 10) ko 3 (go 3 ); Meyer (1947: #1207) kòh (go 3 ).<br />
44 bai 6 ‘bad’. ① 弊. U+5F0A. Williams (1856: 347) pai ɔ ; Meyer (1947: #2265) paî;<br />
O'Melia (1959: 4: 123) pâi; Yue (1972: 211) pɐĭ 33 ; Lau (1977: #61) bai 6 . ② �敝大.<br />
U+21681. Williams (1856: 347) pai ɔ . ③ �口弊. U+210C7. Rao (1996: 7) bei 6 .<br />
45 ngai 1 ‘to beg’. ① 偽. U+507D. Williams (1909 [1874]: 886) “in Cantonese”. ②<br />
�口偽. U+20F2E. Williams (1856: 316) cngai “colloquial word”; Aubazac (1909:<br />
19) ngai 1 ; Meyer (1947: #2029) ngai; Rao (1996: 168) ngei 1 . ③ �口危. U+20C53.<br />
Meyer (1947: #2029) ngai; Yue (1972: 331) ŋɐĭ 53 “colloquial character”; Lau (1977:<br />
#2325) ngai 1 “CC”, “Coll.”.<br />
46 gau 6 ‘lump’. ① 倃. U+5003. Williams (1856: 140) kau ɔ “colloquial word”;<br />
Williams (1909 [1874]: 167) “in Cantonese”; Aubazac (1909: 8) kao3; Meyer (1947:<br />
92
#1077) kaû. ② 嚿. U+56BF. Meyer (1947: #1077) kaû; O'Melia (1959: 4: 65) kâu;<br />
Yue (1972: 332) kɐŭ 33 “colloquial character”; Rao (1996: 73) geo 6 .<br />
47 tam 3 ‘to deceive’. ① 噤. U+5664. Williams (1856: 498) t’am ɔ “colloquial word”;<br />
Williams (1909 [1874]: 149) “in Cantonese”; Aubazac (1909: 30) t’am 3 ; Meyer (1947:<br />
#3003) t’àm; O'Melia (1959: 4: 171) t’àm; Yue (1972: 245) t’ɐm 44 “colloquial<br />
character”. ② �口氹. U+20C41. Meyer (1947: #3004) t’àm; Lau (1977: #3035)<br />
tam 3 “CC”, “Coll.”. ③ �言��冖八木. U+27A3E. Rao (1996: 213) tem 3 .<br />
48 saai 3 ‘quantifying particle’. ① 徙. U+5F99. Williams (1909 [1874]: 300) “in<br />
Cantonese”. ② 嘥. U+5625. Williams (1856: 405) sái ɔ “colloquial word”; Aubazac<br />
(1909: 25) sái 3 ; Meyer (1947: #2527) saaì; O'Melia (1959: 4: 137) saài; Yue (1972:<br />
280) sA:ĭ 44 “colloquial character”. ③ 晒. U+6652. Rao (1996: 188) sai 3 . ④ 嗮.<br />
U+55EE. Lau (1977: #2595) saai 3 “CC”.<br />
49 jo 2 ‘ perfective aspect marker’. ① 阻. U+963B. Williams (1856: 25*) c cho<br />
“colloquial word”; Williams (1909 [1874]: 833) “in Cantonese”. ② 唨. U+5528.<br />
Aubazac (1909: 35) tcho 2 ; Meyer (1947: #323) chóh; O'Melia (1959: 4: 21) chóh; Lau<br />
(1977: #1539) jo 2 “CC”. ③ 咗. U+5497. Yue (1972: 313) tsɔ: 35 “colloquial<br />
character”; Rao (1996: 262) zo 2 .<br />
50 ngak 1 ‘to trick’. ① 阨. U+9628. Williams (1856: 3, 318) ákɔ (aak 1 ), akɔ (ak 1 ),<br />
ngakɔ “colloquial word”; Williams (1909 [1874]: 605) “in Cantonese”; Aubazac<br />
(1909: 19) ngak 4 ; Meyer (1947: #2036) ngak; O'Melia (1959: 4: 113) ngak. ② 呃.<br />
U+5443. Meyer (1947: #2036) ngak; Yue (1972: 330) ŋA:k 5 (ngaak 1 ) “colloquial<br />
character”; Lau (1977: #22) ak 1o (ak 1 ) “CC”; Rao (1996: 163) ngag 1 (ngaak 1 ). ③ 眲.<br />
U+7732. Rao (1996: 163) ngag 1 (ngaak 1 ).<br />
51 la 3 ‘sentence-final particle’. ① 罅. U+7F45. Williams (1909 [1874]: 306) “in<br />
Cantonese”. ② �阝虖. U+28EF2. Williams (1909 [1874]: 306) “in Cantonese”. ③<br />
鏬. U+93EC. Williams (1909 [1874]: 306) “in Cantonese”. ④ �土虖. U+3664.<br />
Williams (1909 [1874]: 306) “in Cantonese”. ⑤ �口鏬. Williams (1856: 217) lá ɔ ;<br />
Meyer (1947: #1427) là. ⑥ 嚹. U+56B9. Meyer (1947: #1427) là; Yue (1972: 234)<br />
lA: 44 “colloquial character”; Rao (1996: 116) la 3 . ⑦ 嘞. U+561E. O'Melia (1959: 4:<br />
83) là. ⑧ 喇. U+5587. Lau (1977: #1755) la 3 ; Rao (1996: 116) la 3 .<br />
52 me 1 ‘sentence-final particle’. ① �貝子. U+27D2F. Williams (1856: 283) cmé;<br />
Williams (1909 [1874]: 571) “unauthorized”, “in Cantonese”. ② 咩. U+54A9.<br />
Aubazac (1909: 17) mé 1 ; Meyer (1947: #1800) me; O'Melia (1959: 4: 101) meh; Yue<br />
93
(1972: 216) mɛ: 53 “colloquial character”; Lau (1977: #2120) me 1o “CC”; Rao (1996:<br />
146) mé 1 .<br />
53 dap 6 ‘to pound’. ① 搭. U+642D. Meyer (1947: #3020) tâp; Yue (1972: 248) tɐp 3<br />
“colloquial character”. ② 撘. U+6498. Meyer (1947: #3020) tâp. ③ �口扱. Lau<br />
(1977: #498) dap 6 “CC”, “Coll.”. ④ �扌耷. U+22C55. Rao (1996: 33) deb 6 .<br />
54 long 2 ‘to rinse’. ① n/a. Williams (1856: 722) c long; Yue (1972: 271) nɔ:ŋ 35<br />
(nong 2 ), lɔ:ŋ 35 “colloquial word”. ② 朗. U+6717. Williams (1909 [1874]: 499) “in<br />
Cantonese”. ③ �口浪. U+20E98. Aubazac (1909: 15) long 2 ; Meyer (1947: #1653)<br />
lóng; Lau (1977: #1972) long 2 “CC”; Rao (1996: 132) long 2 .<br />
55 ngap 1 ‘to jabber’. ① 吸. U+5438. Williams (1856: 321) ngapɔ “colloquial<br />
word”; Williams (1909 [1874]: 294) “in Cantonese”; Aubazac (1909: 19) ngap 4 ;<br />
Meyer (1947: #2043) ngap; Yue (1972: 336) ŋɐp 5 “colloquial character”. ② 噏.<br />
U+564F. Williams (1909 [1874]: 294) “in Cantonese”; Rao (1996: 167) ngeb 1 . ③<br />
�口揖. Lau (1977: #2339) ngap 1o “CC”, “Coll.”. ④ �口�日絲. U+2103E. Rao<br />
(1996: 167) ngeb 1 .<br />
56 wo 5 ‘sentence-final particle’. ① 啝. U+555D. Meyer (1947: #3780) wôh (wo 6 ),<br />
wòh (wo 3 ), wōh (wo 4 ); O'Melia (1959: 4: 224) wŏh. ② 喎. U+558E. Yue (1972:<br />
386) ŭɔ: 24 “colloquial character”; Lau (1977: #3247) woh 5 ; Rao (1996: 225) wo 5 , wo 3<br />
(wo 3 ).<br />
57 ngau 6 dau 6 ‘unwell; stupid’. ① �馬馬〇. U+2994B ?. Williams (1856: 323)<br />
ngau ɔ tau ɔ . ② �馬馬逗. U+2994B U+9017. Williams (1909 [1874]: 8) “in<br />
Cantonese”. ③ 吽哣. U+543D U+54E3. Meyer (1947: #2050 ngaû taû; Yue (1972:<br />
332, 244) ŋɐŭ 33 tɐŭ 33 “colloquial character”; Lau (1977: #2348) ngau 6 dau 6 “CC”,<br />
“Coll.”; Rao (1996: 170) ngeo 6 deo 6 .<br />
58 ham 6 baang 6 laang 6 ‘all’. ① 喊〇〇. U+558A ? ?. Williams (1856: 92)<br />
hòm ɔ pa ɔ láng ɔ (ham 6 ba 6 laang 6 ). ② 喊�口棒呤. U+558A U+20FB4 U+5464. Meyer<br />
(1947: #662) hâmpanglâng (ham 6 bang 1 lang 6 ), hâmpalâng (ham 6 ba 1 lang 6 ). ③<br />
喊�口捧唥. U+558A ? U+5525. O'Melia (1959: 4: 43) hâmpânglâng<br />
(ham 6 bang 6 lang 6 ), hâmpâlâng (ham 6 ba 6 lang 6 ). ④ n/a. Yue (1972: 333)<br />
hɐm 33 pA 33 lA:ŋ 33 “colloquial word”. ⑤ �口感�口棒唥. U+20FD1 U+20FB4<br />
U+5525. Lau (1977: #1124) ham 6 baang 6 laang 6 , ham 6 blaang 6 (ham 6 blaang 6 ) “CC”,<br />
“Coll.”. ⑥ 冚唪唥. U+519A U+552A U+5525. Rao (1996: 92) hem 6 baang 6 laang 6 .<br />
94
59 唥 actually appears consistently for laang 6 in sources as early as the mid-nineteenth<br />
century, suggesting that the 呤 form listed by Meyer (1947) is an isolated exception.<br />
Thanks to Professor Marjorie Chan for this observation.<br />
95
CHAPTER 6<br />
SIGNIFIC-PHONETIC CHARACTERS<br />
Signific-Phonetic characters, which are the analogue of the xingsheng 形聲<br />
‘phonetic compounds’ principle in the traditional liushu 六書 model, are characters<br />
which combine two characters together, one as a signific to indicate its general<br />
meaning, while the other is used in rebus fashion as a phonetic. However, the<br />
traditional model is insufficiently defined with regards to whether it includes what are<br />
marked phonetic loans in the model used here, and rather than impose an<br />
interpretation, we establish a new principle modeled after it.<br />
<strong>The</strong> most basic signific-phonetic characters are those where the phonetic is<br />
completely homophonous, such as yeun 6 ‘animal liver’ 1 , which is written with 膶,<br />
composed of a yuk 6 月(肉) ‘flesh’ signific and a yeun 6 閏 ‘intercalary’ phonetic.<br />
Similarly, chi 1 ‘to stick’ 2 is written with 黐, composed of a syu 2 黍 ‘millet’ signific<br />
and a chi 1 离 ‘mountain spirit’ phonetic which is also written as 魑 (HYDZD 1: 287).<br />
Less commonly, it is written with �米离 or its vulgar form �米禽 (HYDZD 5:<br />
3161), both of which have a mai 5 米 ‘rice’ radical instead of syu 2 黍 ‘millet’ for the<br />
signific.<br />
96
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
chi1 to stick U+9ED0 黐 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
U+25EF �米离 ✓<br />
yeun6 animal<br />
liver<br />
F<br />
U+25F1<br />
D<br />
�米禽 ✓<br />
�口笞 ✓<br />
n/a ∅ ✓<br />
U+81B6 膶 ✓ ✓ ✓ ✓ ✓<br />
Table 6.1: Signific-Phonetic Characters<br />
with Completely Homophonous Phonetics (History)<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
chi1 to stick U+9ED0 黐 syu2 黍 millet chi1 离 mountain<br />
spirit<br />
U+25EFF �米离 mai5 米 rice chi1 离 mountain<br />
spirit<br />
U+25F1D �米禽 mai5 米 rice chi1 禽(离) mountain<br />
spirit<br />
�口笞 chi1 笞 to flog<br />
yeun6 animal liver U+81B6 膶 yuk6 月(肉) flesh yeun6 閏 intercalary<br />
Table 6.2: Signific-Phonetic Characters<br />
with Completely Homophonous Phonetics (Basis)<br />
<strong>The</strong> phonetic in a signific-phonetic character can be less than homophonous by<br />
differing in the tone register, such as deng 3 掟 ‘to throw’ 3 , laan 1 躝 ‘to crawl’ 4 , mit 1 搣<br />
‘to pinch; to tear’ 5 , and na 1 �疒拏 ‘scar’ 6 , where the word’s tone belongs to the yin<br />
陰 register, but its phonetic belongs to the yang 陽 register. Meanwhile, the tone<br />
category remains the same: laan 1 躝 ‘to crawl’ and its phonetic laan 4 闌 ‘fence’ and<br />
na 1 �疒拏 ‘scar’ and its phonetic na 4 拏 ‘to take’ are all ping 平 tones, deng 3 掟 ‘to<br />
97
throw’ and its phonetic ding 6 /deng 6 定 ‘certain’ are both qu 去 tones, and mit 1 搣 ‘to<br />
pinch; to tear’ and its phonetic mit 6 烕/灭 ‘to extinguish’ are both ru 入 tones.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
deng3 to throw U+639F 掟 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
laam3 to step<br />
over<br />
U+77F4 矴 ✓<br />
U+2814F �足嵐 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+280BE �足南 ✓<br />
laan1 to crawl U+8E9D 躝 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
lam6 to pile<br />
up<br />
leu1 to spit<br />
out<br />
mit1 to pinch;<br />
to tear<br />
U+3A06 �扌林 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+7F67 罧 ✓<br />
U+51A7 冧 ✓<br />
n/a ∅ ✓<br />
U+269F2 �舌累 ✓ ✓ ✓ ✓<br />
U+6423 搣 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
�扌灭 ✓<br />
na1 scar U+24E3B �疒拏 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
U+24DB8 �疒那 ✓<br />
Table 6.3: Signific-Phonetic Characters<br />
with Phonetics Differing in the Tone (History)<br />
<strong>The</strong> phonetic in a signific-phonetic character can also differ more greatly in<br />
tone such that there is no direct relationship with the tone of the word, such as the<br />
other form of na 1 ‘scar’ given by Rao (1996), �疒那, which has a na 2 那 ‘that’<br />
phonetic that differs in the tone, yinshang 陰上 (tone #2) rather than yinping 陰平<br />
(tone #1). Similarly, lam 6 �扌林 ‘to pile up’ 7 has a lam 4 林 ‘forest’ phonetic which<br />
differs in the tone, yangping 陽平 (tone #4) rather than yangqu 陽去 (tone #6).<br />
98
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
deng3 to throw U+639F 掟 sau2 扌(手) hand ding6/<br />
deng6<br />
定 certain<br />
U+77F4 矴 ding3 矴 anchor<br />
laam3 to step over U+2814F �足嵐 juk1 足 foot laam4 嵐 mist<br />
U+280BE �足南 juk1 足 foot naam4 南 south<br />
laan1 to crawl U+8E9D 躝 juk1 足 foot laan4 闌 fence<br />
lam6 to pile up U+3A06 �扌林 sau2 扌(手) hand lam4 林 forest<br />
U+7F67 罧 mong5 罒(网) net lam4 林 forest<br />
U+51A7 冧 lam1 冧 bud<br />
leu1 to spit out U+269F2 �舌累 sit6 舌 tongue leui6 累 to<br />
accumulate<br />
mit1 to pinch;<br />
to tear<br />
U+6423 搣 sau2 扌(手) hand mit6 烕 to extinguish<br />
�扌灭 sau2 扌(手) hand mit6 灭 to extinguish<br />
na1 scar U+24E3B �疒拏 bing6 疒(病) sick na4 拏 to take<br />
U+24DB8 �疒那 bing6 疒(病) sick na2 那 that<br />
Table 6.4: Signific-Phonetic Characters<br />
with Phonetics Differing in the Tone (Basis)<br />
Likewise, leu 1 �舌累 ‘to spit out’ 8 has a leui 6 累 ‘to accumulate’ phonetic<br />
which differs in the tone, yangqu 陽去 (tone #6) rather than yinping 陰平 (tone #1).<br />
Furthermore, the phonetic also differs in the final, -eui /-øy/ rather than -eu /-œ/.<br />
However, Yue (1972) also lists leui 1 as an alternative pronunciation for ‘to spit out’,<br />
which would make the phonetic completely homophonous except for tone.<br />
Similarly, laam 3 �足嵐 ‘to step over’ 9 has a laam 4 嵐 ‘mist’ phonetic which<br />
differs in the tone, yangping 陽平 (tone #4) rather than yinqu 陰去 (tone #3). On the<br />
other hand, Rao (1996) also lists laam 3 �足南 ‘to step over’, which has a naam 4 南<br />
‘south’ phonetic, which besides tone also differs in the manner of articulation of the<br />
initial consonant, n- /n-/ instead of l- /l-/. However, he also gives naam 3 as an<br />
99
alternative pronunciation for ‘to step over’, which suggests that the naam 4 南 ‘south’<br />
phonetic was chosen on basis of the naam 3 pronunciation, or by a speaker who<br />
substitutes the liquid l- /l-/ initial for the nasal n- /n-/ initial and pronounces 南 ‘south’<br />
as *laam 3 .<br />
<strong>The</strong> phonetic in a signific-phonetic character can also be less than optimal by<br />
differing in the initial, such as gwui 6 癐 ‘tired’ 10 , which has a wui 6 會 ‘to meet’<br />
phonetic which differs in the manner of articulation of the homorganic initial, w- /w-/<br />
rather than gw- /kw-/.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
gwui6 tired U+7650 癐 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
laap3 to gather<br />
together<br />
U+39DC �扌匝 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+64F8 擸 ✓ ✓ ✓<br />
Table 6.5: Signific-Phonetic Characters<br />
with Phonetics Differing in the Initial (History)<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
gwui6 tired U+7650 癐 bing6 疒(病) sick wui6 會 to meet<br />
laap3 to gather<br />
together<br />
U+39DC �扌匝 sau2 扌(手) hand jaap3 匝<br />
100<br />
to revolve<br />
U+64F8 擸 sau2 扌(手) hand laap6 巤 bristles<br />
Table 6.6: Signific-Phonetic Characters<br />
with Phonetics Differing in the Initial (Basis)<br />
Similarly, laap 3 ‘to gather together’ 11 , also pronounced laap 6 , is written with<br />
�扌匝 which has a jaap 3 匝 ‘to revolve’ phonetic, formerly saap 6 (Williams 1856:
408), which differs in the initial, j- /tʃ-/ or s- /s-/ rather than l- /l-/, which has no direct<br />
relationship. Besides �扌匝, Williams (1856, 1909 [1874]) also lists an<br />
<strong>orthographic</strong>ally more complex 擸, which he considers to be the character that the<br />
former is “contracted” from (Williams 1909 [1874]: 492), although he distinguishes<br />
the two pronunciations of the word in writing (Williams 1856: 225-226). According<br />
to Williams (1909 [1874]: 492), 擸 is also a signific-phonetic character, composed of a<br />
sau 2 扌(手) ‘hand’ signific and a laap 6 巤 ‘bristles’ phonetic, which is also written as<br />
鬣 (Williams 1909 [1874]: 522), which differs in the tone register, yangru 陽入 (tone<br />
#6) rather than zhongru 中入 (tone #3).<br />
Unlike leu 1 ‘to spit out’ and laam 3 ‘to step over’, there are cases such as kang 3<br />
掯 ‘capable’ 12 and na 2 乸 ‘female’ 13 where the phonetic clearly differs in more than<br />
aspect. <strong>The</strong> phonetic in kang 3 掯 ‘capable’, hang 2 肯 ‘willing’, differs in the tone, as<br />
well as the manner of articulation of the homorganic initial, h- /h-/ rather than k- /k h -/,<br />
while the phonetic in na 2 乸 ‘female’, ya 5 也 ‘also’, differs in the tone register, as well<br />
as the initial, y- /j-/ rather than n- /n-/, which has no direct relationship.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
kang3 capable U+63AF 掯 ✓ ✓ ✓ ✓<br />
na2 female U+4E78 乸 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
nan2 to play with n/a ∅ ✓<br />
U+649A 撚 ✓ ✓ ✓ ✓ ✓ ✓<br />
Table 6.7: Signific-Phonetic Characters<br />
with Phonetics Differing in the Initial and Tone (History)<br />
101
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
kang3 capable U+63AF 掯 sau2 扌(手) hand hang2 肯 willing<br />
na2 female U+4E78 乸 mou5 母 mother ya5 也 also<br />
nan2 to play<br />
with<br />
U+649A 撚 sau2 扌(手) hand yin4 然<br />
Table 6.8: Signific-Phonetic Characters<br />
with Phonetics Differing in the Initial and Tone (Basis)<br />
102<br />
like so<br />
Similarly, nan 2 ‘to play with’ 14 existed as early as the mid-nineteenth century<br />
(Williams 1856), but is only given a written form in later sources. However, the nin 2<br />
pronunciation was already written with 撚 (332), which has a yin 4 然 ‘like so’<br />
phonetic, which differs in the initial, y- /j-/ rather than n- /n-/, as well as the tone,<br />
yangping 陽平 (tone #4) rather than yinshang 陰上 (tone #2).<br />
<strong>The</strong> phonetic in a signific-phonetic character can also be less than<br />
homophonous by differing in the final, such as nam 2 諗 ‘to think’ 15 , which has a nim 6<br />
念 ‘to think of’ phonetic, which differs in the final, -im /-im/ rather than -am /-ɐm/, as<br />
well as the tone, yangqu 陽去 (tone #6) rather than yinshang 陰上 (tone #2). <strong>The</strong><br />
same phonetic is also used in another form given by Rao (1996), 惗.<br />
aai 3 嗌 ‘to yell’ 16 is different in that its yik 1 益 ‘benefit’ phonetic appears to be<br />
a less than optimal phonetic, but it can serve as an aai 3 phonetic, such as in aai 3 隘<br />
‘mountain pass’, which is used in the other form given by Meyer (1947), �口隘.
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
aai3 to yell U+55CC 嗌 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
�口隘 ✓<br />
nam2 to think U+8AD7 諗 ✓ ✓ ✓ ✓<br />
U+7A14 稔 ✓<br />
U+60D7 惗 ✓<br />
Table 6.9: Signific-Phonetic Characters<br />
with Phonetics Differing in the Final and Tone (History)<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
aai3 to yell U+55CC 嗌 hau2 口 mouth yik1 益 benefit<br />
�口隘 hau2 口 mouth aai3 隘 mountain pass<br />
nam2 to think U+8AD7 諗 yin4 言 speech nim6 念 to think of<br />
U+7A14 稔 nam5 稔 ripe<br />
6.1 Optimization<br />
U+60D7 惗 sam1 忄(心) heart nim6 念<br />
Table 6.10: Signific-Phonetic Characters<br />
with Phonetics Differing in the Final and Tone (Basis)<br />
103<br />
to think of<br />
<strong>The</strong> phonetic in a signific-phonetic character is sometimes replaced by one that<br />
is more homophonous, such as mau 1 ‘to squat’ 17 . mau 1 ‘to squat’ was first written<br />
with 卯, a phonetic loan of the semi-homophonous word maau 5 卯, an earthly branch,<br />
which differs in the final, -aau /-au/ rather than -au /-ɐu/, as well as the tone,<br />
yangshang 陽上 (tone #5) rather than yinping 陰平 (tone #1). Later it was written<br />
with 蹘, a signific-phonetic character composed of a juk 1 足 ‘foot’ signific and a lau 6<br />
翏 ‘to soar’ phonetic, which differs in the initial consonant, l- /l-/ instead of m- /m-/,
ut which can serve as a m- /m-/ initial phonetic, such as in mau 6 謬 ‘error’.<br />
Incidently, the earthly branch maau 5 卯 is often confused with an l- /l-/ initial<br />
phonetic, such that maau 5 昴 ‘Pleiades’ and lau 4 留 ‘to remain’ are written with the<br />
proper phonetics, but mau 6 貿 ‘to trade’ and lau 5 柳 ‘willow’ are written with each<br />
other’s (Karlgren 1923: 99, 193). By the early 1970s (Yue), it was written with 踎,<br />
with a fau 2 否 ‘not’ phonetic, which differs in the tone, yinshang 陰上 (tone #2) rather<br />
than yinping 陰平 (tone #1), as well as the manner of articulation of the initial<br />
consonant, f- /f-/ instead of m- /m-/, although both are labial.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
mau1 to squat U+536F 卯 ✓<br />
U+8E58 蹘 ✓ ✓ ✓ ✓ ✓<br />
U+8E0E 踎 ✓ ✓ ✓<br />
wing1 to throw<br />
away<br />
n/a ∅ ✓<br />
U+22AD<br />
5<br />
�扌永 ✓ ✓ ✓ ✓ ✓<br />
U+6254 扔 ✓<br />
Table 6.11: Optimization of the Phonetic in Signific-Phonetic Characters (History)<br />
Similarly, wing 1 ‘to throw away’ 18 existed as early as the mid-nineteenth<br />
century (Williams 1856), but it is only given a written form in later sources. By the<br />
beginning of the twentieth century (Aubazac 1909) it was written with �扌永, a<br />
signific-phonetic character, composed of a sau 2 扌(手) ‘hand’ signific and a semi-<br />
homophonous wing 5 永 ‘eternal’ phonetic, which differs in the tone, yangshang 陽上<br />
(tone #5) rather than yinping 陰平 (tone #1). However, Yue (1972) instead lists 扔,<br />
104
the standard character for the word. <strong>The</strong> fact that the phonetic is not actually naai 5 乃<br />
‘then’, but an abbreviation of an unknown phonetic (Karlgren 1923: 203), suggests<br />
that �扌永 was created in reaction to counter this anomaly.<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
mau1 to squat U+536F 卯 maau5 卯<br />
U+8E58 蹘 juk1 足 foot lau6 翏<br />
wing1 to throw<br />
away<br />
U+8E0E 踎 juk1 足 foot fau2 否 not<br />
105<br />
an earthly branch<br />
to soar<br />
U+22AD5 �扌永 sau2 扌(手) hand wing5 永 eternal<br />
U+6254 扔 sau2 扌(手) hand 乃<br />
Table 6.12: Optimization of the Phonetic in Signific-Phonetic Characters (Basis)<br />
<strong>The</strong> phonetic in a signific-phonetic character is sometimes replaced because of<br />
a <strong>change</strong> in the pronunciation of the word that causes the phonetic to no longer be as<br />
homophonous, such as mang 1 ‘to pull’ 19 , originally only pronounced mang 3 , which<br />
was first written with 掹, a signific-phonetic character composed of a sau 2 扌(手)<br />
‘hand’ signific and a semi-homophonous maang 6 孟 ‘first’ phonetic, which differs in<br />
the final, -aang /-aŋ/ rather than -ang /-ɐŋ/, as well as the tone register, yangqu 陽去<br />
(tone #6) rather than yinqu 陰去 (tone #3). However, Williams (1856: 278) gives the<br />
pronunciation of 孟 as mang 6 , making the phonetic differ only in the tone register. He<br />
also notes that some mang /mɐŋ/ syllables, including 孟 ‘first’ and 掹 ‘to pull’, are<br />
often pronounced as maang /maŋ/.
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
mang3 to pull U+63B9 掹 ✓ ✓ ✓ ✓<br />
U+64DD 擝 ✓ ✓ ✓<br />
�口掹<br />
mang1 U+63B9 掹 ✓<br />
U+64DD 擝 ✓ ✓ ✓ ✓ ✓<br />
�口掹 ✓<br />
Table 6.13: Optimization of a Phonetic in a Signific-Phonetic Character<br />
Due to a Change in Pronunciation (History)<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
mang3/<br />
mang1<br />
to pull U+63B9 掹 sau2 扌(手) hand maang6 孟 first<br />
U+64DD 擝 sau2 扌(手) hand mang4 盟 alliance<br />
�口掹 mang3 掹<br />
106<br />
to pull<br />
Table 6.14: Optimization of a Phonetic in a Signific-Phonetic Character<br />
Due to a Change in Pronunciation (Basis)<br />
By the 1940s (Meyer 1947), the mang 1 pronunciation had developed, and ‘to<br />
pull’ was written with 擝, a signific-phonetic character with a mang 4 盟 ‘alliance’<br />
phonetic, which also differs in the tone register, but with respect to the mang 1<br />
pronunciation, yangping 陽平 (tone #4) rather than yinping 陰平 (tone #1). Although<br />
the mang 3 pronunciation has been used up to the present, the fact that it is listed in less<br />
sources than the mang 1 pronunciation after the 1940s suggests that there is a<br />
correlation between the use of the latter pronunciation and the use of 擝. Although<br />
Rao (1996) lists both 掹 and 擝, as well as both the mang 3 and the mang 1
pronunciations, he does not associate either written form with a particular<br />
pronunciation.<br />
Meyer (1947) also lists �口掹 with a mang 1 pronunciation, a marked phonetic<br />
loan of the character for the mang 3 pronunciation, suggesting that this form was<br />
created after the development of the mang 1 pronunciation, but before the creation of<br />
the 擝 form.<br />
A signific-phonetic character can also be optimized for reasons unrelated to the<br />
degree of homophony of its phonetic, such as ngou 4 ‘to shake’ 20 , which is written with<br />
�敖手, in a vertical arrangement with the ngou 6 敖 ‘to stroll’ phonetic positioned<br />
above the sau 2 手 ‘hand’ signific, as well as �扌敖, in a horizontal arrangement with<br />
the sau 2 手 ‘hand’ signific in its radical form 扌. However, only the horizontal<br />
arrangement is attested in sources later than the mid-1970s (Lau 1977; Rao 1996),<br />
suggesting that it is preferred over a vertical arrangement.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
bou1 to boil;<br />
kettle<br />
U+7172 煲 ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
U+3DDB �保灬 ✓ ✓ ✓<br />
�火�<br />
保衣<br />
✓<br />
ngou4 to shake �敖手 ✓ ✓ ✓ ✓<br />
�扌敖 ✓ ✓ ✓ ✓<br />
U+22CC<br />
6<br />
Table 6.15: Optimization of Signific-Phonetic Characters<br />
for Other Reasons (History)<br />
107
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
bou1 to boil;<br />
kettle<br />
U+7172 煲 fo2 火 fire bou2 保 to protect<br />
U+3DDB �保灬 fo2 灬(火) fire bou2 保 to protect<br />
�火�保<br />
衣<br />
fo2 灬(火) fire bou1 (?) �保衣 to praise (?)<br />
ngou4 to shake �敖手 sau2 手 hand ngou6 敖 to stroll<br />
U+22CC6 �扌敖 sau2 扌(手) hand ngou6 敖 to stroll<br />
Table 6.16: Optimization of Signific-Phonetic Characters<br />
for Other Reasons (Basis)<br />
On the other hand, bou 1 ‘to boil; kettle’ 21 is written with 煲 and �保灬, both<br />
in a vertical arrangement with the bou 2 保 ‘to protect’ phonetic positioned above the<br />
fo 2 火/灬 ‘fire’ signific. However, although the fo 2 火 ‘fire’ signific usually appears in<br />
its radical form 灬 when appearing in the bottom half of a character, such as rán 燃 ‘to<br />
burn’, rè 熱 ‘hot’, and zhǔ 煮 ‘to cook’, the form with the full form fo 2 火 ‘fire’<br />
signific is the only form attested in the sources later than the beginning of the<br />
twentieth century.<br />
6.2 Summary<br />
Like marked phonetic loans, the phonetic in a signific-phonetic character may<br />
be completely homophonous or differ in the initial, final, and/or tone. Sometimes, the<br />
phonetic may be replaced by one which is more homophonous or recognizable, which<br />
is in some cases motivated by a <strong>change</strong> in pronunciation that causes the phonetic to no<br />
longer be as homophonous.<br />
108
Endnotes<br />
1 yeun 6 ‘animal liver’. ① n/a. Williams (1856: 708) c yun (yeun 2 ) “colloquial word”.<br />
② 膶. U+81B6. Aubazac (1909: 45) yeun3; Meyer (1947: #3940) yûn; Yue (1972:<br />
311) ĭøn 33 “colloquial character”; Lau (1977: #3587) yun 6 * (yeun 6-2 ) “CC”; Rao<br />
(1996: 242) yên 6-2 (yeun 6-2 ).<br />
2 chi 1 ‘to stick’. ① 黐. U+9ED0. Williams (1856: 10*) cch’í; Williams (1909<br />
[1874]: 140); Aubazac (1909: 34) tch’i 1 ; Meyer (1947: #253) ch’i; Yue (1972: 302)<br />
ts’i: 53 “colloquial character”; Lau (1977: 274) chi 1 ; Rao (1996: 183) qi 1 . ② �米离.<br />
U+25EFF. Williams (1909 [1874]: 140). ③ �米禽. U+25F1D. Williams (1856:<br />
10*) cch’í. ④ �口笞. Meyer (1947: #253) ch’i.<br />
3 deng 3 ‘to throw’. ① 掟. U+639F. Williams (1856: 523) ting ɔ (ding 6 ), teng ɔ (ting 6 );<br />
Williams (1909 [1874]: 794); Meyer (1947: #3050) tèng; O'Melia (1959: 4: 175) tèng;<br />
Yue (1972: 252) tɛ:ŋ 44 “colloquial character”; Lau (1977: #524) deng 3 “CC”; Rao<br />
(1996: 38) déng 3 . ② 矴. U+77F4. Meyer (1947: #3050) tèng.<br />
4 laan 1 ‘to crawl’. 躝. U+8E9D. Williams (1856: 224) clán; Aubazac (1909: 13) lán1<br />
(laan 4 ); Meyer (1947: #1444) laan; O'Melia (1959: 4: 84) laan; Yue (1972: 238)<br />
lA:n 53 “colloquial character”; Lau (1977: #1774) lan 1 “CC”, “Coll.”; Rao (1996: 119 )<br />
lan 1 .<br />
5 mit 1 ‘to pinch; to tear’. ① 搣. U+6423. Williams (1856: 290) mítɔ “colloquial<br />
word”; Williams (1909 [1874]: 572); Aubazac (1909: 17) mit 4 ; Meyer (1947: #1831)<br />
mit, mik (mik 1 ); Yue (1972: 223) mi:t 5 “colloquial character”; Lau (1977: #2148) mit 1o<br />
“CC”, Coll.”; Rao (1996: 151) mid 1 . ② �扌灭. Rao (1996: 151) mid 1 .<br />
6 na 1 ‘scar’. ① �疒拏. U+24E3B. Williams (1856: 306) cná “colloquial word”;<br />
Williams (1909 [1874]: 587) “in Cantonese”; Aubazac (1909: 18) na 1 ; Meyer (1947:<br />
#1925) na; Yue (1972: 234) nA: 53 “colloquial character”; Lau (1977: #2234) na 1o<br />
“CC”; Rao (1996: 157) na 1 . ② �疒那. U+24DB8. Rao (1996: 157) na 1 .<br />
7 lam 6 ‘to pile up’. ① �扌林. U+3A06. Williams (1856: 222) lam ɔ “colloquial<br />
word”; Williams (1909 [1874]: 527) “in Cantonese”; Aubazac (1909: 13) lam3; Meyer<br />
(1947: #1485) lâm; Yue (1972: 245) lɐm 33 “colloquial character”; Rao (1996: 126)<br />
lem 6 . ② 罧. U+7F67. Meyer (1947: #1485) lâm. ③ 冧. U+51A7. Lau (1977:<br />
#1802) lam 6 “CC”.<br />
109
8 leu 1 ‘to spit out’. ① n/a. Williams (1856: 257) clù “colloquial word”. ② �舌累.<br />
U+269F2. Meyer (1947: #1536) leu; Yue (1972: 263) lœ: 53 , løy̆ 53 (leui 1 ) “colloquial<br />
character”; Lau (1977: #1862) leuh 1 “CC”, “Coll.”; Rao (1996: 122) lê 1 .<br />
9 laam 3 ‘to step over’. ① �足嵐. U+2814F. Williams (1856: 309, 721) nám ɔ<br />
(naam 3 ), lám ɔ “colloquial word”; Williams (1909 [1874]: 495); Aubazac (1909: 13)<br />
lám 3 ; Meyer (1947: #1436) laàm; Yue (1972: 237) lA:m 44 “colloquial character”; Lau<br />
(1977: #1766) laam 3 “CC”, “Coll.”. ② �足南. U+280BE. Rao (1996: 157) nam 3<br />
(naam 3 ), lam 3 .<br />
10 gwui 6 ‘tired’. 癐. U+7650. Williams (1856: 187) kúi ɔ “colloquial word”;<br />
Williams (1909 [1874]: 476) “in Cantonese”; Aubazac (1909: 12) koui3; Meyer (1947:<br />
#1421) kwooî; O'Melia (1959: 4: 83) kwoôi; Yue (1972: 365) ku:ĭ 33 (gui 6 ) “colloquial<br />
character”; Lau (1977: #1069) gwooi 6 “CC”; Rao (1996: 82) gui 6 .<br />
11 laap 3 ‘to gather together’. ① �扌匝. U+39DC. Williams (1856: 225) lápɔ<br />
“colloquial word”; Williams (1909 [1874]: 492) “unauthorized contraction”; Aubazac<br />
(1909: 13) láp4 (laap 6 ); Meyer (1947: #1452) laàp; Yue (1972: 240) lA:p 4 “colloquial<br />
character”; Lau (1977: #1785 ) laap 6 (laap 6 ) “CC”, “Coll.”. ② 擸. U+64F8.<br />
Williams (1856: 226) lápɔ (laap 6 ); Williams (1909 [1874]: 492); Rao (1996: 117) lab 3 .<br />
12 kang 3 ‘capable’. 掯. U+63AF. Meyer (1947: #1050) k’àng; Yue (1972: 335)<br />
k’ɐŋ 44 “colloquial character”; Lau (1977: #16609 kang 3 “Coll.”; Rao (1996: 112)<br />
keng 3 .<br />
13 na 2 ‘female’. 乸. U+4E78. Williams (1856: 306) c ná “colloquial word”; Williams<br />
(1909 [1874]: 586) “in Cantonese”; Aubazac (1909: 18) na 2 ; Meyer (1947: #1926) ná;<br />
O'Melia (1959: 4: 107) ná; Yue (1972: 234) nA: 35 “colloquial character”; Lau (1977:<br />
#2235) na 2 “CC”; Rao (1996: 157) na 2 .<br />
14 nan 2 ‘to play with’. ① n/a. Williams (1856: 309) c nan “colloquial word”. ② 撚.<br />
U+649A. Aubazac (1909: 18) nan 2 ; Meyer (1947: #1962) nán; O'Melia (1959: 4:<br />
108) nán; Yue (1972: 246) nɐn 35 , ni:n 35 ; Lau (1977: #2268) nan 2 “Coll.”; Rao (1996:<br />
160) nen 2 , nin 2 (nin 2 ).<br />
15 nam 2 ‘to think’. ① 諗. U+8AD7. Meyer (1947: #1960) năm (nam 5 ), nám; Yue<br />
(1972: 245) nɐm 35 “colloquial character”; Lau (1977: #2264) nam 2 “CC”, “Coll.”; Rao<br />
(1996: 159) nem 2 . ② 稔. U+7A14. O'Melia (1959: 4: 108) nám. ③ 惗. U+60D7.<br />
Rao (1996: 159) nem 2 .<br />
110
16 aai 3 ‘to yell’. ① 嗌. U+55CC. Williams (1856: 3) ái ɔ “colloquial word”; Williams<br />
(1909 [1874]: 921) “in Cantonese”; Aubazac (1909: 1) ái 3 ; Meyer (1947: #11) aaì;<br />
O'Melia (1959: 4: 1) aài; Yue (1972: 324) ŋA:ĭ 44 (ngaai 3 ), ʔA:ĭ 44 “colloquial<br />
character”; Lau (1977: #13) aai 3 “CC”; Rao (1996: 163) ngai 3 (ngai 3 ), ai 3 . ② �口隘.<br />
Meyer (1947: #11) aaì.<br />
17 mau 1 ‘to squat’. ① 卯. U+536F. Williams (1909 [1874]: 519) “in Cantonese”.<br />
② 蹘. U+8E58. Williams (1856: 281) cmau “colloquial word”; Williams (1909<br />
[1874]: 519) “in Cantonese”; Aubazac (1909: 16) mao 1 ; Meyer (1947: #1794) mau;<br />
O'Melia (1959: 4: 100) mau. ③ 踎. U+8E0E. Yue (1972: 212) mɐŭ 53 “colloquial<br />
character”; Lau (1977: #2111) mau 1 “CC”, “Coll.”; Rao (1996: 151) meo 1 .<br />
18 wing 1 ‘to throw away’. ① n/a. Williams (1856: 668) cwing “colloquial word”. ②<br />
�扌永. U+22AD5. Aubazac (1909: 43) wing 1 ; Meyer (1947: #3766) wing; O'Melia<br />
(1959: 4: 223) wing; Lau (1977: #3236) wing 1 ; Rao (1996: 225) wing 6 (wing 6 ). ③ 扔.<br />
U+6254. Yue (1972: 383) ŭɪŋ 53 .<br />
19 mang 1 ‘to pull’. ① 掹. U+63B9. Williams (1856: 279) mang ɔ (mang 3 ) “colloquial<br />
word”; Williams (1909 [1874]: 565) “unauthorized”, “in Cantonese”; Aubazac (1909:<br />
16) mang 3 (mang 3 ); Rao (1996: 150) meng 1 , meng 3 (mang 3 ). ② 擝. U+64DD. Meyer<br />
(1947: #1787) màng; O'Melia (1959: 4: 99) mang, màng (mang 3 ), maang (maang 1 );<br />
Yue (1972: 214) mɐŋ 53 “colloquial character”; Lau (1977: #2100) mang 1 “CC”; Lau<br />
(1977: #2101) mang 3 (mang 3 ) “CC”; Rao (1996: 150) meng 1 , meng 3 (mang 3 ). ③<br />
�口掹. Meyer (1947: #1787) màng.<br />
20 ngou 4 ‘to shake’. ① �敖手. Williams (1856: 326) cngò; Aubazac (1909: 19)<br />
ngó1; Meyer (1947: #2053) ngō; Yue (1972: 361) ngoŭ 21 “colloquial character”. ②<br />
�扌敖. U+22CC6. Williams (1909 [1874]: 7); Meyer (1947: #2053) ngō; Lau<br />
(1977: #2351) ngo 4 “CC”, “Coll.”; Rao (1996: 172) ngou 4 .<br />
21 bou 1 ‘to boil; kettle’. ① 煲. U+7172. Williams (1856: 383) cpò “vulgar<br />
character”; Aubazac (1909: 24) pó 1 ; Meyer (1947: #2403) po; O'Melia (1959: 4: 131)<br />
po; Yue (1972: 227) poŭ 53 ; Lau (1977: #115) bo 1 “CC”; Lau (1977: #116) bo 1o “CC”;<br />
Rao (1996: 12) bou 1 . ② �保灬. U+3DDB. Williams (1909 [1874]: 620)<br />
“unauthorized”; Aubazac (1909: 24) pó 1 ; O’Melia (1959: 4: 131) po. ③ �火�保衣.<br />
Meyer (1947: #2403) po.<br />
111
CHAPTER 7<br />
HIERARCHY OF CHARACTER CONSTRUCTION<br />
AND USAGE PRINCIPLES<br />
In order to determine which character construction and usage principles are<br />
preferred over others, we examine the principle behind the earlier characters used for a<br />
word and the latter ones that supersede it. Although phonetic loans are numerically<br />
the most commonly used character construction and usage principle in this study, they<br />
are not necessarily the most preferred principle, as they often represent the initial<br />
attempts to transcribe a word. <strong>The</strong> four character construction and usage principles in<br />
the model used here, signific-phonetic characters, co-signific characters, semantic<br />
loans, and phonetic loans, yield six possible combinations to be compared. However,<br />
only four of them are attested in this study: 1) signific-phonetic characters and<br />
phonetic loans, 2) signific-phonetic characters and semantic loans, 3) co-signific<br />
characters and phonetic loans, and 4) semantic loans and phonetic loans. Additionally,<br />
the behavior of a fifth combination, signific-phonetic characters and co-signific<br />
characters, may also be surmised.<br />
7.1 Hegemony of Signific-Phonetic Characters<br />
Signific-Phonetic characters, which are the analogue of the xingsheng 形聲<br />
‘phonetic compounds’ principle in the traditional liushu ㈥書 model, is the dominant<br />
112
character construction and usage principle outside of this study. It is commonly<br />
known that the vast majority of characters belong to this category, which makes up<br />
about 70% to 90% of all characters. Although the actual percentage varies depending<br />
on the criteria used for classification and each particular corpus, it is still larger than<br />
all the other categories combined.<br />
Source 1<br />
Williams<br />
(1909: xlix)<br />
Wieger<br />
(1927: 10)<br />
Li<br />
(1977: 41) 2<br />
Xiangxing<br />
象形<br />
象形<br />
Zhishi<br />
指事 指事<br />
指事<br />
Huiyi<br />
會意<br />
會意<br />
113<br />
Xingsheng<br />
形聲<br />
形聲<br />
Zhuanzhu<br />
轉注<br />
轉注<br />
Jiajie<br />
假借<br />
假借<br />
Total<br />
608 107 740 21810 372 598 24235<br />
2.51% 0.44% 3.05% 89.99% 1.53% 2.47%<br />
364 125 1167 7697 ? ? 10516<br />
3.46% 1.19% 11.10% 73.19% ? ?<br />
3.9% 1.3% 12.3% 81.2% 0.07% 1.2% ?<br />
Table 7.1: Distribution of Character Construction and Usage Principles<br />
7.1.1 Signific-Phonetic Characters and Co-Signific Characters<br />
Although there are no attested cases of signific-phonetic characters<br />
superseding co-signific characters in this study, there are also no cases of vice versa<br />
happening. It is only under unusual circumstances that a signific-phonetic character is<br />
superseded by a co-signific character, such as wāi 竵 ‘askew’, superseded by 歪,<br />
which is composed of significs bù 不 ‘not’ and zhèng 正 ‘straight’; or cuān 爨 ‘to<br />
parboil’, superseded by 汆, composed of significs rù 入 ‘to put in’ and shuǐ 水 ‘water’<br />
(Norman 1988: 76-77). In both cases, the phonetic was obscure and <strong>orthographic</strong>ally<br />
complex. However, given the numerical superiority of signific-phonetic characters
over co-signific characters, there is probably a strong preference for signific-phonetic<br />
characters over co-signific characters.<br />
7.1.2 Signific-Phonetic Characters Superseding Phonetic Loans<br />
Signific-Phonetic characters commonly supersede phonetic loans, and in many<br />
cases retain the same phonetic, such as guk 6 ‘to bake’ 3 , which was first written with<br />
局, an unmarked phonetic loan of the completely homophonous word guk 6 局<br />
‘bureau’. By the 1940s (Meyer 1947), a fo 2 火 ‘fire’ radical was added to it as a<br />
signific, creating 焗.<br />
Similarly, fan 3 ‘to sleep’ 4 was first written with 訓, an unmarked phonetic loan<br />
of the completely homophonous word fan 3 訓 ‘to teach’. Later, a muk 6 目 ‘eye’<br />
radical was added to it as a signific, creating 瞓. fan 3 ‘to sleep’ has been identified<br />
with kwan 3 睏 ‘sleepy’ (Williams 1909: 255), which in turn is identified with kwan 3<br />
困 ‘weary’. However, kwan 3 困 ‘weary’ is a less optimal phonetic than fan 3 訓 ‘to<br />
teach’, as there is no direct relationship between the initials kw- /kw-/ and f- /f-/,<br />
although it can be glimpsed in the h- /h-/ initial of that rare han 3 pronunciation that<br />
Rao (1996) also gives, which suggests that f- /f-/ can be related to kw- /kw-/ through<br />
*hu- /*hu-/ and *ku- /*ku-/.<br />
An unmarked phonetic loan may transition to a marked phonetic loan first,<br />
such as saang 2 ‘to scour’ 5 , which was first written with 省 and later �口省, phonetic<br />
loans of the completely homophonous word saang 2 省 ‘to reduce’, but by the<br />
beginning of the twentieth century (Aubazac 1909), a sau 2 扌(手) ‘hand’ radical was<br />
added to the phonetic as a signific, creating �扌省. Similarly, lam 1 ‘bud’ 6 was first<br />
114
written with 林 and later 啉, phonetic loans of the semi-homophonous word lam 4 林<br />
‘forest’, which differs in the tone register, yangping 陽平 (tone #1) rather than yinping<br />
陰平 (tone #1). By the beginning of the twentieth century, a mik 6 冖 ‘cover’ radical<br />
was added to the phonetic as a signific, creating 冧. Lau (1977) instead adds a chou 2<br />
艹(草) ‘grass’ radical to the phonetic as a signific to create 菻.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
dap1 to hang<br />
down<br />
n/a ∅ ✓<br />
U+55D2 嗒 ✓ ✓ ✓<br />
U+35F3 �口答 ✓<br />
U+265BF �耳荅 ✓<br />
U+8037 耷 ✓<br />
fan3 to sleep U+8A13 訓 ✓<br />
U+7793 瞓 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
guk6 to bake U+5C40 局 ✓ ✓ ✓<br />
U+7117 焗 ✓ ✓ ✓ ✓ ✓<br />
lam1 bud U+6797 林 ✓<br />
U+5549 啉 ✓<br />
U+51A7 冧 ✓ ✓ ✓ ✓<br />
U+83FB 菻 ✓<br />
lei6 tongue U+550E 唎 ✓<br />
U+8137 脷 ✓ ✓ ✓ ✓ ✓ ✓<br />
saang2 to scour U+7701 省 ✓<br />
U+35C2 �口省 ✓ ✓<br />
U+3A18 �扌省 ✓ ✓ ✓ ✓ ✓ ✓<br />
Table 7.2: Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining the Same Phonetic, Part I (History)<br />
115
<strong>The</strong> transition from a phonetic loan to a signific-phonetic character does not<br />
require the unmarked form of the former to ever have existed, such as dap 1 ‘to hang<br />
down’ 7 , which existed as early as the mid-nineteenth century (Williams 1856), but is<br />
only given a written form in later sources. As early as the late nineteenth century<br />
(Williams 1909 [1874]), it was written with 嗒 or �口答, marked phonetic loans of<br />
the semi-homophonous words daap 3 荅 ‘bean seeds’ or daap 3 答 ‘to reply’ (or just the<br />
latter, as the 答 form is commonly substituted with 荅), which differ in the final, -aap<br />
/-ap/ rather than -ap /-ɐp/, as well as the tone, yinqu 陰去 (tone #3) rather than yinping<br />
陰平 (tone #1). Rao (1996) adds an yi 5 耳 ‘ear’ radical to the phonetic as a signific,<br />
creating �耳荅, a reference to large ears hanging down. Similarly, lei 6 ‘tongue’ 8 was<br />
first written with 唎, an marked phonetic loan of the completely homophonous word<br />
lei 6 利 ‘benefit’. Later, a yuk 6 月(肉) ‘flesh’ radical was added to the phonetic as a<br />
signific, creating 脷.<br />
<strong>The</strong> transition from phonetic loans to signific-phonetic characters is further<br />
illustrated by the interaction between dam 1 ‘to prolong’ 9 , dam 2 ‘to dump; to pound’ 10 ,<br />
and dam 3 ‘to drop down’ 11 . dam 2 ‘to dump; to pound’ and dam 3 ‘to drop down’ were<br />
first written with 泵, a character constructed according to indeterminate principles. It<br />
is unknown if 泵 began to be used for both words simultaneously, or if it was first<br />
used for one and the other is a phonetic loan. By the beginning of the twentieth<br />
century (Aubazac 1909), dam 2 ‘to dump; to pound’ was distinguished from dam 3 ‘to<br />
drop down’ by adding a sau 2 扌(手) ‘hand’ radical as a signific, creating 揼. dam 1 ‘to<br />
prolong’, which existed as early as the 1940s (Meyer 1947), was written with 泵, a<br />
116
phonetic loan of dam 3 泵 ‘to drop down’, which differs in the tone, yinqu 陰去 (tone<br />
#3) rather than yinping 陰平 (tone #1). By the late 1970s (Lau 1977), dam 3 ‘to drop<br />
down’ was distinguished from dam 1 ‘to prolong’ by adding a si 1 糹(糸) ‘silk’ radical<br />
as a signific, creating �糹泵, although Meyer (1947) gives a form with an extraneous<br />
hau 2 口 ‘mouth’ radical, �口�糹泵, suggesting that this happened as early as the<br />
1940s.<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
dap1 to hang<br />
down<br />
U+55D2 嗒 daap3 荅 bean seeds<br />
to reply<br />
U+35F3 �口答 daap3 答<br />
U+265BF �耳荅 yi5 耳 ear daap3 荅<br />
U+8037 耷<br />
fan3 to sleep U+8A13 訓 fan3 訓<br />
U+7793 瞓 muk6 目 eye fan3 訓<br />
117<br />
bean seeds<br />
to teach<br />
to teach<br />
guk6 to bake U+5C40 局 guk6 局 bureau<br />
U+7117 焗 fo2 火 fire guk6 局 bureau<br />
lam1 bud U+6797 林 lam4 林 forest<br />
U+5549 啉 lam4 林 forest<br />
U+51A7 冧 mik6 冖 cover lam4 林 forest<br />
U+83FB 菻 chou2 艹(草) grass lam4 林 forest<br />
lei6 tongue U+550E 唎 lei6 利 benefit<br />
U+8137 脷 yuk6 月(肉) flesh lei6 利 benefit<br />
saang2 to scour U+7701 省 saang2 省<br />
U+35C2 �口省 saang2 省<br />
U+3A18 �扌省 sau2 扌(手) hand saang2 省<br />
to reduce<br />
to reduce<br />
to reduce<br />
Table 7.3: Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining the Same Phonetic, Part I (Basis)
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
dam1 to<br />
prolong<br />
dam2 to dump;<br />
to pound<br />
dam3 to drop<br />
down<br />
U+6CF5 泵 ✓ ✓ ✓<br />
U+63FC 揼 ✓<br />
U+6CF5 泵 ✓ ✓<br />
U+63FC 揼 ✓ ✓ ✓ ✓ ✓<br />
�扌冘 ✓<br />
U+6CF5 泵 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+260A5 �糹泵 ✓<br />
�口�<br />
糹泵<br />
✓<br />
U+9AE7 髧 ✓<br />
Table 7.4: Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining the Same Phonetic, Part II (History)<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
dam1 to prolong U+6CF5 泵 dam3 泵 to drop<br />
down<br />
U+63FC 揼 sau2 扌(手) hand dam3 泵 to drop<br />
dam2 to dump;<br />
to pound<br />
dam3 to drop<br />
down<br />
118<br />
down<br />
U+6CF5 泵 dam2 泵 to dump;<br />
to pound<br />
U+63FC 揼 sau2 扌(手) hand dam2 泵 to dump;<br />
to pound<br />
U+628C 抌 sau2 扌(手) hand<br />
U+6CF5 泵 dam3 泵 to drop<br />
down<br />
U+260A5 �糹泵 si1 糹(糸) silk dam3 泵 to drop<br />
down<br />
�口�糹泵 si1 糹(糸) silk dam3 �糹泵 to drop<br />
down<br />
U+9AE7 髧 daam6 髧 tresses<br />
Table 7.5: Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining the Same Phonetic, Part II (Basis)
In some cases, part of the phonetic is retained graphically in an abbreviated<br />
form, such as lau 1 ‘coat’ 12 , which was first written with 蔞 or its variant form 蒟, an<br />
unmarked phonetic loan of the completely homophonous word lau 1 蔞/蒟 ‘betel<br />
pepper’. By the 1940s (Meyer 1947), the chou 2 艹(草) ‘grass’ radical had been<br />
removed from 蔞 and a yi 1 衤(衣) ‘clothing’ radical added to it as a signific, creating<br />
褸. Alternatively, 褸 could be analyzed as a signific-phonetic character with no<br />
connection to 蔞, but that would make its phonetic the semi-homophonous lau 4 婁 ‘a<br />
constellation’, which differs in the tone register, yangping 陽平 (tone #4) rather than<br />
yinping 陰平 (tone #1).<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
gwaan3 to fall<br />
down<br />
keui5 he, she,<br />
it<br />
U+6163 慣 ✓ ✓<br />
U+8E80 躀 ✓ ✓ ✓ ✓ ✓<br />
U+6E20 渠 ✓ ✓ ✓<br />
U+20372 �亻渠 ✓ ✓<br />
U+4F62 佢 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
lau1 coat U+851E 蔞 ✓ ✓ ✓<br />
U+8938 褸 ✓ ✓ ✓<br />
U+849F 蒟 ✓<br />
Table 7.6: Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining an Abbreviated Form of the Phonetic (History)<br />
Similarly, gwaan 3 ‘to fall down’ 13 was first written with 慣, an unmarked<br />
phonetic loan of the completely homophonous word gwaan 3 慣 ‘accustomed’. By the<br />
1940s (Meyer 1947), the sam 1 忄(心) ‘heart’ radical had been removed from 慣 and a<br />
119
juk 1 足 ‘foot’ radical added to it as a signific, creating 躀. Alternatively, 躀 could be<br />
analyzed as a signific-phonetic character with no connection to 慣, but that would<br />
make its phonetic the less than optimal gun 3 貫 ‘string of coins’. Like lau 1 ‘coat’, it is<br />
more probable that a completely homophonous phonetic was retained in an<br />
abbreviated form than for a less than optimal phonetic to be selected that<br />
coincidentally resembles the earlier phonetic graphically.<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
gwaan3 to fall down U+6163 慣 gwaan3 慣 accustomed<br />
U+8E80 躀 juk1 足 foot gun3 貫<br />
keui5 he, she, it U+6E20 渠 keui4 渠 drain<br />
U+20372 �亻渠 yan4 亻(人) person keui4 渠 drain<br />
U+4F62 佢 yan4 亻(人) person geui6 巨 giant<br />
lau1 coat U+851E 蔞 lau1 蔞<br />
U+8938 褸 yi1 衤(衣) clothing lau4 婁<br />
U+849F 蒟 lau1 蒟<br />
Table 7.7: Signific-Phonetic Characters Superseding Phonetic Loans<br />
and Retaining an Abbreviated Form of the Phonetic (Basis)<br />
120<br />
string of coins<br />
betel pepper<br />
a constellation<br />
betel pepper<br />
Likewise, keui 5 ‘he, she, it’ 14 was first written with 渠 and �亻渠, both of<br />
which are attested in the Kangxi zidian 康熙字典 (1716: 116, 633) as Wu 吳 region<br />
words. 渠 is an unmarked phonetic loan of the semi-homophonous word keui 4 渠<br />
‘drain’, which differs in the tone, yangping 陽平 (tone #4) rather than yangshang 陽上<br />
(tone #5), while �亻渠 is a signific-phonetic character composed of a yan 4 亻(人)<br />
‘person’signific and the same phonetic. However, by the mid-nineteenth century
(Williams 1856), 佢 was already the form that was “chiefly used” (Williams 1856:<br />
186) rather than 渠, and by the late nineteenth century (Williams 1909 [1874]: 222),<br />
渠 had been “superseded” by 佢 and �亻渠. By then 佢 was also the form that was<br />
“alone used” (Williams 1909 [1874]: 222) rather than both 佢 and �亻渠, and said to<br />
be a “contracted form” (Williams 1909 [1874]: 222) of the latter. Alternatively, 佢<br />
could be analyzed as a signific-phonetic character with no connection to 佢 and<br />
�亻渠, but that would make its phonetic an even less homophonous geui 6 巨 ‘giant’.<br />
Sometimes, the transition from a phonetic loan to a signific-phonetic character<br />
is the result of semantic specialization, such as sung 3 ‘side dishes’ 15 , which developed<br />
from the verb sung 3 送 ‘to accompany’. By the 1950s (O’Melia), a sik 6 飠(食) radical<br />
had been added to it as a signific, creating 餸. Similarly, dou 6 ‘ferry’ 16 , which is in<br />
mainstream usage written with 渡, as in dou 6 syun 4 渡船 ‘ferryboat’, undistinguished<br />
from the verb dou 6 渡 ‘to ford’ from which it developed. By the 1990s (Rao 1996),<br />
the seui 2 氵( 水) ‘water’ radical had been removed from 渡 and a jau 1 舟 ‘boat’<br />
radical added to it as a signific, creating 艔.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
dou6 ferry U+6E21 渡 ✓ ✓ ✓ ✓ ✓ ✓<br />
U+8254 艔 ✓<br />
sung3 side<br />
dishes<br />
U+9001 送 ✓ ✓ ✓<br />
U+9938 餸 ✓ ✓ ✓ ✓<br />
Table 7.8: Signific-Phonetic Characters Superseding Phonetic Loans<br />
as a Result of Semantic Specialization (History)<br />
121
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
dou6 ferry U+6E21 渡 dou6 渡<br />
sung3 side<br />
dishes<br />
122<br />
to ford<br />
U+8254 艔 jau2 舟 boat dou6 度 degree<br />
U+9001 送 sung3 送<br />
U+9938 餸 sik6 飠(食) to eat sung3 送<br />
to accompany<br />
to accompany<br />
Table 7.9: Signific-Phonetic Characters Superseding Phonetic Loans<br />
as a Result of Semantic Specialization (Basis)<br />
<strong>The</strong> transition from a phonetic loan to a signific-phonetic character can also<br />
involve replacement of the phonetic unrelated to its degree of homophony, such as<br />
saai 1 ‘to waste’ 17 , which existed as early as the mid-nineteenth century (Williams<br />
1856), but is only given a written form in later sources. As early as the mid-nineteenth<br />
century (Williams 1909 [1874]), it was written with unmarked and marked phonetic<br />
loans of the semi-homophonous word saai 2 徙 ‘to move’, which differs in the tone,<br />
yinshang 陰上 (tone #2) rather than yinping 陰平 (tone #1), and was written with the<br />
marked form up to the present (Rao 1996). But by the 1940s (Meyer 1947), sau 2<br />
扌(手) ‘hand’ and seui 2 氵(水) radicals were already added to the phonetic as<br />
significs, creating �扌徙 and 漇. By the late 1970s (Lau 1977), it was written with<br />
�扌晒, a signific-phonetic character composed of a sau 2 扌(手) ‘hand’ radical and a<br />
semi-homophonous saai 3 晒 ‘to shine on’ phonetic, whose initial was formerly /ʃ-/<br />
(Williams 1856: 417; O’Melia 1959: 4: 141) but now /s-/ (Yue 1972: 280). Although<br />
it still differs in the tone, yinqu 陰去 (tone #3) rather than yinping 陰平 (tone #1), and<br />
is not a more homophonous phonetic than saai 2 徙 ‘to move’, it is <strong>orthographic</strong>ally
less complex. <strong>The</strong> <strong>change</strong> of the phonetic from saai 2 徙 ‘to move’ to saai 3 晒 ‘to<br />
shine on’ is perhaps motivated on analogy with the <strong>change</strong> for saai 3 , a quantifying<br />
particle indicating completeness (see section 5.4).<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Yx1972 L1977 R1996<br />
saai1 to waste n/a ∅ ✓<br />
U+5F99 徙 ✓<br />
U+5625 嘥 ✓ ✓ ✓ ✓ ✓<br />
U+22CDC �扌徙 ✓<br />
U+6F07 漇 ✓<br />
�扌晒 ✓<br />
Table 7.10: Signific-Phonetic Character Superseding a Phonetic Loan<br />
and Optimization of the Phonetic (History)<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
saai1 to waste U+5F99 徙 saai2 徙 to move<br />
U+5625 嘥 saai2 徙<br />
U+22CDC �扌徙 sau2 扌(手) hand saai2 徙<br />
U+6F07 漇 seui2 氵(水) water saai2 徙<br />
�扌晒 sau2 扌(手) hand saai3 晒<br />
123<br />
to move<br />
to move<br />
to move<br />
to shine on<br />
Table 7.11: Signific-Phonetic Character Superseding a Phonetic Loan<br />
and Optimization of the Phonetic (Basis)<br />
7.1.3 Signific-Phonetic Characters and Semantic Loans<br />
Although there is only one case of a signific-phonetic character superseding a<br />
co-signific character in this study, it indicates a preference for the former over the<br />
latter. nung 1 ‘to scorch’ 18 was first written with 烘 or its variant form 灴, a semantic
loan of hung 4 烘/灴 ‘to toast’. By the beginning of the twentieth century (Aubazac<br />
1909), it was written with 燶, a signific-phonetic character composed of a fo 2 火 ‘fire’<br />
signific and a semi-homophonous nung 4 農 ‘to farm’ phonetic, which differs in the<br />
tone register, yangping 陽平 (tone #4) instead of yinping 陰平 (tone #1).<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
nung1 to scorch U+70D8 烘 ✓ ✓<br />
U+7074 灴 ✓<br />
U+71F6 燶 ✓ ✓ ✓ ✓ ✓<br />
Table 7.12: Signific-Phonetic Character Superseding a Semantic Loan (History)<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
nung1 to scorch U+70D8 烘<br />
U+7074 灴<br />
U+71F6 燶 fo2 火 fire nung4 農<br />
124<br />
to farm<br />
Table 7.13: Signific-Phonetic Character Superseding a Semantic Loan (Basis)<br />
However, there is one case where a signific-phonetic character is superseded<br />
by a semantic loan, but under unusual circumstances. kam 2 ‘to cover’, which<br />
developed from the noun kam 1 衾 ‘blanket’, was first written with �口衾, a marked<br />
phonetic loan, and was written with it up to at least the 1940s (Meyer 1947), but by the<br />
late nineteenth century (Williams 1909 [1874]), a sau 2 扌(手) ‘hand’ radical had<br />
already been added to the phonetic as a signific, creating 搇. But by the late 1970s
(Lau 1977), it was written with 冚, a semantic loan of ham 6 ‘to cover’, which was<br />
formerly hom 6 in sources earlier than the mid-twentieth century (Williams 1856: 92;<br />
Meyer 1947: #776). Besides the semantic similarity, ham 6 冚 ‘to cover’ is also<br />
<strong>orthographic</strong>ally less complex and may perhaps also include elements of a phonetic<br />
loan which differs in the manner of articulation of the homorganic initial consonant, h-<br />
/h-/ rather than k- /k h -/, as well as the tone, yangqu 陽去 (tone #6) rather than yinshang<br />
陰上 (tone #2).<br />
<strong>The</strong>re is also one case where a signific-phonetic character co-exists with, or<br />
may be superseded by a semantic loan, but also under unusual circumstances. sit 6 ‘to<br />
lose money’ 19 , as in sit 6 bun 2 ~本 ‘to lose money’, was written with 舌, �貝舌, or<br />
餂, but by the beginning of the twentieth century (Aubazac 1909), �貝舌 was the<br />
only form that remained. However, Rao (1996) also lists 蝕, a semantic loan of sik 6<br />
蝕 ‘to erode’, a word referring generically to any kind of loss over time, which is used<br />
in the Mandarin synonym, shíběn 蝕本 ‘to lose money’. Although the compound<br />
sit 6 bun 2 �貝舌本 ‘to lose money’ is attested in Williams (1856: 448), neither<br />
*sik 6 bun 2 蝕本 nor *sit 6 bun 2 蝕本 are, suggesting that use of 蝕 for sit 6 ‘to lose<br />
money’ is influenced by Mandarin usage. However, Rao (1996), who is the only<br />
source to give the pronunciation as sit 3 , considers sik 6 to be the literary pronunciation,<br />
which suggests that sit 6 ‘to lose money’ may have developed from sik 6 蝕 ‘to erode’ as<br />
a variant pronunciation.<br />
125
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
kam2 to cover U+20E78 �口衾 ✓ ✓ ✓<br />
U+6407 搇 ✓ ✓ ✓<br />
U+519A 冚 ✓ ✓<br />
sit6 to lose<br />
money<br />
U+820C 舌 ✓<br />
U+27D73 �貝舌 ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓<br />
U+9902 餂 ✓ ✓<br />
U+8755 蝕 ✓<br />
Table 7.14: Semantic Loans Superseding Signific-Phonetic Characters (History)<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
kam2 to cover U+20E78 �口衾 kam1 衾 blanket<br />
sit6 to lose<br />
money<br />
U+6407 搇 sau2 扌(手) hand kam1 衾 blanket<br />
U+519A 冚<br />
U+820C 舌 sit6 舌 tongue<br />
U+27D73 �貝舌 bui6 貝 cowrie sit6 舌 tongue<br />
U+9902 餂 sik6 食 to eat sit6 舌 tongue<br />
U+8755 蝕<br />
Table 7.15: Semantic Loans Superseding Signific-Phonetic Characters (Basis)<br />
7.2 Co-Signific Characters Superseding Phonetic Loans<br />
Although signific-phonetic characters usually supersede phonetic loans, there<br />
is one case of a co-signific character superseding a phonetic loan, which indicates a<br />
preference for co-signific characters over phonetic loans. hong 2 ‘young hen’ 20 , as in<br />
gai 1 hong 2 鷄~, existed as early as the mid-eighteenth century (Williams 1856: 119),<br />
where it was transcribed as ckaihong ɔ (gai 1 hong 6 ), but it is unclear what its written<br />
126
form was intended to be, as Williams (1856) does not provide characters for<br />
compounds. However, by the 1940s (Meyer 1947), when it was still pronounced<br />
hong 6 , it was written with 項, a phonetic loan of the completely homophonous 21 word<br />
hong 6 ‘nape’.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
hong2 young<br />
hen<br />
U+9805 項 ✓ ✓<br />
U+236BA �末�<br />
n/a<br />
成母<br />
∅ ✓<br />
Table 7.16: Co-Signific Character Superseding a Phonetic Loan (History)<br />
Word Gloss Unicode Char Phonetic Char Gloss<br />
hong2 young hen U+9805 項 hong6 項 nape<br />
U+236BA �末�成母<br />
Table 7.17: Co-Signific Character Superseding a Phonetic Loan (Basis)<br />
Yue (1972) does not give a written form, but Rao (1996), besides 項, also lists<br />
�末�成母, a co-signific character composed of significs mei 6 未 ‘not yet’, sing 4 成<br />
‘to become’, and mou 5 母 ‘mother’, which spells out the descriptive phrase<br />
mei 6 sing 4 mou 5 未成母 ‘not yet become a mother’, a reference to a young hen which<br />
has not yet laid eggs. �末�成母 is perhaps created on analogy with a similar co-<br />
signific character, cheun 1 ‘animal egg’, which is written as �末�成肉 or 膥 (see<br />
section 4.1), spelling out a similar phrase, mei 6 sing 4 yuk 6 肉 ‘not yet become flesh’.<br />
127<br />
✓
<strong>The</strong> fact that a co-signific character is created despite the existence of a phonetic loan<br />
suggests that the former could potentially supersede the latter, especially since hong 6<br />
項 ‘nape’ has not been the basis of a phonetic loan of a completely homophonous<br />
word since at least the 1970s (Yue 1972), in light of the contemporary hong 2<br />
pronunciation.<br />
7.3 Semantic Loans Superseding Phonetic Loans<br />
Although signific-phonetic characters usually supersede phonetic loans, there<br />
is also a case of a semantic loan superseding a phonetic loan, which indicates a<br />
preference for semantic loans over phonetic loans. laat 6 ‘row’ 22 was first written with<br />
剌, a phonetic loan of the completely homophonous word laat 6 剌 ‘to cut’. By the late<br />
1970s (Lau 1977), it was written with 列 or its variant form 迾 (HYDZD 6: 3828), a<br />
semantic loan of lit 6 列/迾 ‘row’.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
laat6 row U+524C 剌 ✓ ✓ ✓<br />
U+5217 列 ✓<br />
U+8FFE 迾 ✓<br />
Table 7.18: Semantic Loans Superseding Phonetic Loans (History)<br />
128
Word Gloss Unicode Char Phonetic Char Gloss<br />
laat6 row U+524C 剌 laat6 剌<br />
U+5217 列<br />
U+8FFE 迾<br />
129<br />
to cut<br />
Table 7.19: Semantic Loans Superseding Phonetic Loans (Basis)<br />
7.4 Indeterminate Cases Being Superseded<br />
Characters constructed or used according to indeterminate principles are often<br />
superseded by signific-phonetic characters and semantic loans, which are described<br />
below.<br />
7.4.1 Signific-Phonetic Characters Superseding Indeterminate Cases<br />
Characters constructed or used according to indeterminate principles can be<br />
superseded by signific-phonetic characters, such as gat 6 ‘to raise up; to limp’ 23 , which<br />
was first written with 跀. 跀 is composed of a juk 1 足 ‘foot’ signific and a yut 6 月<br />
‘moon’ component, but the latter does not seem to be a phonetic nor a co-signific.<br />
Although 跀 is attested in the Kangxi zidian 康熙字典 (1716), it was used to write a<br />
phonologically and semantically different word, ‘to cut off the feet as a punishment’<br />
(1716: 1222).<br />
Besides 跀 for both the ‘to raise up’ and ‘to limp’ senses, Williams (1909<br />
[1874]: 230) also lists 觖, which for unexplained reasons he comments, “this form is<br />
bettter”. 觖 is composed of a gok 3 角 ‘horn’ component and a kyut 3 夬 ‘to break off’<br />
component, but neither component appears to be a signific nor a phonetic.
By the beginning of the twentieth century (Aubazac 1909), the ‘limp’ sense<br />
was differentiated from the ‘to raise up’ sense by writing it with 趷 instead of 跀, a<br />
signific-phonetic character composed of a juk 1 足 ‘foot’ signific and a semi-<br />
homophonous hat 1 乞 ‘to beg’ phonetic, which differs in the manner of articulation of<br />
the homorganic initial consonant, h- /h-/ rather than g- /k-/, as well as the tone register,<br />
yinru 陰入 (tone #1) rather than yangru 陽入 (tone #6).<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
gat6 to raise up U+8DC0 跀 ✓ ✓ ✓ ✓ ✓<br />
U+89D6 觖 ✓<br />
U+8DB7 趷 ✓<br />
U+4798 �走乞 ✓<br />
U+8D8C 趌 ✓<br />
gat6 to limp U+8DC0 跀 ✓ ✓<br />
U+89D6 觖 ✓<br />
U+8DB7 趷 ✓ ✓ ✓ ✓<br />
U+4798 �走乞 ✓<br />
U+8D8C 趌 ✓<br />
Table 7.20: Signific-Phonetic Characters Superseding Indeterminate Cases (History)<br />
However, Rao (1996) lists 趷 for both senses, as well as �走乞 and 趌, which<br />
are also signific-phonetic characters. �走乞 is composed of a jau 2 走 ‘to run’ signific<br />
and the same semi-homophonous hat 1 乞 ‘to beg’ phonetic as 趷, while 趌 is<br />
composed of a jau 2 走 ‘to run’ signific and a semi-homophonous gat 1 吉 ‘lucky’<br />
130
phonetic, which differs in the tone register, yinru 陰入 (tone #1) rather than yangru<br />
陽入 (tone #6).<br />
Word Gloss Unicode Char Signific Char Gloss Phonetic Char Gloss<br />
gat6 to raise up; to limp U+8DC0 跀<br />
U+89D6 觖<br />
U+8DB7 趷 juk1 足 foot hat1 乞<br />
U+4798 �走乞 jau2 走 to run hat1 乞<br />
131<br />
to beg<br />
to beg<br />
U+8D8C 趌 jau2 走 to run gat1 吉 lucky<br />
Table 7.21: Signific-Phonetic Characters Superseding Indeterminate Cases (Basis)<br />
7.4.2 Semantic Loans Superseding Indeterminate Cases<br />
Characters constructed or used according to indeterminate principles can be<br />
superseded by semantic loans, such as lat 1 ‘to lose; to get rid of’ 24 , which was first<br />
written with 甪. 甪 had a variety of prior uses, from a variant form of gok 3 角 ‘horn’<br />
in the Kangxi zidian 康熙字典 (1716: 756) to the name of a beast pronounced as<br />
*luk 1 盧谷切 in the Pianhai leibian 篇海類編 (HYDZD 1: 38), neither of which can<br />
be reconciled phonetically nor semantically. However, the sources that it appears in<br />
are apparently unaware of any prior usages, including Williams (1856), who even uses<br />
it as an example of a newly created character:<br />
Lastly, entirely new characters are made for some of them; as lat 甪 to<br />
detach; páng 碰 a knock, which of course have no currency in other<br />
parts of China, as neither their sound or meaning will be known<br />
elsewhere. (xiii)
However, in a latter source he (1909 [1874]: 542) explains its form as being contracted<br />
from gok 3 角 ‘horn’, “as if an antler had fallen”. In any case, by the 1940s (Huang<br />
1941: 18), lat 1 ‘to lose; to get rid of’ was written with 甩, whose origins are equally<br />
mysterious. 甩 is used in Mandarin for a word meaning ‘to throw away’, pronounced<br />
as shuāi according to Williams, but shuǎi according to Giles (Williams 1909 [1874]:<br />
718) and contemporary pronunciation. Meanwhile, Fenn (1942: 466) considered it to<br />
be inter<strong>change</strong>able with 摔, which is pronounced as seut 1 in Cantonese. <strong>The</strong> use of 甩<br />
for lat 1 ‘to lose; to get rid of’ is apparently a semantic loan of the Mandarin synonym<br />
shuǎi ‘to throw away’.<br />
Word Gloss Unicode Char W1856 W1874 A1909 M1947 O1959 Y1972 L1977 R1996<br />
lat1 to lose;<br />
to get<br />
rid of<br />
7.5 Summary<br />
U+752A 甪 ✓ ✓ ✓ ✓ ✓<br />
U+7529 甩 ✓ ✓ ✓<br />
Table 7.22: Semantic Loan Superseding an Indeterminate Case<br />
Signific-Phonetic characters, which can indicate both the general meaning and<br />
the pronunciation, is the most preferred character construction and usage principle. In<br />
almost all cases, phonetic loans have been superseded by signific-phonetic characters,<br />
rather than co-signific characters or semantic loans. A signific-phonetic character may<br />
supersede an unmarked phonetic loan directly, a marked phonetic loan directly, or a<br />
marked phonetic loan that was previously an unmarked phonetic loan. In almost all<br />
132
cases, the character borrowed for a phonetic loan is retained as the phonetic of a<br />
signific-phonetic character, although its form may be abbreviated graphically. In<br />
some cases, the superseding of a phonetic loan by a signific-phonetic character is due<br />
to the development of a new word or sense of a word, which is distinguished by the<br />
addition of a radical as a signific, which transforms a phonetic loan into a signific-<br />
phonetic character. On the other hand, signific-phonetic characters are never<br />
superseded by phonetic loans.<br />
<strong>The</strong> ranking of signific-phonetic characters with respect to co-signific<br />
characters and semantic loans is not as undisputed as its hegemony over phonetic<br />
loans, but the data suggests that they are also preferred over co-signific characters and<br />
semantic loans. Although there are no attested cases of signific-phonetic characters<br />
superseding co-signific characters in this study or vice versa, the numerical superiority<br />
of signific-phonetic characters over co-signific characters suggests that signific-<br />
phonetic characters are preferred over co-signific characters, and that such cases could<br />
be expected to be found in the future. On the other hand, there is a case of a signific-<br />
phonetic character superseding a semantic loan, as well as vice versa happening,<br />
although the latter occurs only under extentuating circumstances. Similarly, the<br />
numerical superiority of signific-phonetic characters also suggests that they are<br />
preferred over semantic loans.<br />
Since most phonetic loans are superseded by signific-phonetic characters if<br />
they are superseded at all, there are few opportunities for a phonetic loan to be<br />
superseded by a co-signific character or a semantic loan. However, there is a case<br />
each of a co-signific character and a semantic loan superseding a phonetic loan, which<br />
133
suggests that they are both preferred over phonetic loans. Furthermore, there are no<br />
attested cases of co-signific characters and semantic loans being superseded by<br />
phonetic loans in this study, and such cases are not expected to be found in the future.<br />
While it is clear that signific-phonetic characters are at the top of the hierarchy<br />
of preferred character construction and usage principles, phonetic loans at the bottom,<br />
and co-signific characters and semantic loans higher than phonetic loans but below<br />
signific-phonetic characters, the actual ranking of co-signific characters and semantic<br />
loans is unknown, as there are no attested cases in this study of either superseding the<br />
other. Tentatively, they are accorded equal status.<br />
In actuality, the hierarchy is complicated by characters constructed or used<br />
according to indeterminate principles. However, there are cases of signific-phonetic<br />
characters and a semantic loan superseding such characters, although there are no<br />
attested cases in this study of them being superseded by semantic loans or phonetic<br />
loans. Hence, it is unclear what their place in the hierarchy is in relation to semantic<br />
loans and phonetic loans, although the very undesirability of being unable to analyze<br />
how a character is constructed or used suggests that should rank at the very bottom,<br />
below phonetic loans. <strong>The</strong>refore, the hierarchy of character construction and usage<br />
principles from most to least preferred is: signific-phonetic characters, co-signific<br />
characters and semantic loans (equal status), phonetic loans, and indeterminate cases.<br />
134
ou 1 煲 ‘to boil; kettle’, gwui 6 癐 ‘tired’<br />
Signific-Phonetic Characters<br />
kam 2 ‘to cover’ 搇 � 冚<br />
(c.f., ham 6 冚 ‘to cover’)<br />
Co-Signific Characters<br />
cheun 1 膥 ‘animal egg’,<br />
ngan 1 奀 ‘tiny’<br />
guk 6 ‘to bake’ 局 � 焗<br />
lei 6 ‘tongue’ 唎 � 脷<br />
saang 2 ‘to scour’<br />
省 � �口省 � �扌省<br />
Phonetic Loans<br />
Marked Phonetic Loans<br />
ge 3 嘅 ‘genitive particle’, yai 5 �口兮 ‘bad’<br />
dei 6 ‘plural marker’ 地 � 哋<br />
Indeterminate Cases<br />
135<br />
nung 1 ‘to scorch’ 烘 � 燶<br />
Semantic Loans<br />
dau 3 竇 ‘den; nest’<br />
(c.f., dau 6 竇‘hole’),<br />
pok 1 泡 ‘blister’<br />
(c.f., pou 5 泡 ‘blister’)<br />
hong 6 ‘young hen’<br />
項 � �未�成母<br />
Unmarked Phonetic Loans<br />
mat 1 /me 1 乜 ‘what’ (c.f., me 2 乜 ‘to squint’),<br />
ngok 6 咢 ‘to raise the head’ (c.f., ngok 6 咢 ‘to beat a drum’)<br />
lung 5 槓 ‘trunk’, nap 6 湆 ‘sticky’<br />
laat 6 ‘row’<br />
剌� 列/迾<br />
lat 1 ‘to lose; to get rid of’ 甪 � 甩<br />
gat 6 ‘to raise up; to limp’<br />
跀, 觖 � 趷, �走乞, 趌<br />
Figure 7.1: Hierarchy of Character Construction and Usage Principles
Endnotes<br />
1 Wieger (1927) and Li’s (1977) figures are based on the Shuowen jiezi 說文解字 (AD<br />
100), which contains 9353 or 10,516 characters, depending on whether the 1163<br />
character appendix is included in the count. Williams (1909) does not specify his<br />
corpus, but it is clearly a later and larger work.<br />
2<br />
<strong>The</strong> figures from Li Xiaoding’s Hanzi shihua (Taipei: Lianjing, 1977) are cited in<br />
Norman (1988: 267).<br />
3 guk 6 ‘to bake’. ① 局. U+5C40. Williams (1856: 188) kukɔ; Williams (1909<br />
[1874]: 215) “in Cantonese”; Aubazac (1909: 12) kouk4. ② 焗. U+7117. Meyer<br />
(1947: #1329) kûk; O'Melia (1959: 4: 76) kûk; Yue (1972: 363) kʊk 3 “colloquial<br />
character”; Lau (1977: #995) guk 6 “CC”, “Coll.”; Rao (1996: 82) gug 6 .<br />
4 fan 3 ‘to sleep’. ① 訓. U+8A13. Williams (1909 [1874]: 255) “unauthorized”, “in<br />
Cantonese”. ② 瞓. U+7793. Williams (1856: 47) fan ɔ “colloquial word”; Williams<br />
(1909 [1874]: 255) “unauthorized”, “in Cantonese”; Aubazac (1909: 1) fan 3 ; Meyer<br />
(1947: #484) fàn; O'Melia (1959: 4: 33) fàn; Yue (1972: 213) fɐn 44 “colloquial<br />
character”; Lau (1977: #681) fan 3 “CC”; Rao (1996: 56) fen 3 , hen 3 (han 3 ).<br />
5 saang 2 ‘to scour’. ① 省. U+7701. Williams (1909 [1874]: 695) “in Cantonese”.<br />
② �口省. U+35C2. Williams (1856: 425) c sháng “colloquial word”; Williams<br />
(1909 [1874]: 695) “in Cantonese”. ③ �扌省. U+3A18. Aubazac (1909: 26)<br />
sháng 2 ; Meyer (1947: #2604) shaáng; O'Melia (1959: 4: 142) sháang; Yue (1972:<br />
284) sA:ŋ 35 “colloquial character”; Lau (1977: #2620) saang 2 “CC”, “Coll.”; Rao<br />
(1996: 191) sang 2 .<br />
6 lam 1 ‘bud’. ① 啉. U+5549. Williams (1856: 221) clam. ② 林. U+6797.<br />
Williams (1909 [1874]: 526) “in Cantonese”. ③ 冧. U+51A7. Aubazac (1909: 13)<br />
lam 1 ; Meyer (1947: #1478) lam; Yue (1972: 245) lɐm 53 “colloquial character”; Rao<br />
(1996: 125) lem 3 . ④ 菻. U+83FB. Lau (1977: #1800) lam 1o “CC”.<br />
7 dap 1 ‘to hang down’. ① n/a. Williams (1856: 507) tapɔ “colloquial word”. ② 嗒.<br />
U+55D2. Meyer (1947: #3019) tap, t’aàp (taap 3 ); Yue (1972: 248) tɐp 5 “colloquial<br />
character”; Lau (1977: #497) dap 1o “CC”, “Coll.”. ③ �口答. U+35F3. Lau (1977:<br />
#497) dap 1o “CC”, “Coll.”. ④ �耳荅. U+265BF. Rao (1996: 33) deb 1 . ⑤ 耷.<br />
U+8037. Rao (1996: 33) deb 1 .<br />
8 lei 6 ‘tongue’. ① 唎. U+550E. Williams (1909 [1874]: 512) “in Cantonese”. ② 脷.<br />
U+8137. Williams (1856: 235) lí ɔ “colloquial word”; Aubazac (1909: 14) li3; Meyer<br />
136
(1947: #1525) leî; Yue (1972: 254) leĭ 33 “colloquial character”; Lau (1977: #1852) lei 6<br />
“CC”, “Coll.”; Rao (1996: 125) léi 6 .<br />
9 dam 1 ‘to prolong’. ① 泵. U+6CF5. Meyer (1947: #2999) tam; Yue (1972: 245)<br />
tɐm 53 “colloquial character”; Lau (1977: #486) dam 1 “CC”, “Coll.”. ② 揼. U+63FC.<br />
Rao (1996: 37) dem 1 .<br />
10 dam 2 ‘to dump; to pound’. ① 泵. U+6CF5. Williams (1856: 498) c tam<br />
“colloquial word”; Williams (1909 [1874]: 858) “unauthorized”, “in Cantonese”. ②<br />
揼. U+63FC. Aubazac (1909: 30) tam 2 ; Meyer (1947: #3000) tám; O'Melia (1959: 4:<br />
171) tám; Yue (1972: 245) tɐm 35 “colloquial word”; Lau (1977: #487) dam 2 “Coll.”.<br />
③ 抌. U+628C. Rao (1996: 37) dem 2 .<br />
11 dam 3 ‘to drop down’. ① 泵. U+6CF5. Williams (1856: 498) tam ɔ “colloquial<br />
word”; Williams (1909 [1874]: 858) “unauthorized”, “in Cantonese”; Aubazac (1909:<br />
30) tam 3 , tam 1 (dam 1 ); Meyer (1947: #3001) tàm; O'Melia (1959: 4: 171) tàm; Yue<br />
(1972: 245) t’ɐm 44 “colloquial character”. ② �糹泵. U+260A5. Lau (1977: #488)<br />
dam 3 “CC”, “Coll.”. ③ �口�糹泵. Meyer (1947: #3000) tàm. ④ 髧. U+9AE7.<br />
Rao (1996: 37) dem 3 , dem 6 (dam 6 ).<br />
12 lau 1 ‘coat’. ① 蔞. U+851E. Williams (1856: 226) clau “colloquial word”;<br />
Aubazac (1909: 13) lao 1 ; Meyer (1947: #1491) lau. ② 褸. U+8938. Meyer (1947:<br />
#1491) lau; Lau (1977: #1814) lau 1o “CC”; Rao (1996: 128) leo 1 , leo 5 (lau 5 ). ③ 蒟.<br />
U+849F. Williams (1856: 226) clau “colloquial word”.<br />
13 gwaan 3 ‘to fall down’. ① 慣. U+6163. Williams (1856: 211) kwán ɔ ; Meyer<br />
(1947: #1360) kwaàn. ② 躀. U+8E80. Meyer (1947: #1363) kwaàng (gwaang 3 );<br />
O'Melia (1959: 4: 79) kwàang (gwaang 3 ); Yue (1972: 372) kwA:n 44 “colloquial<br />
character”; Lau (1977: #1023) gwaan 3 “CC”; Rao (1996: 79) guan 3 .<br />
14 keui 5 ‘he, she, it’. ① 渠. U+6E20. Williams (1856: 186) ck’ü (keui 4 ); Williams<br />
(1909 [1874]: 222); Rao (1996: 112) kêu 5 . ② �亻渠. U+20372. Williams (1909<br />
[1874]: 222) “in Cantonese”; Rao (1996: 112) kêu 5 . ③ 佢. U+4F62. Williams (1856:<br />
186) c k’ü; Williams (1909 [1874]: 222) “in Cantonese”; Aubazac (1909: 12) k’u2;<br />
Meyer (1947: #1318) k’uĭ; O'Melia (1959: 4: 76) k’ŭi; Yue (1972: 354) k’œy̆ 24 ; Lau<br />
(1977: #1723) kui 5 “CC”; Rao (1996: 112) kêu 5 .<br />
15 sung 3 ‘side dishes’. ① 送. U+9001. Williams (1856: 483) sung ɔ “colloquial<br />
word”; Williams (1909 [1874]: 741) “in Cantonese”; Meyer (1947: #2891) sùng. ②<br />
餸. U+9938. O'Melia (1959: 4: 162) sùng; Yue (1972: 318) sʊŋ 44 “colloquial<br />
character”; Lau (1977: #2976) sung 3 “CC”, “Coll.”; Rao (1996: 209) sung 3 .<br />
137
16 dou 6 ‘ferry’. ① 渡. U+6E21. Williams (1856: 536) tò ɔ ; Williams (1909 [1874]:<br />
849); Meyer (1947: #3149) tô; O'Melia (1959: 4: 184) tô; Yue (1972: 273) toŭ 33 ; Lau<br />
(1977: #580) do 6 . ② 艔. U+8254. Rao (1996: 47) dou 6-2 (dou 6-2 ).<br />
17 saai 1 ‘to waste’. ① n/a. Williams (1856: 404) csái “colloquial word”. ② 徙.<br />
U+5F99. Williams (1909 [1874]: 300) “in Cantonese”. ③ 嘥. U+5625. Williams<br />
(1909 [1874]: 300) “in Cantonese”; Aubazac (1909: 25) sái 1 ; Meyer (1947: #2525)<br />
saai; Yue (1972: 280) sA:ĭ 53 “colloquial character”; Rao (1996: 187) sai 1 . ④ �扌徙.<br />
U+22CDC. Meyer (1947: #2525) saai. ⑤ 漇. U+6F07. Meyer (1947: #2525) saai.<br />
⑥ �扌晒. Lau (1977: #2592) saai 1 “CC”, “Coll.”.<br />
18 nung 1 ‘to scorch’. ① 烘. U+70D8. Williams (1856: 337) cnung “colloquial<br />
word”; Williams (1909 [1874]: 381) “in Cantonese”. ② 灴. U+7074. Williams<br />
(1909 [1874]: 381) “in Cantonese”. ③ 燶. U+71F6. Aubazac (1909: 21) noung 1 ;<br />
Meyer (1947: #2124) nung; Yue (1972: 274) nʊŋ 53 “colloquial character”; Lau (1977:<br />
#2410) nung 1 ; Rao (1996: 175) nung 1 .<br />
19 sit 6 ‘to lose money’. ① 舌. U+820C. Williams (1909 [1874]: 689). ② �貝舌.<br />
U+27D73. Williams (1856: 448) shítɔ “colloquial word”; Williams (1909 [1874]:<br />
6899 “in Cantonese”; Aubazac (1909: 28) shit4; Meyer (1947: #2742) shît; O'Melia<br />
(1959: 4: 152) shît; Yue (1972: 307) si:t 3 “colloquial character”; Lau (1977: #2847)<br />
sit 6 ; Rao (1996: 230) xid 3 (sit 3 ), xig 6 . ③ 餂. U+9902. Williams (1856: 448) shítɔ<br />
“colloquial word”; Williams (1909 [1874]: 689) “in Cantonese”. ④ 蝕. U+8755.<br />
Rao (1996: 230) xid 3 (sit 3 ), xig 6 (sik 6 ).<br />
20 hong 2 ‘young hen’. ① 項. U+9805. Meyer (1947: #1004) kaihông* (gai 1 hong 6 );<br />
Rao (1996: 67) gei 1 hong 6-2 . ② �末�成母. U+236BA. Rao (1996: 67) gei 1 hong 6-2 .<br />
③ n/a. Yue (1972: 358) kaĭ 53 hɔ:ŋ 35 .<br />
21 This may only be an unmarked phonetic loan of a semi-homophonous word, since<br />
Meyer (1947) transcribes it as hông*, where the asterisk denotes an “variant tone”<br />
(“explanatory notes”, 2). Presumably, Meyer meant hong 6-2 , a pronunciation which is<br />
given in later sources (Yue 1972; Rao 1996), but this is not clearly indicated.<br />
However, the choice of hong 6 項 ‘nape’ as the basis of a phonetic loan suggests that<br />
this decision was made while ‘young hen’ was still pronounced gai 1 hong 6 , as given by<br />
earlier sources such as Williams (1856).<br />
138
22 laat 6 ‘row’. ① 剌. U+524C. Williams (1909 [1874]: 492) “in Cantonese”; Meyer<br />
(1947: #1458) laât; Yue (1972: 241) lA:t 3 “colloquial character”. ② 列. U+5217.<br />
Lau (1977: #1789) laat 6 . ③ 迾. U+8FFE. Rao (1996: 117) lad 6 .<br />
23 gat 6 ‘to raise up; to limp’. ① 跀. U+8DC0. Williams (1856: 136) katɔ<br />
“colloquial word”; Williams (1909 [1874]: 959) “in Cantonese”; Aubazac (1909: 47)<br />
kat 4 ; Meyer (1947: #1059) kât; Yue (1972: 337) kɐt 3 “colloquial character”. ② 觖.<br />
U+89D6. Williams (1909 [1874]: 230) “in Cantonese”. ③ 趷. U+8DB7. Aubazac<br />
(1909: 47) kat 4 ; Meyer (1947: #1060) kât; Yue (1972: 337) kɐt 3 “colloquial character”;<br />
Rao (1996: 66) ged 6 . ④ �走乞. U+4798. Rao (1996: 66) ged 6 . ⑤ 趌. U+8D8C.<br />
Rao (1996: 66) ged 6 .<br />
24 lat 1 ‘to lose; to get rid of’. ① 甪. U+752A. Williams (1856: 226) latɔ “colloquial<br />
word”; Williams (1909 [1874]: 542) “in Cantonese”; Aubazac (1909: 13) lat 4 ; Meyer<br />
(1947: #1490) lat; O'Melia (1959: 4: 85) lat. ② 甩. U+7529. Yue (1972: 249) lɐt 5<br />
“colloquial character”; Lau (1977: #1810) lat 1 ° “CC”; Rao (1996: 123) led 1 .<br />
139
CHAPTER 8<br />
CONCLUDING REMARKS<br />
In this work, a methodology has been introduced for analyzing the<br />
<strong>orthographic</strong> <strong>change</strong> in Cantonese dialect characters (which may be extended to<br />
Chinese characters in general) by tracing the written forms used to write a word using<br />
sources where the pronunciation and meaning are reliably indicated. <strong>The</strong>se sources<br />
have been post-mid-nineteenth century bilingual dictionaries, mostly authored by and<br />
for a foreign readership with the aid of native informants. Using a modified model<br />
based on the traditional liushu 六書 model of character construction and usage<br />
principles, the <strong>change</strong>s in the written forms have been analyzed as a transition from<br />
one principle to another, or as principle-internal optimizations. In this way, the<br />
various principles may be ranked by how preferred they are, which in the data set used<br />
yielded (in descending order of preference): signific-phonetic characters, co-signific<br />
and semantic loans (tie), and phonetic loans.<br />
A diachronic study of the various written forms used and what kinds of<br />
<strong>orthographic</strong> <strong>change</strong>s may occur has a number of practical applications, including: 1)<br />
the dating of undated documents by the particular written forms used for certain<br />
words, 2) the identification of living words in earlier documents that are written with<br />
forms that are no longer familiar, 3) additional insights into the etymology of words<br />
140
from their earlier written forms when the etymological links were still recognized, 4)<br />
assessing the most appropriate written form to use given great synchronic variation,<br />
and 5) predicting earlier and future written forms that may be expected to be found.<br />
Certain issues and questions still remain at the end of this work, such as the<br />
possibility of characters whose origins predate the earliest sources used in this study.<br />
Similarly, there may also be chronological issues with having to use later editions of<br />
Meyer (1947) and O’Melia (1959), as they may only reflect the usage that was current<br />
at the time of the first edition 1 . Furthermore, some characters, especially in Meyer<br />
(1947), were not analyzed because they appeared to be idiosyncratic to that work and<br />
rarely appeared in other sources used in the study. A number of the characters in Rao<br />
(1996) also presented a problem, as they were often very different from those in near-<br />
contemporary sources such as Lau (1977) and Yue (1972). It is suspected that there<br />
may be some prescriptivism on the parts of the compilers of Rao (1996) in providing<br />
what is believed to be the ‘etymological character’, or a regional difference in written<br />
forms. It is believed that both cases may be resolved with the addition of more<br />
sources, providing more chronological and regional detail.<br />
Given the constraints and time and resources, certain design decisions had to<br />
be made concerning the kinds of materials used in this study. However, during the<br />
search for sources, a number of non-dictionary sources were discovered and/or<br />
acquired. Some were textbooks and phrasebooks, such as T. Lathrop Stedman and<br />
K.P. Lee’s A Chinese and English Phrase Book in the Canton Dialect (1888), while<br />
others were partial Bible translations of the New Testament. While they did not meet<br />
the requirements for sources outlined in chapter 3 (see section 3.1), they may be used<br />
141
in the future to provide corroborating evidence. Some dictionary sources that did not<br />
meet the requirements, such as Chalmers (1878), may likewise be used to provide<br />
supplementary data.<br />
In some cases, it may be worth relaxing or waiving the requirements altogether<br />
for older sources, such as for Elijah Coleman Bridgman’s A Chinese Chrestomathy in<br />
the Canton Dialect (1841), Robert Morrison’s Vocabulary of the Canton Dialect<br />
(1828), or the eighteenth century Sino-Portuguese glossary in the Aomen Ji Lue<br />
澳門記略 analyzed by Chan (1982, 1994), simply to extend the time period covered to<br />
before the mid-nineteenth century. A number of words, such as m 4 唔 ‘not’, mou 5 冇<br />
‘to not have’, ma 1 孖 ‘twin’, mat 1 乜 ‘what’, and na 2 乸 ‘female’, have always been<br />
written with the same form within the sources used in this study 2 , suggesting that they<br />
have existed much earlier, or that other written forms may be found in earlier sources.<br />
It was fortunate that Samuel Wells Williams’ A Tonic Dictionary of the<br />
Chinese Language in the Canton Dialect (1856) and his A Syllabic Dictionary of the<br />
Chinese Language Arranged According to the Wu-Fang Yüan Yin (1874) 3 were<br />
available, as they essentially represent two editions of a work by the same author,<br />
allowing one to eliminate individual authors’ idiosyncrasies in the choice of written<br />
forms from the equation. However, it was not possible to acquire editions of Bernard<br />
F. Meyer and <strong>The</strong>odore F. Wempe’s <strong>The</strong> Student’s Cantonese-English Dictionary<br />
earlier than the third edition (1947), or Thomas O’Melia’s First-Year Cantonese<br />
earlier than the fourth edition (1959), which were originally published in 1935 and<br />
142
1938, respectively. If these become available, it will be worth comparing how the<br />
written forms in the earlier editions differ from that of latter editions, if at all.<br />
A number of dictionary-like sources that did fit the criteria outlined in chapter<br />
3 (see section 3.1) were discovered or acquired after the eight sources used in this<br />
study had been decided upon, such as William Lobscheid’s A Chinese and English<br />
Dictionary (1871), which would have presented another source of data for Cantonese<br />
dialect characters as used in the late nineteenth century. Near the end of completion of<br />
this work, it was discovered that there finally was interest in reprinting old Cantonese<br />
materials 4 , which will be a welcome remedy to the inevitable dwindling number of<br />
fragile originals that were used in this study. Hopefully, other sources will become<br />
available that will allow filling in some of the underrepresented time periods in this<br />
study. Furthermore, the publication of Kwan-hin Cheung and Robert S. Bauer’s<br />
forthcoming monograph enumerating several hundred Cantonese dialect characters<br />
will provide an appreciated aid in planning an expansion of the data set analyzed in<br />
this work.<br />
143
Endnotes<br />
1 I thank Professor Marjorie Chan for this observation. Hopefully this issue may be<br />
addressable in the future if the earlier editions become available.<br />
2 I thank Professor Marjorie Chan for this observation. <strong>The</strong> character 孖 with a “ma”<br />
reading was used for transliteration in the Sino-Portuguese glossary of Yin Guangren<br />
印廣任 and Zhang Rulin’s 張汝霖 mid-eighteenth century Aomen ji lue 澳門記略,<br />
such as 孖古度 or 孖古路 for Portuguese magro ‘thin’.<br />
3 A 1909 edition by the North China Union College with the entries rearranged was<br />
actually used for this study, but the content is essentially the same as the 1874 original,<br />
and has been treated as such.<br />
4 Ganesha Publishing’s reprints of four nineteenth century Cantonese dictionaries,<br />
including Morrison (1828), Williams (1856), and Williams (1874), scheduled to be<br />
distributed by the <strong>University</strong> of Chicago Press in November 2001.<br />
144
URO:<br />
APPENDIX A<br />
CHARACTERS BY UNICODE CODEPOINT<br />
U+4E2A 个 go 2<br />
U+4E2B 丫 a 1<br />
U+4E5C 乜 mat 1<br />
U+4E78 乸 na 2<br />
U+4F62 佢 keui 5<br />
U+4FC2 係 hai 2<br />
U+5003 倃 gau 6<br />
U+500B 個 go 2<br />
U+507D 偽 ngai 1<br />
U+5187 冇 mou 5<br />
U+519A 冚 ham 6<br />
U+519A 冚 kam 2<br />
U+51A7 冧 lam 1<br />
U+51A7 冧 lam 6<br />
U+5217 列 laat 6<br />
U+524C 剌 laat 6<br />
U+5366 卦 gwa 3<br />
U+536F 卯 mau 1<br />
U+5403 吃 yaak 3<br />
U+5416 吖 a 1<br />
U+5438 吸 ngap 1<br />
U+543D 吽 ngau 6<br />
U+5440 呀 a 1<br />
U+5443 呃 ngak 1<br />
U+5464 呤 laang 6<br />
U+5481 咁 gam 3<br />
U+5497 咗 jo 2<br />
U+54A2 咢 ngok 6<br />
U+54A9 咩 me 1<br />
U+54AA 咪 mai 5<br />
145<br />
that<br />
sentence-final particle<br />
what<br />
female<br />
he, she, it<br />
to be at<br />
lump<br />
that<br />
to beg<br />
to not have<br />
see ham 6 baang 6 laang 6 ‘all’<br />
to cover<br />
bud<br />
to pile up<br />
row<br />
row<br />
sentence-final particle<br />
to squat<br />
to eat<br />
sentence-final particle<br />
to jabber<br />
see ngau 6 dau 6 ‘unwell; stupid’<br />
sentence-final particle<br />
to trick<br />
see ham 6 baang 6 laang 6 ‘all’<br />
so (quantity)<br />
perfective aspect marker<br />
to raise the head<br />
sentence-final particle<br />
don’t
U+54AD 咭 kat 1<br />
U+54AF 咯 lok 3<br />
U+54CB 哋 dei 6<br />
U+54E3 哣 dau 6<br />
U+550E 唎 lei 6<br />
U+5514 唔 m 4<br />
U+551E 唞 tau 2<br />
U+5525 唥 laang 6<br />
U+5528 唨 jo 2<br />
U+552A 唪 baang 6<br />
U+5549 啉 lam 1<br />
U+555D 啝 wo 5<br />
U+5569 啩 gwa 3<br />
U+5571 啱 ngaam 1<br />
U+5572 啲 di 1<br />
U+5587 喇 la 3<br />
U+558A 喊 ham 6<br />
U+558E 喎 wo 5<br />
U+5590 喐 yuk 1<br />
U+55AB 喫 yaak 3<br />
U+55BA 喺 hai 2<br />
U+55BC 喼 gip 1<br />
U+55CC 嗌 aai 3<br />
U+55D2 嗒 dap 1<br />
U+55EE 嗮 saai 3<br />
U+55F0 嗰 go 2<br />
U+5605 嘅 ge 3<br />
U+561C 嘜 mak 1<br />
U+561E 嘞 la 3<br />
U+5622 嘢 ye 5<br />
U+5625 嘥 saai 1<br />
U+5625 嘥 saai 3<br />
U+5649 噉 gam 2<br />
U+564F 噏 ngap 1<br />
U+5664 噤 tam 3<br />
U+569C 嚜 mak 1<br />
U+569F 嚟 lai 4<br />
U+56A1 嚡 haai 4<br />
U+56A4 嚤 mo 1<br />
U+56B9 嚹 la 3<br />
U+56BF 嚿 gau 6<br />
U+5730 地 dei 6<br />
146<br />
card<br />
sentence-final particle<br />
plural marker<br />
see ngau 6 dau 6 ‘unwell; stupid’<br />
tongue<br />
not<br />
to rest<br />
see ham 6 baang 6 laang 6 ‘all’<br />
perfective aspect marker<br />
see ham 6 baang 6 laang 6 ‘all’<br />
bud<br />
sentence-final particle<br />
sentence-final particle<br />
correct<br />
some<br />
sentence-final particle<br />
see ham 6 baang 6 laang 6 ‘all’<br />
sentence-final particle<br />
to move<br />
to eat<br />
to be at<br />
bag<br />
to yell<br />
to hang down<br />
quantifying particle<br />
that<br />
genitive particle<br />
mark<br />
sentence-final particle<br />
thing<br />
to waste<br />
quantifying particle<br />
so (manner)<br />
to jabber<br />
to deceive<br />
mark<br />
to come<br />
coarse<br />
slow<br />
sentence-final particle<br />
lump<br />
plural marker
U+57DC 埜 ye 5<br />
U+5940 奀 ngan 1<br />
U+5940 奀 ngan 3<br />
U+5B32 嬲 nau 1<br />
U+5B56 孖 ma 1<br />
U+5B6D 孭 me 1<br />
U+5B7B 孻 laai 1<br />
U+5C40 局 guk 6<br />
U+5CA9 岩 ngaam 1<br />
U+5F0A 弊 bai 6<br />
U+5F99 徙 saai 1<br />
U+5F99 徙 saai 3<br />
U+60D7 惗 nam 2<br />
U+60F1 惱 nau 1<br />
U+6163 慣 gwaan 3<br />
U+6254 扔 wing 1<br />
U+6264 扤 ngat 1<br />
U+6296 抖 tau 2<br />
U+62CE 拎 ning 1<br />
U+6382 掂 dim 6<br />
U+639F 掟 deng 3<br />
U+63AF 掯 kang 3<br />
U+63B9 掹 mang 1<br />
U+63FC 揼 dam 1<br />
U+63FC 揼 dam 2<br />
U+63FE 揾 wan 2<br />
U+6407 搇 kam 2<br />
U+6423 搣 mit 1<br />
U+642D 搭 dap 6<br />
U+6435 搵 wan 2<br />
U+6469 摩 mo 1<br />
U+6498 撘 dap 6<br />
U+649A 撚 nan 2<br />
U+64C1 擁 ung 2<br />
U+64DD 擝 mang 1<br />
U+64F0 擰 ning 1<br />
U+64F8 擸 laap 3<br />
U+651E 攞 lo 2<br />
U+6541 敁 dim 6<br />
U+6562 敢 gam 2<br />
U+6625 春 cheun 1<br />
U+6652 晒 saai 3<br />
147<br />
thing<br />
tiny<br />
to jiggle the feet<br />
angry<br />
twin<br />
to carry on the back<br />
last (child)<br />
to bake<br />
correct<br />
bad<br />
to waste<br />
quantifying particle<br />
to think<br />
angry<br />
to fall down<br />
to throw away<br />
to cram<br />
to rest<br />
to carry; to bring<br />
straight<br />
to throw<br />
capable<br />
to pull<br />
to prolong<br />
to dump; to pound<br />
to find<br />
to cover<br />
to pinch; to tear<br />
to pound<br />
to find<br />
slow<br />
to pound<br />
to play with<br />
to push<br />
to pull<br />
to carry; to bring<br />
to gather together<br />
to take<br />
straight<br />
so (manner)<br />
animal egg<br />
quantifying particle
U+66F1 曱 gaat 6<br />
U+66F3 曳 yai 5<br />
U+6717 朗 long 2<br />
U+676C 杬 laam 2<br />
U+6797 林 lam 1<br />
U+67B6 架 ga 3<br />
U+68F5 棵 po 1<br />
U+69D3 槓 lung 5<br />
U+6A16 樖 po 1<br />
U+6B16 欖 laam 2<br />
U+6B6A 歪 me 2<br />
U+6C39 氹 tam 5<br />
U+6CE1 泡 pok 1<br />
U+6CF5 泵 dam 1<br />
U+6CF5 泵 dam 2<br />
U+6CF5 泵 dam 3<br />
U+6E20 渠 keui 5<br />
U+6E21 渡 dou 6<br />
U+6E46 湆 nap 6<br />
U+6F07 漇 saai 1<br />
U+7074 灴 nung 1<br />
U+70D8 烘 nung 1<br />
U+7117 焗 guk 6<br />
U+712B 焫 naat 3<br />
U+7172 煲 bou 1<br />
U+71F6 燶 nung 1<br />
U+7518 甘 gam 3<br />
U+7529 甩 lat 1<br />
U+752A 甪 lat 1<br />
U+7534 甴 jaat 6<br />
U+7650 癐 gwui 6<br />
U+7684 的 di 1<br />
U+7701 省 saang 2<br />
U+7732 眲 ngak 1<br />
U+7793 瞓 fan 3<br />
U+77F4 矴 deng 3<br />
U+7A14 稔 nam 2<br />
U+7A9E 窞 tam 5<br />
U+7AC7 竇 dau 3<br />
U+7B2A 笪 daat 3<br />
U+7B87 箇 go 2<br />
U+7BE2 篢 lung 5<br />
148<br />
see gaat 6 jaat 6 ‘cockroach’<br />
bad<br />
to rinse<br />
olive<br />
bud<br />
sentence-final particle<br />
classifier for plants<br />
trunk<br />
classifier for plants<br />
olive<br />
crooked<br />
pit; cesspool<br />
blister<br />
to prolong<br />
to dump; to pound<br />
to drop down<br />
he, she, it<br />
ferry<br />
sticky<br />
to waste<br />
to scorch<br />
to scorch<br />
to bake<br />
to burn<br />
to boil; kettle<br />
to scorch<br />
so (quantity)<br />
to lose; to get rid of<br />
to lose; to get rid of<br />
see gaat 6 jaat 6 ‘cockroach’<br />
tired<br />
some<br />
to scour<br />
to trick<br />
to sleep<br />
to throw<br />
to think<br />
pit; cesspool<br />
den; nest<br />
spot<br />
that<br />
trunk
U+7C73 米 mai 5<br />
U+7DD9 緙 kwaak 1<br />
U+7DFC 緼 wan 3<br />
U+7E15 縕 wan 3<br />
U+7E88 纈 lit 3<br />
U+7F45 罅 la 3<br />
U+7F67 罧 lam 6<br />
U+8037 耷 dap 1<br />
U+8137 脷 lei 6<br />
U+814D 腍 nam 4<br />
U+81A5 膥 cheun 1<br />
U+81B6 膶 yeun 6<br />
U+820C 舌 sit 6<br />
U+8254 艔 dou 6<br />
U+83FB 菻 lam 1<br />
U+849F 蒟 lau 1<br />
U+851E 蔞 lau 1<br />
U+8755 蝕 sit 6<br />
U+8938 褸 lau 1<br />
U+89D6 觖 gat 6<br />
U+8A13 訓 fan 3<br />
U+8AD7 諗 nam 2<br />
U+8D8C 趌 gat 6<br />
U+8DB7 趷 gat 6<br />
U+8DC0 跀 gat 6<br />
U+8E0E 踎 mau 1<br />
U+8E58 蹘 mau 1<br />
U+8E80 躀 gwaan 3<br />
U+8E9D 躝 laan 1<br />
U+8FFE 迾 laat 6<br />
U+9001 送 sung 3<br />
U+9017 逗 dau 6<br />
U+90C1 郁 yuk 1<br />
U+91CE 野 ye 5<br />
U+9209 鈉 naat 3<br />
U+93EC 鏬 la 3<br />
U+9628 阨 ngak 1<br />
U+963B 阻 jo 2<br />
U+978B 鞋 haai 4<br />
U+97DE 韞 wan 3<br />
U+9805 項 hong 2<br />
U+9902 餂 sit 6<br />
149<br />
don’t<br />
loop; to loop<br />
to confine<br />
to confine<br />
knot<br />
sentence-final particle<br />
to pile up<br />
to hang down<br />
tongue<br />
tender<br />
animal egg<br />
animal liver<br />
to lose money<br />
ferry<br />
bud<br />
coat<br />
coat<br />
to lose money<br />
coat<br />
to raise up; to limp<br />
to sleep<br />
to think<br />
to raise up; to limp<br />
to raise up; to limp<br />
to raise up; to limp<br />
to squat<br />
to squat<br />
to fall down<br />
to crawl<br />
row<br />
side dishes<br />
see ngau 6 dau 6 ‘unwell; stupid’<br />
to move<br />
thing<br />
to burn<br />
sentence-final particle<br />
to trick<br />
perfective aspect marker<br />
coarse<br />
to confine<br />
young hen<br />
to lose money
U+9938 餸 sung 3<br />
U+9AE7 髧 dam 3<br />
U+9ECE 黎 lai 4<br />
U+9ED0 黐 chi 1<br />
CJK Extension A:<br />
U+34E4 �吉刂 gat 1<br />
U+35C2 �口省 saang 2<br />
U+35CE �口架 ga 3<br />
U+35F3 �口答 dap 1<br />
U+3664 �土虖 la 3<br />
U+39DC �扌匝 laap 3<br />
U+39EC �巩手 ung 2<br />
U+3A06 �扌林 lam 6<br />
U+3A18 �扌省 saang 2<br />
U+3A97 �咅攴 tau 2<br />
U+3DDB �保灬 bou 1<br />
U+4798 �走乞 gat 6<br />
U+47F4 �足辰 ngan 3<br />
CJK Extension B:<br />
150<br />
side dishes<br />
to drop down<br />
to come<br />
to stick<br />
U+20372 �亻渠 keui 5<br />
U+20BCB �口兮 yai 5<br />
U+20C41 �口氹 tam 3<br />
U+20C53 �口危 ngai 1<br />
U+20D15 �口妙 miu 2<br />
U+20DA7 �口店 dim 6<br />
U+20E78 �口衾 kam 2<br />
U+20E98 �口浪 long 2<br />
U+20F2E �口偽 ngai 1<br />
U+20FB4 �口棒 baang 6<br />
U+20FD1 �口感 ham 6<br />
U+2103E �口�日絲 ngap 1<br />
U+210C7 �口弊 bai 6<br />
U+210C8 �口緙 kwaak 1<br />
U+210C9 �口駕 ga 3<br />
U+21681 �敝大 bai 6<br />
U+22AD5 �扌永 wing 1<br />
U+22B2E �扌戎 ung 2<br />
U+22C55 �扌耷 dap 6<br />
to stab<br />
to scour<br />
sentence-final particle<br />
to hang down<br />
sentence-final particle<br />
to gather together<br />
to push<br />
to pile up<br />
to scour<br />
to rest<br />
to boil; kettle<br />
to raise up; to limp<br />
to jiggle the feet<br />
he, she, it<br />
bad<br />
to deceive<br />
to beg<br />
to purse the lips<br />
straight<br />
to cover<br />
to rinse<br />
to beg<br />
see ham 6 baang 6 laang 6<br />
see ham 6 baang 6 laang 6<br />
to jabber<br />
bad<br />
loop; to loop<br />
sentence-final particle<br />
bad<br />
to throw away<br />
to push<br />
to pound
U+22CC6 �扌敖 ngou 4<br />
U+22CDC �扌徙 saai 1<br />
U+236BA �未�成母 hong 2<br />
U+23CB7 �氵�囗又 nap 6<br />
U+24DB8 �疒那 na 1<br />
U+24E3B �疒拏 na 1<br />
U+25EFF �米离 chi 1<br />
U+25F1D �米禽 chi 1<br />
U+260A5 �糹泵 dam 3<br />
U+265BF �耳荅 dap 1<br />
U+2688A �月暴 pok 1<br />
U+269F2 �舌累 leu 1<br />
U+27A3E �言��冖八木 tam 3<br />
U+27D2F �貝子 me 1<br />
U+27D2F �貝子 me 1<br />
U+27D73 �貝舌 sit 6<br />
U+280BE �足南 laam 3<br />
U+2814F �足嵐 laam 3<br />
U+28EF2 �阝虖 la 3<br />
U+294E5 �岳頁 ngok 6<br />
U+2994B �馬馬 ngau 6<br />
Not in Unicode as of Version 3.1:<br />
�口兜 dau 3<br />
�口扱 dap 6<br />
�口捧 baang 6<br />
�口掹 mang 1<br />
�口揖 ngap 1<br />
�口笞 chi 1<br />
�口鏬 la 3<br />
�口隘 aai 3<br />
�口隙 kwaak 1<br />
�口�糹泵 dam 3<br />
�扌冘 dam 2<br />
�扌寍 ning 1<br />
�扌寕 ning 1<br />
�扌晒 saai 1<br />
�扌灭 mit 1<br />
�未�成肉 cheun 1<br />
�火�保衣 bou 1<br />
�韋昷 wan 3<br />
den; nest<br />
to pound<br />
see ham 6 baang 6 laang 6 ‘all’<br />
to pull<br />
to jabber<br />
to stick<br />
sentence-final particle<br />
to yell<br />
loop; to loop<br />
to drop down<br />
to dump; to pound<br />
to carry; to bring<br />
to carry; to bring<br />
to waste<br />
to pinch; to tear<br />
animal egg<br />
to boil; kettle<br />
to confine<br />
151<br />
to shake<br />
to waste<br />
young hen<br />
sticky<br />
scar<br />
scar<br />
to stick<br />
to stick<br />
to drop down<br />
to hang down<br />
blister<br />
to spit out<br />
to deceive<br />
sentence-final particle<br />
to carry on the back<br />
to lose money<br />
to step over<br />
to step over<br />
sentence-final particle<br />
to raise the head
�宀甾 tam 5<br />
�敖手 ngou 4<br />
pit; cesspool<br />
to shake<br />
152
APPENDIX B<br />
CHARACTERS BY SYLLABLE<br />
a 1 ‘ sentence-final particle’ 丫 U+4E2B<br />
吖 U+5416<br />
呀 U+5440<br />
aai 3 ‘ to yell’ 嗌 U+55CC<br />
�口隘<br />
baang 6 (of ham 6 baang 6 laang 6 ‘all’) 唪 U+552A<br />
�口棒 U+20FB4<br />
�口捧<br />
bai 6 ‘ bad’ 弊 U+5F0A<br />
�口弊 U+210C7<br />
�敝大 U+21681<br />
bou 1 ‘ to boil; kettle’ �保灬 U+3DDB<br />
煲 U+7172<br />
�火�保衣<br />
cheun 1 ‘ animal egg’ 春 U+6625<br />
膥 U+81A5<br />
�未�成肉<br />
chi 1 ‘ to stick’ 黐 U+9ED0<br />
�米离 U+25EFF<br />
�米禽 U+25F1D<br />
�口笞<br />
daat 3 ‘ spot’ 笪 U+7B2A<br />
dam 1 ‘ to prolong’ 揼 U+63FC<br />
泵 U+6CF5<br />
153
dam 2 ‘ to dump; to pound’ 揼 U+63FC<br />
泵 U+6CF5<br />
�扌冘<br />
dam 3 ‘ to drop down’ 泵 U+6CF5<br />
髧 U+9AE7<br />
�糹泵 U+260A5<br />
�口�糹泵<br />
dap 1 ‘ to hang down’ �口答 U+35F3<br />
嗒 U+55D2<br />
耷 U+8037<br />
�耳荅 U+265BF<br />
dap 6 ‘ to pound’ 搭 U+642D<br />
撘 U+6498<br />
�扌耷 U+22C55<br />
�口扱<br />
dau 3 ‘ den; nest’ 竇 U+7AC7<br />
�口兜<br />
dau 6 (of ngau 6 dau 6 ‘unwell; stupid’) 哣 U+54E3<br />
逗 U+9017<br />
dei 6 ‘ plural marker’ 哋 U+54CB<br />
地 U+5730<br />
deng 3 ‘ to throw’ 掟 U+639F<br />
矴 U+77F4<br />
di 1 ‘ some’ 啲 U+5572<br />
的 U+7684<br />
dim 6 ‘ straight’ 掂 U+6382<br />
敁 U+6541<br />
�口店 U+20DA7<br />
dou 6 ‘ ferry’ 渡 U+6E21<br />
艔 U+8254<br />
fan 3 ‘ to sleep’ 瞓 U+7793<br />
訓 U+8A13<br />
154
ga 3 ‘ sentence-final particle’ �口架 U+35CE<br />
架 U+67B6<br />
�口駕 U+210C9<br />
gaat 6 of (gaat 6 jaat 6 ‘cockroach’) 曱 U+66F1<br />
gam 2 ‘ so (manner)’ 噉 U+5649<br />
敢 U+6562<br />
gam 3 ‘ so (quantity)’ 咁 U+5481<br />
甘 U+7518<br />
gat 1 ‘ to stab’ �吉刂 U+34E4<br />
gat 6 ‘ to raise up; to limp’ �走乞 U+4798<br />
觖 U+89D6<br />
趌 U+8D8C<br />
趷 U+8DB7<br />
跀 U+8DC0<br />
gau 6 ‘ lump’ 倃 U+5003<br />
嚿 U+56BF<br />
ge 3 ‘ genitive particle’ 嘅 U+5605<br />
gip 1 ‘ bag’ 喼 U+55BC<br />
go 2 ‘ that’ 个 U+4E2A<br />
個 U+500B<br />
嗰 U+55F0<br />
箇 U+7B87<br />
guk 6 ‘ to bake’ 局 U+5C40<br />
焗 U+7117<br />
gwa 3 ‘ sentence-final particle’ 卦 U+5366<br />
啩 U+5569<br />
gwaan 3 ‘ to fall down’ 慣 U+6163<br />
躀 U+8E80<br />
gwui 6 ‘ tired’ 癐 U+7650<br />
155
haai 4 ‘ coarse’ 嚡 U+56A1<br />
鞋 U+978B<br />
hai 2 ‘ to be at’ 係 U+4FC2<br />
喺 U+55BA<br />
ham 6 (of ham 6 baang 6 laang 6 ‘all’) 冚 U+519A<br />
喊 U+558A<br />
�口感 U+20FD1<br />
hong 2 ‘ young hen’ 項 U+9805<br />
�未�成母 U+236BA<br />
jaat 6 (of gaat 6 jaat 6 ‘cockroach’) 甴 U+7534<br />
jo 2 ‘ perfective aspect marker’ 咗 U+5497<br />
唨 U+5528<br />
阻 U+963B<br />
kam 2 ‘ to cover’ 冚 U+519A<br />
搇 U+6407<br />
�口衾 U+20E78<br />
kang 3 ‘ capable’ 掯 U+63AF<br />
kat 1 ‘ card’ 咭 U+54AD<br />
keui 5 ‘ he, she, it’ 佢 U+4F62<br />
渠 U+6E20<br />
�亻渠 U+20372<br />
kwaak 1 ‘ loop; to loop’ 緙 U+7DD9<br />
�口緙 U+210C8<br />
�口隙<br />
la 3 ‘ sentence-final particle’ �土虖 U+3664<br />
喇 U+5587<br />
嘞 U+561E<br />
嚹 U+56B9<br />
罅 U+7F45<br />
鏬 U+93EC<br />
�阝虖 U+28EF2<br />
�口鏬<br />
156
laai 1 ‘ last (child)’ 孻 U+5B7B<br />
laam 2 ‘ olive’ 杬 U+676C<br />
欖 U+6B16<br />
laam 3 ‘ to step over’ �足南 U+280BE<br />
�足嵐 U+2814F<br />
laan 1 ‘ to crawl’ 躝 U+8E9D<br />
laang 6 (of ham 6 baang 6 laang 6 ‘all’) 呤 U+5464<br />
唥 U+5525<br />
laap 3 ‘ to gather together’ �扌匝 U+39DC<br />
擸 U+64F8<br />
laat 6 ‘ row’ 列 U+5217<br />
剌 U+524C<br />
迾 U+8FFE<br />
lai 4 ‘ to come’ 嚟 U+569F<br />
黎 U+9ECE<br />
lam 1 ‘ bud’ 冧 U+51A7<br />
啉 U+5549<br />
林 U+6797<br />
菻 U+83FB<br />
lam 6 ‘ to pile up’ �扌林 U+3A06<br />
冧 U+51A7<br />
罧 U+7F67<br />
lat 1 ‘ to lose; to get rid of’ 甩 U+7529<br />
甪 U+752A<br />
lau 1 ‘ coat’ 蒟 U+849F<br />
蔞 U+851E<br />
褸 U+8938<br />
lei 6 ‘ tongue’ 唎 U+550E<br />
脷 U+8137<br />
leu 1 ‘ to spit out’ �舌累 U+269F2<br />
157
lit 3 ‘ knot’ 纈 U+7E88<br />
lo 2 ‘ to take’ 攞 U+651E<br />
lok 3 ‘ sentence-final particle’ 咯 U+54AF<br />
long 2 ‘ to rinse’ 朗 U+6717<br />
�口浪 U+20E98<br />
lung 5 ‘ trunk’ 槓 U+69D3<br />
篢 U+7BE2<br />
m 4 ‘ not’ 唔 U+5514<br />
ma 1 ‘ twin’ 孖 U+5B56<br />
mai 5 ‘ don’t’ 咪 U+54AA<br />
米 U+7C73<br />
mak 1 ‘ mark’ 嘜 U+561C<br />
嚜 U+569C<br />
mang 1 ‘ to pull’ 掹 U+63B9<br />
擝 U+64DD<br />
�口掹<br />
mat 1 ‘ what’ 乜 U+4E5C<br />
mau 1 ‘ to squat’ 卯 U+536F<br />
踎 U+8E0E<br />
蹘 U+8E58<br />
me 1 ‘ sentence-final particle’ 咩 U+54A9<br />
�貝子 U+27D2F<br />
me 1 ‘ to carry on the back’ 孭 U+5B6D<br />
�貝子 U+27D2F<br />
me 2 ‘ crooked’ 歪 U+6B6A<br />
mit 1 ‘to pinch; to tear’ 搣 U+6423<br />
�扌灭<br />
158
miu 2 ‘ to purse the lips’ �口妙 U+20D15<br />
mo 1 ‘ slow’ 嚤 U+56A4<br />
摩 U+6469<br />
mou 5 ‘ to not have’ 冇 U+5187<br />
na 1 ‘ scar’ �疒那 U+24DB8<br />
�疒拏 U+24E3B<br />
na 2 ‘ female’ 乸 U+4E78<br />
naat 3 ‘ to burn’ 焫 U+712B<br />
鈉 U+9209<br />
nam 2 ‘ to think’ 惗 U+60D7<br />
稔 U+7A14<br />
諗 U+8AD7<br />
nam 4 ‘ tender’ 腍 U+814D<br />
nan 2 ‘ to play with’ 撚 U+649A<br />
nap 6 ‘ sticky’ 湆 U+6E46<br />
�氵�囗又 U+23CB7<br />
nau 1 ‘ angry’ 嬲 U+5B32<br />
惱 U+60F1<br />
ngaam 1 ‘ correct’ 啱 U+5571<br />
岩 U+5CA9<br />
ngai 1 ‘ to beg’ 偽 U+507D<br />
�口危 U+20C53<br />
�口偽 U+20F2E<br />
ngak 1 ‘ to trick’ 呃 U+5443<br />
眲 U+7732<br />
阨 U+9628<br />
ngan 1 ‘ tiny’ 奀 U+5940<br />
159
ngan 3 ‘ to jiggle the feet’ �足辰 U+47F4<br />
奀 U+5940<br />
ngap 1 ‘ to jabber’ 吸 U+5438<br />
噏 U+564F<br />
�口�日絲 U+2103E<br />
�口揖<br />
ngat 1 ‘ to cram’ 扤 U+6264<br />
ngau 6 (of ngau 6 dau 6 ‘unwell; stupid’)吽 U+543D<br />
�馬馬 U+2994B<br />
ngok 6 ‘ to raise the head’ 咢 U+54A2<br />
�岳頁 U+294E5<br />
ngou 4 ‘ to shake’ �扌敖 U+22CC6<br />
�敖手<br />
ning 1 ‘ to carry; to bring’ 拎 U+62CE<br />
擰 U+64F0<br />
�扌寍<br />
�扌寕<br />
nung 1 ‘ to scorch’ 灴 U+7074<br />
烘 U+70D8<br />
燶 U+71F6<br />
po 1 ‘ classifier for plants’ 棵 U+68F5<br />
樖 U+6A16<br />
pok 1 ‘ blister’ 泡 U+6CE1<br />
�月暴 U+2688A<br />
saai 1 ‘ to waste’ 嘥 U+5625<br />
徙 U+5F99<br />
漇 U+6F07<br />
�扌徙 U+22CDC<br />
�扌晒<br />
160
saai 3 ‘ quantifying particle’ 嗮 U+55EE<br />
嘥 U+5625<br />
徙 U+5F99<br />
晒 U+6652<br />
saang 2 ‘ to scour’ �口省 U+35C2<br />
�扌省 U+3A18<br />
省 U+7701<br />
sit 6 ‘ to lose money’ 舌 U+820C<br />
蝕 U+8755<br />
餂 U+9902<br />
�貝舌 U+27D73<br />
sung 3 ‘ side dishes’ 送 U+9001<br />
餸 U+9938<br />
tam 3 ‘ to deceive’ 噤 U+5664<br />
�口氹 U+20C41<br />
�言��冖八木 U+27A3E<br />
tam 5 ‘ pit; cesspool’ 氹 U+6C39<br />
窞 U+7A9E<br />
�宀甾<br />
tau 2 ‘ to rest’ �咅攴 U+3A97<br />
唞 U+551E<br />
抖 U+6296<br />
ung 2 ‘ to push’ �巩手 U+39EC<br />
擁 U+64C1<br />
�扌戎 U+22B2E<br />
`<br />
wan 2 ‘ to find’ 揾 U+63FE<br />
搵 U+6435<br />
wan 3 ‘ to confine’ 緼 U+7DFC<br />
縕 U+7E15<br />
韞 U+97DE<br />
�韋昷<br />
wing 1 ‘ to throw away’ 扔 U+6254<br />
�扌永 U+22AD5<br />
161
wo 5 ‘ sentence-final particle’ 啝 U+555D<br />
喎 U+558E<br />
yaak 3 ‘ to eat’ 吃 U+5403<br />
喫 U+55AB<br />
yai 5 ‘ bad’ 曳 U+66F3<br />
�口兮 U+20BCB<br />
ye 5 ‘ thing’ 嘢 U+5622<br />
埜 U+57DC<br />
野 U+91CE<br />
yeun 6 ‘ animal liver’ 膶 U+81B6<br />
yuk 1 ‘ to move’ 喐 U+5590<br />
郁 U+90C1<br />
162
BIBLIOGRAPHY<br />
American Bible Society. 1899. <strong>The</strong> Gospel According to St. Matthew in English and<br />
Cantonese. Shanghai: American Presbyterian Mission Press.<br />
———. 1900. <strong>The</strong> Acts of the Apostles in English and Cantonese. Shanghai:<br />
American Presbyterian Press.<br />
———. 1910. <strong>The</strong> Gospel According to St. Luke in English and Cantonese.<br />
Shanghai: No publisher.<br />
American Tract Society. 1893. Chinese and English Dictionary Compiled from<br />
Reliable Authors. New York: American Tract Society.<br />
Aubazac, Louis. 1909. Liste des Caractères les Plus Usuels de la Langue<br />
Cantonnaise. Hong Kong: Société des Missions-Étrangères.<br />
Ball, James Dyer. 1894. Readings in Cantonese Colloquial. Hong Kong: Kelly &<br />
Walsh.<br />
———. 1908. <strong>The</strong> Cantonese Made Easy Vocabulary. 3rd edition. Hong Kong:<br />
Kelly & Walsh.<br />
——— and A. Dyer Ball. 1924. Cantonese Made Easy. 4th edition. Hong Kong:<br />
Kelly & Walsh.<br />
Bauer, Robert S. 1988. “Written Cantonese of Hong Kong”. Cahiers de Linguistique<br />
Asie Orientale 17, no. 2: 245-293.<br />
Bishop, Tom, et al. 2000. Wenlin 文林 version 2.5 computer program. Portland, OR:<br />
Wenlin Institute, Inc.<br />
Boltz, William G. 1996. “Early Chinese Writing”. In Peter T. Daniels and William<br />
Bright, eds., <strong>The</strong> World’s Writing Systems. New York: Oxford <strong>University</strong><br />
Press, 191-199.<br />
163
Bridgman, Elijah Coleman. 1841. A Chinese Chrestomathy in the Canton Dialect.<br />
Macao: S.W. Williams.<br />
British and Foreign Bible Society. 1901. <strong>The</strong> Gospel of John in Cantonese<br />
Colloquial. Canton: <strong>The</strong> British and Foreign Bible Society.<br />
Cai Jiannan 蔡劍南. 1998. Quantu chuanyi cangjiema zidian 全圖傳意倉頡碼字典.<br />
Hong Kong: Wanli jiegou 萬里機構.<br />
Chalmers, John. 1878. An English and Cantonese Dictionary. 5th edition. Hong<br />
Kong: De Souza & Co.<br />
Chan, Marjorie K.M. 陳潔雯. 1982. “A Response to Boltz’ Notes on Cantonese<br />
Dentilabialization”. Journal of the American Oriental Society (JAOS) l02, no.<br />
1: 107-109.<br />
———. 1984. “Initial Consonant Clusters in Old Chinese: Evidence from<br />
Sesquisyllabic Words in the Yue Dialects”. Fangyan 方言 4, 300-313.<br />
———. 1994. “Post-stopped Nasals and Lateral Flaps in the Zhongshan (Yue)<br />
Dialect: A Study of a Mid-Eighteenth Century Sino-Portuguese Glossary”. In<br />
Paul Jen-kuei Li, Chu-Ren Huang, and Chih-chen Tang, eds., Chinese<br />
Languages and Linguistics, vol. 2. Historical Linguistics (Symposium Series of<br />
the Institute of History and Philology, Academia Sinica, no. 2.). Taipei:<br />
Institute of History and Philology, Academia Sinica, 203-250.<br />
Chen Bohui 陳伯煇. 1998. Lun Yue fangyan ci benzi kao shi 論粵方言詞本子考釋.<br />
Hong Kong: Zhonghua 中華.<br />
——— and Wu Weixiong 吳偉雄. 1998. Shenghuo Yueyu benzi qutan<br />
生活粵語本子趣談. Hong Kong: Zhonghua 中華.<br />
Chao, Yuen Ren 趙元任. 1947. Cantonese Primer. New York: Greenwood Press,<br />
Publishers.<br />
——— and Lien Sheng Yang 楊聯陞. 1947. Concise Dictionary of Spoken Chinese.<br />
Cambridge, MA: Harvard <strong>University</strong> Press.<br />
Cheung, Kwan-hin 張群顯 and Robert S. Bauer. forthcoming. <strong>The</strong> Representation of<br />
Cantonese with Chinese Characters 以漢字寫粵語. Journal of Chinese<br />
Linguistics monograph. A July 26, 2001 pre-publication draft was used.<br />
164
Chou, Fa-kao 周法高. 1986. “A Comparative Study of the Simplified Characters as<br />
Used in Mainland China, Singapore, and Japan”. In his Papers in Chinese<br />
Linguistics and Epigraphy. Hong Kong: Chinese <strong>University</strong> Press, 55-69.<br />
Condit, I.M. 1888. English and Chinese Reader with a Dictionary. Shanghai:<br />
Huamei 華美.<br />
Cowles, Roy T. 1965. <strong>The</strong> Cantonese Speaker’s Dictionary. Hong Kong: Hong<br />
Kong <strong>University</strong> Press.<br />
DeFrancis, John. 1984. <strong>The</strong> Chinese Language: Fact and Fantasy. Honolulu:<br />
<strong>University</strong> of Hawaii Press.<br />
Eitel, Ernest John and Immanuel Gottlieb Genähr. 1910. A Chinese-English<br />
Dictionary in the Cantonese Dialect. 2nd edition. 2 vols. Hong Kong: Kelly<br />
& Walsh, Limited.<br />
Fenn, Courtenay H., Chin Hsien Tseng 金憲曾, and George D. Wilder. 1942. Fenn’s<br />
Chinese-English Pocket-Dictionary. 5th edition (revised American edition).<br />
Cambridge, MA: Harvard <strong>University</strong> Press.<br />
Gunn, Edward. 1991. Rewriting Chinese: Style and Innovation in Twentieth-Century<br />
Chinese Prose. Stanford, CA: Stanford <strong>University</strong> Press.<br />
He Wenhui 何文匯 and Zhu Guofan 朱國藩, eds. 1999. Yueyin zhengdu zihui<br />
粵音正讀字彙. Hong Kong: Xianggang jiaoyu 香港教育.<br />
Huang Xiling 黃錫凌 (S.L. Wong). 1941. Yueyu yunhui 粵語韻彙 (A Chinese<br />
Syllabary Pronounced According to the Dialect of Canton). Hong Kong:<br />
Zhonghua 中華.<br />
HYDZD Xu Zhongshu 徐中舒, et al. 1986. Hanyu da zidian 漢語大字典. 8 vols.<br />
Chengdu: Sichuan cishu 四川辭書. Reprinted as a fantizi 繁體字 (traditional<br />
character) edition. New York: U.S. International Publishing Inc.<br />
International Organization for Standardization (ISO). 2001. ISO/IEC FDIS 10646-2:<br />
Information Technology—Universal Multiple-Octet Coded Character Set<br />
(UCS)—Part 2: Supplementary Planes. Switzerland: ISO.<br />
Jones, Daniel and Kwing-tong Woo. 1912. A Cantonese Phonetic Reader. London:<br />
<strong>University</strong> of London Press.<br />
Kangxi zidian 康熙字典. 1716. Reprinted by Beijing: Zhonghua 中華.<br />
165
Karlgren, Bernhard. 1923. Analytic Dictionary of Chinese and Sino-Japanese. Paris:<br />
Librairie Orientaliste Paul Geuthner. Reprinted by Mineola, NY: Dover<br />
Publications, Inc.<br />
Lau, Sidney 劉錫祥. 1977. A Practical Cantonese-English Dictionary. Hong Kong:<br />
<strong>The</strong> Government Printer.<br />
Leng Yulong 冷玉龍, Wei Yixin 韋一心, et al. 1994. Zhonghua zi hai 中華字海.<br />
Beijing: Zhonghua 中華.<br />
Li, Fang-kuei 李方桂. 1937. “Languages and Dialects”. <strong>The</strong> Chinese Yearbook.<br />
Reprinted in 1973 with revisions, Journal of Chinese Linguistics 1, no. 1: 1-13.<br />
Lobscheid, William. 1871. A Chinese and English Dictionary. Hong Kong: Noronha<br />
& Sons.<br />
Lunde, Ken. 1999. CJKV Information Processing. Sebastopol, CA: O’Reilly &<br />
Associates.<br />
Mathews, R.H., et al. 1943. Mathews’ Chinese-English Dictionary. Revised<br />
American edition. Cambridge, MA: Harvard <strong>University</strong> Press.<br />
Matthews, Stephen and Virginia Yip. 1994. Cantonese: A Comprehensive Grammar.<br />
New York: Routledge.<br />
Meyer, Bernard F. and <strong>The</strong>odore F. Wempe. 1947. <strong>The</strong> Student’s Cantonese-English<br />
Dictionary. 3rd edition. New York: Field Afar Press. First edition 1935.<br />
Meyer, Dirk. 1998. “Dealing with Hong Kong Specific Characters.” Multilingual<br />
Computing & Technology 9, no. 3 (April), 35-38.<br />
———. 2000. “New Hong Kong Character Standard”. Multilingual Computing &<br />
Technology 11, no. 2 (March), 30-32.<br />
Morrison, Robert. 1828. A Vocabulary of the Canton Dialect. 3 parts. Macao: G.J.<br />
Steyn, and Brother.<br />
Norman, Jerry. 1988. Chinese. Cambridge: Cambridge <strong>University</strong> Press.<br />
O’Melia, Thomas A. 1959. First Year Cantonese. 4th edition. Hong Kong: Catholic<br />
Truth Society. First edition 1938.<br />
Pulleyblank, Edwin G. 1998. “Jiajie and Xiesheng”. In Alain Peyraube and Sun<br />
Chaofen, eds., Studies on Chinese Historical Syntax and Morphology:<br />
166
Linguistic Essays in Honor of Mei Tsu-lin. Paris: École des Hautes Études en<br />
Sciences Sociales, 145-164.<br />
Ramsey, S. Robert. 1987. <strong>The</strong> Languages of China. Princeton: Princeton <strong>University</strong><br />
Press.<br />
Rao Bingcai 饒秉才, Ouyang Jueya 歐陽覺亞, and Zhou Wuji 周無忌, eds. 1996.<br />
Guangzhouhua fangyan cidian 廣州話方言詞典. Hong Kong: Shangwu 商務.<br />
Originally published in 1981.<br />
Snow, Donald Bruce. 1991. Written Cantonese and the Culture of Hong Kong: <strong>The</strong><br />
Growth of a Dialect Literature. Ph.D. dissertation. Bloomington, IN: Indiana<br />
<strong>University</strong>.<br />
Stedman, T. Lathrop. and K.P. Lee. 1888. A Chinese and English Phrase Book in the<br />
Canton Dialect. New York: William R. Jenkins.<br />
Summer Institute of Linguistics (SIL) International. 2001. Ethnologue: Languages of<br />
the World. 14th ed. Online edition at . “CHN”<br />
(Chinese, Mandarin) and “YUH” (Chinese, Yue) entries.<br />
Unicode Consortium. 2000. <strong>The</strong> Unicode Standard Version 3.0. Reading, MA:<br />
Addison-Wesley.<br />
Wieger, L. 1927. Chinese Characters. 2nd edition. Translated by L. Davrout. No<br />
place: Catholic Mission Press. Reprinted by New York: Paragon Book Reprint<br />
Corp. and Dover Publications, Inc.<br />
Wisner, O.F. 1906. Beginning Cantonese. Canton: China Baptist Publication<br />
Society.<br />
Williams, Samuel Wells. 1856. A Tonic Dictionary of the Chinese Language in the<br />
Canton Dialect. Canton: Office of the Chinese Repository.<br />
———, et al. 1909. A Syllabic Dictionary of the Chinese Language Arranged<br />
According to the Wu-Fang Yüan Yin [五方元音] and Alphabetically<br />
Rearranged According to the Romanization of Sir Thomas F. Wade.<br />
Tongzhou 通州: North China Union College. Originally published in 1874.<br />
Xu Shen 許慎. 100. Shuowen jiezi (fu jianzi) 說文解字〔附檢字〕. Reprinted by<br />
Hong Kong: Zhonghua 中華.<br />
Yin Binyong 尹斌庸 and John S. Rohsenow. 1994. Modern Chinese Characters.<br />
Beijing: Sinolingua.<br />
167
Yue-Hashimoto, Oi-kan 余靄芹. 1972. Studies in Yue Dialects 1: Phonology of<br />
Cantonese. London: Cambridge <strong>University</strong> Press.<br />
ZWGW Zhongguo wenzi gaige weiyuanhui 中國文字改革委員會. 1977. “Di er ci<br />
hanzi jianhua fang’an (cao’an) 第二次漢字簡化方案(草案)”. Renmin ribao<br />
人民日報 (Dec 20): 4.<br />
168
a 1 ‘ sentence-final particle’, 67<br />
aai 3 ‘ to yell’, 102<br />
baang 6 (of ham 6 baang 6 laang 6 ‘all’), 82<br />
bai 6 ‘ bad’, 73<br />
bou 1 ‘ to boil; kettle’, 108<br />
cheun 1 ‘ animal egg ‘46<br />
chi 1 ‘ to stick’, 96<br />
daat 3 ‘ spot’, 58<br />
dam 1 ‘ to prolong’, 116<br />
dam 2 ‘ to dump; to pound’, 116<br />
dam 3 ‘ to drop down’, 116<br />
dap 1 ‘ to hang down’, 116<br />
dap 6 ‘ to pound’, 79<br />
dau 3 ‘ den; nest ‘49<br />
dau 6 (of ngau 6 dau 6 ‘unwell; stupid’), 82<br />
dei 6 ‘ plural marker’, 67<br />
deng 3 ‘ to throw’, 97<br />
di 1 ‘ some’, 70<br />
dim 6 ‘ straight’, 61<br />
dou 6 ‘ ferry’, 121<br />
fan 3 ‘ to sleep’, 114<br />
ga 3 ‘ sentence-final particle’, 71<br />
gaat 6 (of gaat 6 jaat 6 ‘cockroach’), 66<br />
gaat 6 jaat 6 ‘cockroach’, 66<br />
gam 2 ‘ so (manner)’, 67<br />
gam 3 ‘ so (quantity)’, 67<br />
gat 1 ‘ to stab’, 58<br />
gat 6 ‘ to raise up; to limp’, 129<br />
gau 6 ‘ lump’, 73<br />
ge 3 ‘ genitive particle’, 64<br />
gip 1 ‘ bag’, 64<br />
go 2 ‘ that’, 72<br />
guk 6 ‘ to bake’, 114<br />
gwa 3 ‘ sentence-final particle’, 67<br />
gwaan 3 ‘ to fall down’, 119<br />
INDEX<br />
169
gwui 6 ‘ tired’, 100<br />
haai 4 ‘ coarse’, 67<br />
ham 6 (of ham 6 baang 6 laang 6 ‘all’), 82<br />
ham 6 baang 6 laang 6 ‘all’, 82<br />
hai 2 ‘ to be at’, 67<br />
hong 2 ‘ young hen’, 126<br />
jaat 6 (of gaat 6 jaat 6 ‘cockroach’), 66<br />
jo 2 ‘ perfective aspect marker’, 77<br />
kam 2 ‘ to cover’, 124<br />
kang 3 ‘ capable’, 101<br />
kat 1 ‘ card’, 63<br />
keui 5 ‘ he, she, it’, 120<br />
kwaak 1 ‘ loop; to loop’, 67<br />
la 3 ‘ sentence-final particle’, 77<br />
laai 1 ‘ last (child) ‘45<br />
laam 2 ‘ olive’, 50<br />
laam 3 ‘ to step over’, 99<br />
laan 1 ‘ to crawl’, 97<br />
laang 6 (of ham 6 baang 6 laang 6 ‘all’), 82<br />
laap 3 ‘ to gather together’, 100<br />
laat 6 ‘ row’, 128<br />
lai 4 ‘ to come’, 67<br />
lam 1 ‘ bud’, 114<br />
lam 6 ‘ to pile up’, 98<br />
lat 1 ‘ to lose; to get rid of’, 131<br />
lau 1 ‘ coat’, 119<br />
lei 6 ‘ tongue’, 116<br />
leu 1 ‘ to spit out’, 99<br />
lit 3 ‘ knot ‘49<br />
lo 2 ‘ to take’, 58<br />
lok 3 ‘ sentence-final particle’, 63<br />
long 2 ‘ to rinse’, 80<br />
lung 5 ‘ trunk’, 50<br />
m 4 ‘ not’, 61<br />
ma 1 ‘ twin ‘49<br />
mai 5 ‘ don’t’, 67<br />
mak 1 ‘ mark’, 65<br />
mang 1 ‘ to pull’, 105<br />
mat 1 ‘ what’, 61<br />
mau 1 ‘ to squat’, 103<br />
me 1 ‘ sentence-final particle’, 78<br />
me 1 ‘ to carry on the back ‘46<br />
me 2 ‘ crooked ‘49<br />
mit 1 ‘ to pinch; to tear’, 97<br />
miu 2 ‘ to purse the lips’, 65<br />
170
mo 1 ‘ slow’, 67<br />
mou 5 ‘ to not have ‘49<br />
na 1 ‘ scar’, 97<br />
na 2 ‘ female’, 101<br />
naat 3 ‘ to burn’, 58<br />
nam 2 ‘ to think’, 102<br />
nam 4 ‘ tender’, 58<br />
nan 2 ‘ to play with’, 102<br />
nap 6 ‘ sticky’, 50<br />
nau 1 ‘ angry’, 58<br />
ngaam 1 ‘ correct’, 67<br />
ngai 1 ‘ to beg’, 73<br />
ngak 1 ‘ to trick’, 77<br />
ngan 1 ‘ tiny ‘46<br />
ngan 3 ‘ to jiggle the feet’, 61<br />
ngap 1 ‘ to jabber’, 80<br />
nga5t 1 ‘ to cram’, 58<br />
ngau 6 (of ngau 6 dau 6 ‘unwell; stupid’), 82<br />
ngau 6 dau 6 ‘unwell; stupid’, 82<br />
ngok 6 ‘ to raise the head’, 58<br />
ngou 4 ‘ to shake’, 107<br />
ning 1 ‘ to carry; to bring’, 58<br />
nung 1 ‘ to scorch’, 123<br />
po 1 ‘ classifier for plants’, 52<br />
pok 1 ‘ blister ‘49<br />
saai 1 ‘ to waste’, 122<br />
saai 3 ‘ quantifying particle’, 75<br />
saang 2 ‘ to scour’, 114<br />
sit 6 ‘ to lose money’, 125<br />
sung 3 ‘ side dishes’, 121<br />
tam 3 ‘ to deceive’, 75<br />
tam 5 ‘ pit; cesspool’, 52<br />
tau 2 ‘ to rest’, 70<br />
ung 2 ‘ to push’, 61<br />
wan 2 ‘ to find’, 58<br />
wan 3 ‘ to confine’, 60<br />
wing 1 ‘ to throw away’, 104<br />
wo 5 ‘ sentence-final particle’, 81<br />
yaak 3 ‘ to eat’, 53<br />
yai 5 ‘ bad’, 63<br />
ye 5 ‘ thing’, 70<br />
yeun 6 ‘ animal liver’, 96<br />
yuk 1 ‘ to move’, 70<br />
171