Frequently Asked Questions:
Warning!
Please do not cite or publish anything using this database without the authors' permission.
1. The extended characters in this database require a modern webbrowser which understands the Unicode specification to be seen properly. Internet Explorer is not standards compliant, and we recommend the use of a real web-browser like Firefox Mozilla, Safari, Konqueror, or Opera.
2.There are likely to be transcription errors and the cognate sets are preliminary. We would welcome assistance from professional linguists to correct any mistakes, improve the cognacy judgments and to add further languages.
Contact us by email with any queries, comments or suggestions: Simon Greenhill
How do I cite this website?
To cite this database, please reference the following paper. You should also include the date that you obtained the data, as we are constantly improving and correcting the information contained here.
Greenhill, S.J., Blust. R, & Gray, R.D. (2008). The Austronesian Basic Vocabulary Database: From Bioinformatics to Lexomics. Evolutionary Bioinformatics, 4:271-283.You should also include the date that you obtained the data, as we are constantly improving and correcting the information contained here. If you have any questions, please contact Simon Greenhill.
What are these boxes everywhere?
The boxes show up when the fonts you have installed don't have the specific glyph (character) we're asking for. This problem generally occurs with combining characters, such as when we have a lower case character with a diacritic. We're looking at ways around this, but note that you can cut'n'paste the area into a unicode capable text editor and this will display the glyph correctly.
Example:
Common Missing Glyphs:
Glyph: | Languages: | Prevalence: |
---|---|---|
Pazeh, Kavalen, Yami, Saaroa, Palauan Batak | 232 | |
Kairiru, Lamogai (Mulakaino) | 59 | |
Puyuma | 43 | |
Bunun | 26 | |
Nguna | 18 | |
Bunun | 17 | |
Nias, Letinese | 12 | |
Canala | 2 |
The common missing glyphs, and the languages they appear in. Ranked according to number of instances in database.