Introduction of LIVAC Synchronous Corpus


LIVAC ( is an uncommon synchronous Chinese corpus which rigorously, regularly and concurrently draws and analyzes data to determine linguistic and other developments in the printed Chinese media of major Chinese communities such as Hong Kong, Taiwan, Beijing, Shanghai, Singapore, and Macau. More than 570 million characters of news media texts have been processed and analyzed since 1995 and have yielded an expanding Pan-Chinese dictionary of 2 million words from the Pan-Chinese printed media. with useful quantitative information on their characteristic usage for qualitative analysis and applications in the IT field. They provide ready keywords or indices for LIVAC to offer KWIC search capabilities so that it can serve as a common and rigorous platform for meaningful synchronic and/or diachronic comparisons of cultural artifacts and societal issues in terms of locale, time, domain as well as for language engineering and information service provision in the Pan-Chinese context. By means of additional customization in search capabilities, LIVAC has benefited numerous public snd private organizations, including some involved with language engineering, information service provision, lexicography, or education. It was launched in 1995 by the Language Information Sciences Research Centre of the City University of Hong Kong, and nurtured in part by Chilin (HK) Ltd and its subsidiary, ChiLinStar Ltd in Zhuhai, China, under the aegis of the CityU Enterprise Ltd of Hong Kong. From 2010 to 2013, LIVAC was hosted by the Hong Kong Institute of Education's Research Centre on Linguistics and Language Information Sciences. From July 2013, LIVAC' has been exclusively hosted and maintained by Chilin (HK) Ltd. (Read more details)

For first-time visitors, data from 1995 to 1997 is available.

Try LIVAC now