CoLFIS is a lexical database of written Italian, with the following features:
it is based on a balanced corpus of over 3 million words, reflecting the reading habits of the Italian population as inferred from ISTAT data;
the lexical data are fully lemmatized and part-of-speech annotated;
it provides a frequency lexicon/dictionary for both lemmas (“lemmario”) and forms (“formario”).
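To illustrate the difference between a lemma lexicon and a form lexicon, here is a minimal Python sketch that counts both from a small annotated sample. The token triples, the part-of-speech labels, and the function name `build_lexicons` are all hypothetical and chosen only for illustration; they do not reflect the actual CoLFIS file format or the procedure used to build it.

```python
from collections import Counter

# Hypothetical toy sample of (form, lemma, part-of-speech) triples;
# not taken from CoLFIS and not in the CoLFIS file format.
tokens = [
    ("gatti", "gatto", "N"),
    ("gatto", "gatto", "N"),
    ("mangia", "mangiare", "V"),
    ("mangiano", "mangiare", "V"),
    ("gatto", "gatto", "N"),
]

def build_lexicons(tokens):
    """Count occurrences per (form, POS) and per (lemma, POS)."""
    forms = Counter()
    lemmas = Counter()
    for form, lemma, pos in tokens:
        forms[(form.lower(), pos)] += 1    # one entry per inflected form
        lemmas[(lemma.lower(), pos)] += 1  # forms collapse onto their lemma
    return forms, lemmas

formario, lemmario = build_lexicons(tokens)

# Print each lemma with its frequency, most frequent first.
for (lemma, pos), freq in lemmario.most_common():
    print(lemma, pos, freq)
```

In this sketch the forms "gatti" and "gatto" are counted separately in the form lexicon but contribute to a single entry in the lemma lexicon.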
CoLFIS was developed by a research team including P.M. BERTINETTO (1), C. BURANI (2), A. LAUDANNA (3), L. MARCONI (4), D. RATTI (4), C. ROLANDO (4) and A.M. THORNTON (5), with the financial support of the CNR.
(1) Scuola Normale Superiore, Pisa
(2) Istituto di Scienze e Tecnologie della Cognizione, CNR, Roma
(3) Università di Salerno
(4) Istituto di Linguistica Computazionale, Unità Staccata di Genova, CNR, Genova
(5) Università de L'Aquila
Websites for download:
- Scuola Normale Superiore, Pisa: linguistica.sns.it/CoLFIS/CoLFIS_home.htm
- Istituto di Scienze e Tecnologie della Cognizione, Roma: www.istc.cnr.it/material/database/colfis/
- Istituto di Linguistica Computazionale, Genova: www.ge.ilc.cnr.it/strumenti.php
How to cite CoLFIS:
Bertinetto Pier Marco, Burani Cristina, Laudanna Alessandro, Marconi Lucia, Ratti Daniela, Rolando Claudia, Thornton Anna Maria. 2005. Corpus e Lessico di Frequenza dell'Italiano Scritto (CoLFIS). http://linguistica.sns.it/CoLFIS/CoLFIS_home.htm
(The website cited may be any of those listed above.)