1 |
Cavnar, W. B. and Trenkle, J. M. N-gram-based text categorization. In Proceedings of SDAIR-
94, 3rd Annual Symposium on Document Analysis and Information Retrieval. pp. 161–175, 1994.
|
|
2 |
CPDOC. Dicionário histórico-biográfico da primeira república. imigração, 2000. Available
at: https://cpdoc.fgv.br/sites/default/files/verbetes/primeira-republica/IMIGRA%C3%
87%C3%83O.pdf. Accessed on June 4, 2025.
|
|
3 |
Heringer, R. Affirmative action policies in higher education in brazil: outcomes and future chal-
lenges. Social Sciences 13 (3): 132, 2024.
|
|
4 |
IBGE. Brasil: 500 anos de povoamento. IBGE, 2007.
|
|
5 |
IBGE. Características étnico-raciais da população : um estudo das categorias de classificação de cor
ou raça : 2008. IBGE, 2011.
|
|
6 |
Jauhiainen, T., Lui, M., Zampieri, M., Baldwin, T., and Lindén, K. Automatic language
identification in texts: A survey. Journal of Artificial Intelligence Research vol. 65, pp. 675–782,
2019.
|
|
7 |
Monasterio, L. M. Sobrenomes e ancestralidade no brasil. Tech. rep., Instituto de Pesquisa
Econômica Aplicada (Ipea), 2016.
|
|
8 |
Nelson, J. R. and Shekaramiz, M. Authorship verification via linear correlation methods of n-
gram and syntax metrics. In 2022 Intermountain Engineering, Technology and Computing (IETC).
IEEE, pp. 1–6, 2022.
|
|
9 |
Ribeiro, C. A. C. and Carvalhaes, F. Research on social stratification in brazil. Sociology
Compass 18 (9): e13266, 2024.
|
|
10 |
Schwartzmann, S. Fora de foco: diversidade e identidades étnicas no brasil. Novos Estudos CE-
BRAP vol. 55, pp. 83–96, 1999.
|
|
11 |
Tromp, E. and Pechenizkiy, M. Graph-based n-gram language identification on short texts. In
Proc. 20th Machine Learning conference of Belgium and The Netherlands. sn, pp. 27–34, 2011.
|
|
12 |
Vogel, J. and Tresner-Kirsch, D. Robust language identification in short, noisy texts: Improve-
ments to liga. In Proceedings of the 3rd international Workshop on Mining Ubiquitous and Social
Environments. pp. 43–50, 2012.
|
|