SBBD

Paper Registration

Fill in your paper information

English Information

Authors
# Name
1 Michele Brandão (michele.brandao@dcc.ufmg.br)
2 Mariana Silva (mariana.santos@dcc.ufmg.br)
3 Gabriel Oliveira (gabrielpoliveira@dcc.ufmg.br)
4 Henrique Hott (henriquehott@dcc.ufmg.br)
5 Anisio Lacerda (anisio@dcc.ufmg.br)
6 Gisele Pappa (glpappa@dcc.ufmg.br)

Reference
# Reference
1 Albalawi, Y., Buckley, J., and Nikolov, N. S. (2021). Investigating the impact of pre-processing techniques and pre-trained word embeddings in detecting Arabic health information on social media. J. Big Data, 8(1):95.
2 Bambroo, P. and Awasthi, A. (2021). Legaldb: long distil-bert for legal document classification. In 2021 International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies (ICAECT), pages 1–4. IEEE.
3 Belém, F. M., Ganem, M., França, C., Carvalho, M., Laender, A. H. F., and Gonçalves, M. A. (2022). Reforço e delimitação contextual para reconhecimento de entidades e relações em documentos oficiais. In SBBD, pages 292–303. SBC.
4 Word2vec. Natural Language Engineering, 23(1):155–162.
5 Coelho, G. M., Ramos, A. C., de Sousa, J., Cavaliere, M., de Lima, M. J., Mangeth, A., Frajhof, I. Z., Cury, C., and Casanova, M. A. (2022). Text classification in the Brazilian legal domain. In ICEIS (1), pages 355–363.
6 de Araujo, P. H. L., de Almeida, A. P. G. S., Braz, F. A., da Silva, N. C., de Barros Vidal, F., and de Campos, T. E. (2023). Sequence-aware multimodal page classification of Brazilian legal documents. Int. J. Document Anal. Recognit., 26(1):33–49.
7 Kim, H.-Y. (2014). Statistical notes for clinical researchers: Nonparametric statistical methods: 2. nonparametric methods for comparing three or more groups and repeated measures. Restorative Dentistry & Endodontics, 39(4):329–332.
8 Lima, M., Silva, R., Lopes de Souza Mendes, F., R. de Carvalho, L., Araujo, A., and de Barros Vidal, F. (2020). Inferring about fraudulent collusion risk on Brazilian public works contracts in official texts using a Bi-LSTM approach. In Findings of the Association for Computational Linguistics, pages 1580–1588, Online. Association for Computational Linguistics.
9 Luz de Araujo, P. H., de Campos, T. E., Ataides Braz, F., and Correia da Silva, N. (2020). VICTOR: a dataset for Brazilian legal documents classification. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 1449–1458, Marseille, France. European Language Resources Association.
10 Noguti, M. Y., Vellasques, E., and Oliveira, L. S. (2020). Legal document classification: An application to law area prediction of petitions to public prosecution service. In 2020 International Joint Conference on Neural Networks, IJCNN 2020, Glasgow, United Kingdom, July 19-24, 2020, pages 1–8. IEEE.
11 Oliveira, G. P., Reis, A. P. G., Mendes, B. M. A., Bacha, C. A., Costa, L. L., Canguçu, G. L., Silva, M. O., Caetano, V., Brandão, M. A., Lacerda, A., and Pappa, G. L. (2022). Ferramentas open-source de qualidade de dados para licitações públicas: Uma análise comparativa. In SBBD, pages 116–127. SBC.
12 Pennington, J., Socher, R., and Manning, C. D. (2014). GloVe: Global vectors for word representation. In EMNLP, pages 1532–1543. ACL.
13 Poetsch, M., Correa, U. B., and de Freitas, L. A. (2019). A word embedding analysis towards ontology enrichment. Res. Comput. Sci., 148(11):153–164.
14 Silva, M. O., Paula, A. F., Oliveira, G. P., Vaz, I. A. D., Hott, H., Gomide, L. D., Reis, A. P. G., Mendes, B. M. A., Bacha, C. A., Costa, L. L., Brandão, M. A., Lacerda, A., and Pappa, G. L. (2022). LiPSet: Um conjunto de Dados com Documentos Rotulados de Licitações Públicas. In SBBD DSW, pages 13–24, Porto Alegre, RS, Brasil. SBC.
15 Souza Júnior, A. P., Cecilio, P., Viegas, F., Cunha, W., de Albergaria, E. T., and da Rocha, L. C. D. (2022). Evaluating topic modeling pre-processing pipelines for Portuguese texts. In WebMedia, pages 191–201. ACM.
16 Zhang, J., Li, Y., Tian, J., and Li, T. (2018). LSTM-CNN hybrid model for text classification. In 2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), pages 1675–1680. IEEE.