Uma Aplicacão para Explicabilidade de Predições de um SVM em Tweets de COVID-19

Ivo de Abreu Araújo; Renato Hidaka Torres; Nelson Cruz Sampaio Neto

Uma Aplicacão para Explicabilidade de Predições de um SVM em Tweets de COVID-19

Ivo de Abreu Araújo ^[1] ; Renato Hidaka Torres ^[2] ; Nelson Cruz Sampaio Neto ^[2]
1. [1] Universidade Federal do Sul e Sudeste do Pará
  
  Universidade Federal do Sul e Sudeste do Pará
  
  Brasil
2. [2] Universidad Federal de Pará
  
  Universidad Federal de Pará
  
  Brasil
Localización: RISTI: Revista Ibérica de Sistemas e Tecnologias de Informação, ISSN-e 1646-9895, Nº. 54, 2024, págs. 121-136
Idioma: portugués
DOI: 10.17013/risti.54.121-137
Enlaces
- Texto completo
Resumen
- português
  Este trabalho propõe uma aplicação web que usa um modelo de caixa preta Support Vector Machine (SVM) com 79% de acurácia para classificar o sentimento de tweets sobre a COVID-19 integrando o framework LIME de forma interativa para explicar decisões sobre previsões. Além do ganho de transparência em relação à avaliação de amostras falso-positivas, notou-se também que o modelo SVM tende a falhar ao associar um teste positivo de COVID-19 a um bom sentimento e se confunde em previso˜es envolvendo palavras sobre a COVID-19, como Omicron, que indica falta de representatividade na base de dados. Além disso, a partir dos resultados do LIME, foi possível melhorar a acura´cia do modelo para 81% ao incluir as stopwords ”not” e ”no”.
- English
  The present work proposes a web application that uses an Support Vector Machine (SVM) black box model with 79% accuracy to classify sentiment from tweets about COVID-19 integrating the LIME framework in an interactive way to explain decisions about predictions. Besides the gain in transparency in relation to the evaluation of false positive samples, it was also noted that the SVM model tends to fail when associating a positive COVID-19 test with a good sentiment and gets confused in specific predictions involving words related to COVID-19 variants such as Omicron, which indicate lack of representativeness in the database. In addition, from the LIME results, it was possible to improve the model accuracy to 81% by including the stopwords “not” and “no”.
Referencias bibliográficas
- Aguayo, R., Lizarraga, C., López-Bojórquez, M., Quiñonez, Y., & Cabrera, A. (2022). Implementación de plan de contingencia ante la pandemia...
- Ahmad, I. (2020). 40 algorithms every programmer should know: Hone your problem-solving skills by learning different algorithms and their...
- Araújo, I. (2022a). Códigos de NLP. https://github.com/ivoaabreu/dissertacao-mestrado-ufpa-codigos-nlp
- Araújo, I. (2022b). Endereço da aplicação web. https://twitter-explainer.onrender.com
- Araújo, I. (2022c). Lista de stopwords. https://github.com/ivoaabreu/dissertacao-mestrado-ufpa-codigos-nlp-lime/blob/main/lista-stop-words.pdf
- Ayoub, J., Yang, X. J., & Zhou, F. (2021). Combat COVID-19 infodemic using explainable natural language processing models. Information...
- Behl, S., Rao, A., Aggarwal, S., Chadha, S., & Pannu, H. (2021). Twitter for disaster relief through sentiment analysis for COVID-19 and...
- Bonaccorso, G. (2017). Machine learning algorithms. Packt Publishing Ltd.
- Garcia, K., & Berton, L. (2021). Topic detection and sentiment analysis in Twitter content related to COVID-19 from Brazil and the USA....
- Glowacki, E. M., Wilcox, G. B., & Glowacki, J. B. (2021). Identifying addiction concerns on Twitter during the COVID-19 pandemic: A text...
- Hall, P., Gill, N., & Cox, B. (2021). Responsible machine learning.
- Hapke, H., Howard, C., & Lane, H. (2019). Natural Language Processing in Action: Understanding, Analyzing, and Generating Text with Python....
- Kelleher, J. D., Mac Namee, B., & D’Arcy, A. (2015). Fundamentals of Machine Learning for Predictive Data Analytics: Algorithms, Worked...
- Khan, R., Shrivastava, P., Kapoor, A., Tiwari, A., & Mittal, A. (2020). Social media analysis with AI: Sentiment analysis techniques for...
- Kolluri, N. L., & Murthy, D. (2021). Coverifi: A COVID-19 news verification system. Online Social Networks and Media, 22, 100123. https://doi.org/10.1016/j.osnem.2021.100123
- Kouvela, M., Dimitriadis, I., & Vakali, A. (2020). Bot-detective: An explainable Twitter bot detection service with crowdsourcing functionalities....
- Lad, R. (2020). Parkinson’s disease classification. Available at https://www.kaggle.com/richalad/parkinsons-predictions
- Liu, Z., Guo, Y., & Mahmud, J. (2021). When and why does a model fail? A human-in-the-loop error detection framework for sentiment analysis....
- López, M. P. V., de Freitas, P. O., & Vargas, S. M. L. (2023). A relação entre a inovação tecnológica e o desempenho nos meios de hospedagem...
- Lundberg, S. M., & Lee, S.-I. (2017). A unified approach to interpreting model predictions. In Advances in Neural Information Processing...
- Meena, R., & Bai, V. T. (2019). Study on machine learning based social media and sentiment analysis for medical data applications. In...
- Miglani, A. (2020). Coronavirus tweets NLP - text classification. Available at https://www.kaggle.com/sagarkhambad/text-classification/data
- Molnar, C. (2019). Interpretable Machine Learning: A Guide for Making Black Box Models Explainable. Leanpub.
- Nielsen, A. (2020). Practical Fairness. O’Reilly Media.
- Recuero, R. (2009). Redes sociais na internet, difusão de informação e jornalismo: elementos para discussão. Metamorfoses Jornalísticas, 2,...
- Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). “Why should I trust you?” Explaining the predictions of any classifier. In Proceedings...
- Rothman, D. (2020). Hands-On Explainable AI (XAI) with Python: Interpret, Visualize, Explain, and Integrate Reliable AI for Fair, Secure,...
- Shalev-Shwartz, S., & Ben-David, S. (2014). Understanding Machine Learning: From Theory to Algorithms. Cambridge University Press.
- Silva, H., Andrade, E., Araújo, D., & Dantas, J. (2021). Sentiment analysis of tweets related to SUS before and during the COVID-19 pandemic....
- Turek, M. (2021). Explainable Artificial Intelligence.
- Redondo, A. M. F., & Cárdenas, F. de J. N. (2022). DevOps: Un vistazo rápido. Ciencia Huasteca Boletín Científico de la Escuela Superior...

Mi Ágora

Selección

Opciones de artículo

Seleccionado

Opciones de compartir

Opciones de entorno

Sugerencia / Errata

Acceso de usuarios registrados

Uma Aplicacão para Explicabilidade de Predições de um SVM em Tweets de COVID-19

Universidade Federal do Sul e Sudeste do Pará

Universidad Federal de Pará

Mi Ágora

Opciones de artículo

Opciones de compartir

Opciones de entorno