The widespread adoption of machine learning has inadvertently amplified societal biases and discrimination, with many consequential decisions now influenced by data-driven systems. In this scenario, fair machine learning techniques have become a frontier for AI researchers and practitioners. Addressing fairness is intricate: one cannot rely solely on the data used to train models or on the metrics that assess them, since this data is often the primary source of bias, much as it is in settings with noisy data. This paper delves into the convergence of these two research domains, highlighting the similarities and differences between fairness and noise in machine learning. We introduce the Fair Transition Loss, a novel method for fair classification inspired by label noise robustness techniques. Traditional loss functions tend to ignore the distributions of sensitive features and their impact on outcomes. Our approach uses transition matrices to adjust predicted label probabilities based on this otherwise ignored information. The empirical evaluation indicates that this method outperforms many benchmarked approaches in a variety of scenarios and remains competitive when compared with prominent fair classification strategies.
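As a rough illustration of the transition-matrix idea, below is a minimal sketch in the spirit of forward loss correction from the label noise literature. The function name, the per-group matrices `T`, and how they would be estimated are our assumptions for illustration, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def transition_corrected_loss(logits, labels, groups, T):
    """Forward-corrected cross-entropy with one transition matrix per sensitive group.

    logits: (B, C) raw model outputs
    labels: (B,)   observed class indices
    groups: (B,)   sensitive-group indices
    T:      (G, C, C) row-stochastic matrices, where T[g, i, j] is an assumed
            P(observed label j | true label i, group g)
    """
    clean_probs = F.softmax(logits, dim=1)  # model's estimate of the true label
    # Map clean probabilities to observed-label probabilities via the group's matrix.
    noisy_probs = torch.bmm(clean_probs.unsqueeze(1), T[groups]).squeeze(1)
    return F.nll_loss(torch.log(noisy_probs + 1e-12), labels)
```

With every `T[g]` set to the identity matrix, this reduces to plain cross-entropy, so the correction only acts where the sensitive-feature distributions are modeled as distorting the labels.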
LLM
The Economic Implications of Large Language Model Selection on Earnings and Return on Investment: A Decision Theoretic Model
Geraldo Xexéo, Filipe Braida, Marcus Parreiras, and 1 more author
Selecting language models in business contexts requires careful analysis of the final financial benefits of the investment. However, academia and industry tend to analyze LLMs solely in terms of performance. This work introduces a framework to evaluate LLMs that focuses on the earnings and return on investment that should be taken into account in business decision-making. We use a decision-theoretic approach to compare the financial impact of different LLMs, considering variables such as the cost per token, the probability of success in the specific task, and the gains and losses associated with LLM use. The study reveals how the superior accuracy of more expensive models can, under certain conditions, justify a greater investment through more significant earnings, but not necessarily a larger RoI. This article provides a framework for companies looking to optimize their technology choices, ensuring that investment in cutting-edge technology aligns with strategic financial objectives. In addition, we discuss how changes in operational variables influence the economics of using LLMs, offering practical insights for enterprise settings; we find that the predicted gains and losses, together with the probabilities of success and failure, are the variables to which the models are most sensitive.
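The decision-theoretic comparison can be made concrete with a small calculation. Below is a minimal sketch with illustrative numbers of our own; the gain, loss, cost, and success-probability values are assumptions, not figures from the paper.

```python
def expected_earnings(p_success, gain, loss, cost_per_call):
    """Expected profit of one LLM call: weighted gain minus weighted loss minus cost."""
    return p_success * gain - (1.0 - p_success) * loss - cost_per_call

def roi(p_success, gain, loss, cost_per_call):
    """Return on investment: expected profit relative to the amount spent."""
    return expected_earnings(p_success, gain, loss, cost_per_call) / cost_per_call

# A cheaper, weaker model versus a pricier, stronger one (hypothetical numbers).
print(expected_earnings(p_success=0.80, gain=1.00, loss=0.50, cost_per_call=0.002))  # 0.698
print(expected_earnings(p_success=0.95, gain=1.00, loss=0.50, cost_per_call=0.020))  # 0.905
print(roi(0.80, 1.00, 0.50, 0.002))  # 349.0
print(roi(0.95, 1.00, 0.50, 0.020))  # 45.25
```

With these numbers the stronger model earns more per call (0.905 versus 0.698) yet returns far less per unit of cost, matching the abstract's point that higher earnings do not necessarily imply a larger RoI.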
2023
IA
An online platform for COVID-19 diagnostic screening using a machine learning algorithm
Erito Marques de Souza Filho, Rodrigo de Souza Tavares, Bruno José Dembogurski, and 6 more authors
Class noise consists of errors in the labeling of the class. It can negatively affect a model's performance, and this effect can vary with the chosen model. For this reason, studies have emerged that evaluate the natural resistance of machine learning models to class noise. It is therefore relevant to evaluate the natural resistance of artificial neural networks to class noise, given their importance to deep learning. The goal of this work is to carry out an experiment to assess the influence of class noise on artificial neural networks by training them on noisy datasets. The results showed that the complexity of a network can influence its resistance to class noise.
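The experiment hinges on training networks over deliberately corrupted labels. Below is a minimal sketch of symmetric class-noise injection, assuming uniform label flips; the abstract does not specify the exact noise model used.

```python
import numpy as np

def inject_class_noise(y, noise_rate, n_classes, seed=0):
    """Flip a fraction `noise_rate` of labels uniformly to a different class."""
    rng = np.random.default_rng(seed)
    y_noisy = np.asarray(y).copy()
    flip = rng.random(len(y_noisy)) < noise_rate       # which samples get corrupted
    for i in np.nonzero(flip)[0]:
        others = [c for c in range(n_classes) if c != y_noisy[i]]
        y_noisy[i] = rng.choice(others)                # never flip to the same class
    return y_noisy
```

Training the same architecture on `inject_class_noise(y, r, k)` for increasing rates `r` yields the accuracy-versus-noise curves that such a study compares across network complexities.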
Patent: Computer Program. Registration number: BR512022002678-3, registration date: 31/10/2022, registering institution: INPI - Instituto Nacional da Propriedade Industrial
2021
Recsys
Simulating real profiles for shilling attacks: A generative approach
Collaborative Filtering (CF) approaches are vulnerable to Shilling Attacks, in which malicious users or companies inject a large number of fake profiles into a system in order to manipulate its recommendations. One problem with current Shilling Attack models is that they commonly use straightforward statistical templates, producing profiles with rating patterns different from the actual system data, which facilitates their detection and requires a larger number of profiles to achieve their goals. To address this problem and create profiles closer to reality, we propose using a generative model, the Variational Autoencoder (VAE), to learn the original data distribution. With a VAE, it is possible to generate new profiles based on real data without explicitly copying their actual ratings. The generated profiles are converted into malicious profiles by adding a rating for the target item. We test our attack model on the MovieLens 100k data set and compare it to attack models from the literature. Our results indicate that our model outperforms all other models on a model-based CF system, especially at low attack sizes (from 3% to 5%). Also, an analysis comparing the profiles generated by our model and by other approaches shows that our model's rating patterns are very similar to those of real profiles, which may indicate that attacks mounted using our approach are less likely to be detected. Thus, we show that our attack model represents an advance in Shilling Attack models, since its superior results on model-based CF and its possible indistinguishability from real profiles make it useful as a baseline for testing detection techniques and other tasks in the Shilling Attack area.
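Below is a minimal sketch of the attack-mounting step, assuming a trained decoder handle `decode` (a hypothetical name) that maps latent samples to rows of predicted ratings; the VAE training itself is omitted.

```python
import numpy as np

def mount_shilling_attack(decode, latent_dim, n_profiles, target_item, r_max=5.0, seed=0):
    """Sample realistic-looking profiles from a VAE decoder, then push one item."""
    rng = np.random.default_rng(seed)
    z = rng.standard_normal((n_profiles, latent_dim))  # draws from the VAE prior
    profiles = decode(z)                               # (n_profiles, n_items) rating rows
    profiles[:, target_item] = r_max                   # inject the maximum target rating
    return profiles
```

Because the rating rows are sampled from a model of the real distribution rather than filled in from a statistical template, the injected profiles inherit the rating patterns of genuine users, which is the property the abstract credits for evading detection.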
2020
IA
Redes de bits como alternativa às redes neurais em problemas de aprendizado por reforço
Nickolas R Machado, Pedro CF Machado, Juliana MNS Zamith, and 3 more authors
In Anais da VI Escola Regional de Alto Desempenho do Rio de Janeiro, 2020
Artificial neural networks (ANNs) are widely used, for example in digital games, through intelligent agents that replicate human behavior. However, the computational cost of training them is usually high, forcing a choice between more processing or lower quality. This work therefore proposes the use of bit networks (BNs) as an alternative capable of maximizing processing while minimizing memory consumption. Compared with single-precision ANNs, the first results show a speedup of up to 91 times while using 32 times less memory.
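As a rough illustration of where those savings come from, below is a sketch of a single neuron operating on bits packed into one machine word; this shows the generic XNOR-and-popcount trick behind binary networks, which we assume here, rather than the paper's exact formulation.

```python
def bit_neuron(inputs: int, weights: int, n_bits: int, threshold: int) -> int:
    """One bit-network neuron over `n_bits` inputs packed into a single integer.

    XNOR counts how many input bits agree with the weight bits; the neuron
    fires when the agreement reaches the threshold. A 32-input neuron thus
    costs one XOR, one mask, and one popcount instead of 32 float multiplies,
    and its weights take 32 times less memory than single-precision floats.
    """
    agreement = ~(inputs ^ weights) & ((1 << n_bits) - 1)  # XNOR, masked to n_bits
    return int(bin(agreement).count("1") >= threshold)
```

For example, `bit_neuron(0b1011, 0b1001, n_bits=4, threshold=3)` returns 1, since three of the four bit positions agree.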
HD
Evidências, códigos e classificações: o ofício do historiador e o mundo digital
Alexandre Fortes, and Leandro Guimarães Marques Alvim
The article examines the impact of the global spread of digital technologies on the historian's craft. It starts from the analysis of the relationship between professional practice and the nature of historical knowledge formulated by some of the greatest historians of the twentieth century. It examines the social nature of language and its role in the constitution of historical evidence and sources, connecting this analysis with the technological advances of natural language processing. It discusses concepts from several branches of the social sciences that are relevant to understanding the development of human knowledge and the role of the encoding of information in the construction of narratives and in historical research. Finally, it presents an overview of the main methodologies in the field of artificial intelligence currently applied to historical research.
Patent: Computer Program. Registration number: BR512020002469-6, registration date: 13/05/2020, registering institution: INPI - Instituto Nacional da Propriedade Industrial
Patent: Computer Program. Registration number: BR512020002470-0, registration date: 19/04/2020, registering institution: INPI - Instituto Nacional da Propriedade Industrial
2019
IA
AMANDA: Semi-supervised density-based adaptive model for non-stationary data with extreme verification latency
Concept drift refers to a change over time in the relationship between input and output data distributions. A gradual concept drift, then, is a smooth and gradual change in these relations; it renders models obsolete and degrades prediction quality. There is also a challenging constraint: extreme verification latency in obtaining the true labels. In batch scenarios, state-of-the-art methods do not properly tackle these problems, whether because of high computational time, failure to select samples that represent the drift, or the need to tune several hyperparameters. Therefore, we propose AMANDA, a semi-supervised density-based adaptive model for non-stationary data. It has two variations: AMANDA-FCP, which selects a fixed number of samples, and AMANDA-DCP, which dynamically selects samples from the data. Our results indicate that these two variations outperform state-of-the-art methods on almost all synthetic and real datasets, with an improvement of up to 27.98% in average error. AMANDA-FCP improved the results under gradual concept drift even with a small amount of initial labeled data. Moreover, our results indicate that semi-supervised classifiers are improved when combined with our density-based methods. We therefore emphasize the importance of research directions based on this approach.
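A minimal sketch of the density-based selection at the heart of such methods, in the spirit of AMANDA-FCP's fixed selection size, is shown below; the `keep_fraction` and bandwidth values are assumptions, and the DCP variant would instead derive the fraction from the data.

```python
import numpy as np
from sklearn.neighbors import KernelDensity

def select_core_samples(X, keep_fraction=0.5, bandwidth=1.0):
    """Keep the densest fraction of samples as the core of the current concept."""
    log_density = KernelDensity(bandwidth=bandwidth).fit(X).score_samples(X)
    k = max(1, int(len(X) * keep_fraction))
    return X[np.argsort(log_density)[-k:]]      # highest-density samples survive
```

At each step, the retained core samples (with their predicted labels) seed the semi-supervised classifier for the next batch, letting the model track the drift without ever seeing new true labels.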
2018
IA
A Cognitive Architecture for Agent-Based Artificial Life Simulation
Ronaldo Vieira, Bruno Dembogurski, Leandro Alvim, and 1 more author
In Computational Science and Its Applications–ICCSA 2018: 18th International Conference, Melbourne, VIC, Australia, July 2-5, 2018, Proceedings, Part I, Jun 2018
The ability to simulate living beings that behave in a credible way is a fundamental aspect of digital games. This is due to their interdisciplinary character, which brings together different fields of knowledge to better understand biological life and its processes. In this context, the design of an intelligent agent is a hard task, as it involves a complex system with several interconnected components. In this work, a virtual mind architecture for intelligent agents is proposed that simulates the cognitive processes of an actual brain, in this case attention and memory, in order to reproduce behaviors similar to those of real living beings. A prototype is then presented in which the architecture is applied to agents representing virtual animals in a semantically modeled ecosystem, and a proof-of-concept experiment is conducted to demonstrate its effectiveness. In this experiment, the behavior of the virtual animals was consistent with reality, thus validating the architecture's ability to simulate living beings.
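As a loose illustration of an attention-plus-memory agent loop, here is a minimal sketch; the class shape, field names, and thresholds are our assumptions for illustration, not the paper's architecture.

```python
from collections import deque

class VirtualMind:
    """Toy agent mind: an attention filter feeding a bounded memory."""

    def __init__(self, memory_size=50, salience_threshold=0.5):
        self.memory = deque(maxlen=memory_size)        # bounded short-term memory
        self.salience_threshold = salience_threshold

    def attend(self, stimuli):
        # Attention: only stimuli salient enough get through to cognition.
        return [s for s in stimuli if s["salience"] >= self.salience_threshold]

    def step(self, stimuli):
        attended = self.attend(stimuli)
        self.memory.extend(attended)                   # memorize what was attended
        if attended:                                   # act on the most salient stimulus
            return max(attended, key=lambda s: s["salience"])["action"]
        return "wander"                                # default when nothing stands out
```

A virtual animal would call `step` once per simulation tick, with each stimulus a dict such as `{"salience": 0.8, "action": "flee"}`; richer behavior would come from letting the stored memories feed back into how salience is computed.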
IA
Density-Based Core Support Extraction for Non-stationary Environments with Extreme Verification Latency
Raul Sena Ferreira, Bruno Silva, Wendell Teixeira, and 2 more authors
In 2018 7th Brazilian Conference on Intelligent Systems (BRACIS), Jun 2018
Machine learning solutions usually assume that the training and test data have the same probability distribution, that is, that the data are stationary. However, in streaming scenarios the data distribution generally changes over time, that is, the data are non-stationary. The main challenge in such online environments is adapting the model to the constant drifts in the data distribution. Moreover, another important restriction can arise in online scenarios: extreme latency in verifying the labels. It is worth mentioning that the incremental drift assumption is that class distributions overlap at subsequent time steps. Hence, the core regions of the data distribution have significant overlap with the incoming data. Therefore, selecting samples from these core regions helps retain the most important instances, those that represent the new distribution. This selection is called core support extraction (CSE). Thus, we present a study of density-based algorithms applied in non-stationary environments. We compared KDE, GMM, and two variations of DBSCAN against standalone semi-supervised approaches. We validated these approaches on seventeen synthetic datasets and one real dataset, showing the strengths and weaknesses of these CSE methods across many metrics. We show that a semi-supervised classifier is improved by up to 68% on a real dataset when it is applied along with a density-based CSE algorithm. The results of KDE and GMM as CSE methods were close, but the KDE approach is more practical because it has fewer parameters.
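Mirroring the KDE sketch shown earlier, here is a minimal sketch of CSE with a GMM, the other main density model compared; keeping a fixed densest fraction is an assumption for illustration.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def gmm_core_support(X, keep_fraction=0.5, n_components=2):
    """Core support extraction: keep the samples the fitted GMM finds most likely."""
    gmm = GaussianMixture(n_components=n_components, random_state=0).fit(X)
    log_likelihood = gmm.score_samples(X)              # log p(x) under the mixture
    k = max(1, int(len(X) * keep_fraction))
    return X[np.argsort(log_likelihood)[-k:]]
```

The GMM variant trades KDE's single bandwidth for a component count plus covariance options, which illustrates why the abstract finds KDE the more practical choice.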
2017
Recsys
Autoencoders and recommender systems: COFILS approach
Collaborative Filtering to Supervised Learning (COFILS) transforms a Collaborative Filtering (CF) problem into a classical Supervised Learning (SL) problem. Applying COFILS reduces data sparsity and makes it possible to test a variety of SL algorithms rather than matrix decomposition methods. Its main steps are extraction, mapping, and prediction. First, a Singular Value Decomposition (SVD) generates a set of latent variables from the ratings matrix. Next, in the mapping phase, a new data set is generated in which each sample contains the latent variables of a user and of a rated item, together with a target that corresponds to the user's rating for that item. Finally, in the last phase, an SL algorithm is applied. One problem with COFILS is its dependency on SVD, which cannot extract non-linear features from the data and is not robust to noisy data. To address this problem, we propose replacing the SVD with a Stacked Denoising Autoencoder (SDA) in the first phase of COFILS. With an SDA, more useful and complex representations can be learned in a neural network with a local denoising criterion. We test our novel technique, namely Autoencoder COFILS (A-COFILS), on the MovieLens, R3 Yahoo! Music, and Movie Tweetings data sets and compare it to COFILS, as a baseline, and to state-of-the-art CF techniques. Our results indicate that A-COFILS outperforms COFILS on all the data sets, with an improvement of up to 5.9%. Also, A-COFILS achieves the best result on the MovieLens 100k data set and ranks among the top three algorithms on these data sets. Thus, we show that our technique represents an advance in the COFILS methodology, improving its results and making it a suitable method for the CF problem.
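Below is a minimal sketch of the denoising building block that would replace the SVD in the extraction phase, assuming masking corruption and a single layer; stacking, training loops, and the paper's exact hyperparameters are omitted.

```python
import torch
import torch.nn as nn

class DenoisingAutoencoder(nn.Module):
    """One layer of a stacked denoising autoencoder over user rating rows."""

    def __init__(self, n_items, n_latent, corruption=0.2):
        super().__init__()
        self.corruption = corruption
        self.encoder = nn.Linear(n_items, n_latent)
        self.decoder = nn.Linear(n_latent, n_items)

    def forward(self, x):
        # Local denoising criterion: corrupt the input with masking noise,
        # then learn to reconstruct the clean rating row from it.
        mask = (torch.rand_like(x) > self.corruption).float()
        z = torch.sigmoid(self.encoder(x * mask))  # latent features for the mapping phase
        return self.decoder(z), z
```

Training minimizes the reconstruction error of the first output against the uncorrupted row; the latent code `z` then plays the role that the SVD factors play in plain COFILS.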
2015
Recsys
Transforming collaborative filtering into supervised learning
Filipe Braida, Carlos E. Mello, Marden B. Pasinato, and 1 more author
Collaborative Filtering (CF) is a well-known approach for Recommender Systems (RS). This approach extrapolates rating predictions from ratings given by users on items, which are represented by a user-item matrix filled with a rating r_{i,j} given by a user i on an item j. Therefore, CF has been confined to this data structure, relying mostly on adaptations of supervised learning methods to deal with rating prediction and on matrix decomposition schemes to complete the unfilled positions of the rating matrix. Although there have been proposals to apply Machine Learning (ML) to RS, these works had to transform the rating matrix into the typical Supervised Learning (SL) data set, i.e., a set of pairwise tuples (x, y), where y is the corresponding class (the rating) of the instance x ∈ R^k. So far, the proposed transformations have been carefully crafted using domain information. However, in many applications this kind of information can be incomplete, uncertain, or stated in ways that are not machine-readable. Even when it is available, its usage can be very complex, requiring specialists to craft the transformation. In this context, this work proposes a domain-independent transformation from the rating matrix representation to a supervised learning dataset that enables SL methods to be fully explored in RS. In addition, our transformation is straightforward, in the sense that it is an automatic process that any layperson can perform, requiring no domain specialist. Our experiments show that our transformation, combined with SL methods, greatly outperforms classical CF methods.
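A minimal sketch of one such automatic transformation is shown below, assuming truncated-SVD latent factors and mean imputation of the missing entries; the paper's exact pipeline may differ.

```python
import numpy as np

def rating_matrix_to_sl_dataset(R, k=10):
    """Turn a rating matrix into a supervised-learning dataset (X, y).

    R: (n_users, n_items) array with np.nan marking the unfilled positions.
    Each rated (user, item) pair becomes one instance x in R^(2k) whose
    class y is the rating r_{i,j}.
    """
    filled = np.where(np.isnan(R), np.nanmean(R), R)     # naive imputation for the SVD
    U, s, Vt = np.linalg.svd(filled, full_matrices=False)
    P, Q = U[:, :k] * s[:k], Vt[:k].T                    # user and item latent factors
    users, items = np.nonzero(~np.isnan(R))              # only observed ratings
    X = np.hstack([P[users], Q[items]])
    y = R[users, items]
    return X, y
```

Any off-the-shelf classifier or regressor can then be fit on `(X, y)`, with no domain knowledge needed beyond the rating matrix itself.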
2013
Recsys
Group Recommender Systems: Exploring Underlying Information of the User Space
This work proposes a new methodology for the Group Recommendation problem. In this approach, we choose the Most Representative User (MRU) as the group medoid in a user-space projection, and then generate the recommendation list based on their preferences. We evaluate our proposal using the well-known MovieLens dataset, employing two different measures to assess the group recommendation strategies. The obtained results seem promising, and our strategy shows empirical robustness compared with baselines from the literature.
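A minimal sketch of the medoid step, assuming Euclidean distances in the projected user space, is given below; the projection itself and the downstream list generation are omitted.

```python
import numpy as np

def most_representative_user(U):
    """Return the index of the group medoid in a user-space projection.

    U: (n_members, k) coordinates of the group's members. The MRU is the
    member whose total distance to all other members is smallest.
    """
    pairwise = np.linalg.norm(U[:, None, :] - U[None, :, :], axis=-1)
    return int(np.argmin(pairwise.sum(axis=1)))
```

The recommendation list for the whole group is then the list produced for this single user, which makes the approach as cheap as ordinary single-user recommendation once the medoid is found.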