In this study, we analyzed publicly accessible data related to the Staphylococcus aureus NorA protein, a well-known efflux pump involved in antimicrobial resistance. Our analysis revealed several inconsistencies in data annotation, and significant issues concerning the homogeneity across datasets, which compromise the reliability of data-driven approaches aimed at identifying novel Staphylococcus aureus NorA efflux pump inhibitors (EPIs). To address these challenges, we propose a standardized pipeline for experimental procedures and data annotation, designed to enhance the consistency and quality of EPI datasets submitted to repositories, thereby increasing the utility of publicly available datasets for the discovery of potential EPIs. By implementing this framework, the findings reported herein aim to foster more reliable and reproducible research outcomes in drug discovery projects targeting NorA or other efflux pumps.
Addressing Data Point Homogeneity and Annotation Challenges to Enhance Data‐Driven Approaches: The S. aureus NorA Efflux Pump Case Study
Astolfi, Andrea;Cernicchi, Giada;Primavera, Erika;Rocchi, Marco;Manfroni, Giuseppe;Sabatini, Stefano;Letizia Barreca, Maria
2025
Abstract
In this study, we analyzed publicly accessible data related to the Staphylococcus aureus NorA protein, a well-known efflux pump involved in antimicrobial resistance. Our analysis revealed several inconsistencies in data annotation, and significant issues concerning the homogeneity across datasets, which compromise the reliability of data-driven approaches aimed at identifying novel Staphylococcus aureus NorA efflux pump inhibitors (EPIs). To address these challenges, we propose a standardized pipeline for experimental procedures and data annotation, designed to enhance the consistency and quality of EPI datasets submitted to repositories, thereby increasing the utility of publicly available datasets for the discovery of potential EPIs. By implementing this framework, the findings reported herein aim to foster more reliable and reproducible research outcomes in drug discovery projects targeting NorA or other efflux pumps.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


