Addressing Data Point Homogeneity and Annotation Challenges to Enhance Data‐Driven Approaches: The S. aureus NorA Efflux Pump Case Study

Astolfi, Andrea; Cernicchi, Giada; Primavera, Erika; Rocchi, Marco; Manfroni, Giuseppe; Sabatini, Stefano; Letizia Barreca, Maria
2025

Abstract

In this study, we analyzed publicly accessible data on the Staphylococcus aureus NorA protein, a well-known efflux pump involved in antimicrobial resistance. Our analysis revealed several inconsistencies in data annotation and significant problems with data homogeneity across datasets, both of which compromise the reliability of data-driven approaches aimed at identifying novel S. aureus NorA efflux pump inhibitors (EPIs). To address these challenges, we propose a standardized pipeline for experimental procedures and data annotation, designed to improve the consistency and quality of EPI datasets submitted to repositories and thereby increase the utility of publicly available datasets for the discovery of potential EPIs. With this framework, we aim to foster more reliable and reproducible research outcomes in drug discovery projects targeting NorA or other efflux pumps.

Use this identifier to cite or link to this document: https://hdl.handle.net/11391/1610876