Identification of Relevant Protein Interactions with Partial Knowledge: A Complex Network and Deep Learning Approach

Pilar Ortiz-Vilchis, Jazmin Susana De-la-Cruz-García, Aldo Ramirez-Arellano

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Protein–protein interactions (PPIs) are the basis for understanding most cellular events in biological systems. Several experimental methods, e.g., biochemical, molecular, and genetic methods, have been used to identify protein–protein associations. However, some of them, such as mass spectrometry, are time-consuming and expensive. Machine learning (ML) techniques have been widely used to characterize PPIs, increasing the number of proteins analyzed simultaneously and optimizing time and resources for identifying and predicting protein–protein functional linkages. Previous ML approaches have focused on well-known networks or specific targets but not on identifying relevant proteins with partial or null knowledge of the interaction networks. The proposed approach aims to generate a relevant protein sequence based on bidirectional Long-Short Term Memory (LSTM) with partial knowledge of interactions. The general framework comprises conducting a scale-free and fractal complex network analysis. The outcome of these analyses is then used to fine-tune the fractal method for the vital protein extraction of PPI networks. The results show that several PPI networks are self-similar or fractal, but that both features cannot coexist. The generated protein sequences (by the bidirectional LSTM) also contain an average of 39.5% of proteins in the original sequence. The average length of the generated sequences was 17% of the original one. Finally, 95% of the generated sequences were true.

Original languageEnglish
Article number140
JournalBiology
Volume12
Issue number1
DOIs
StatePublished - Jan 2023

Keywords

  • bidirectional LSTM
  • complex networks
  • fractals
  • protein–protein interactions networks
  • scale-free

Fingerprint

Dive into the research topics of 'Identification of Relevant Protein Interactions with Partial Knowledge: A Complex Network and Deep Learning Approach'. Together they form a unique fingerprint.

Cite this