hu.MAP 3.0
Human Protein Complex Map
Download
Complex Map Files
- Protein Complex Map
- Description: Complexes generated from two-stage clustering of the fully integrated protein interaction network
- Format: HuMAP3_ID,ComplexConfidence,Uniprot_ACCs,genenames
- ComplexConfidence maps to complexes identified in 6 individual clusterings. 1=Extremely High, 2=Very High, 3=High, 4=Moderate High, 5=Medium High, 6=Medium
- Protein Interactions in Complexes with probability scores (Uniprot), (genename)
- Description: Protein pair predictions present in hu.MAP 3.0 complexes with the corresponding ML probability score.
- Format: protein_id [tab] protein_id [tab] score
- Protein Interaction Network with probability scores (Uniprot gzip)
- Description: All protein pair predictions with the corresponding ML probability score.
- Format: protein_id [tab] protein_id [tab] score
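The formats listed above can be read with the Python standard library. The following is a minimal sketch, not the authors' tooling. It assumes that the complex map file starts with the header row given in its Format line, that the members inside `Uniprot_ACCs` and `genenames` are separated by spaces, and that the tab-separated pair files have no header row; none of this is stated on this page, so check it against the downloaded files. The function names, identifiers, and scores are made up for illustration.

```python
import csv
import io


def read_complex_map(handle, max_confidence=3):
    """Yield complexes whose ComplexConfidence is <= max_confidence.

    Level 1 is the strongest confidence level and 6 the weakest.
    Assumes a header row and space-separated members (unverified).
    """
    reader = csv.DictReader(handle)
    for row in reader:
        confidence = int(row["ComplexConfidence"])
        if confidence <= max_confidence:
            yield {
                "id": row["HuMAP3_ID"],
                "confidence": confidence,
                "accessions": row["Uniprot_ACCs"].split(),
                "genes": row["genenames"].split(),
            }


def read_scored_pairs(handle, min_score=0.0):
    """Yield (protein_id, protein_id, score) with score >= min_score."""
    for line in handle:
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 3:
            continue  # skip blank or malformed lines
        score = float(fields[2])
        if score >= min_score:
            yield fields[0], fields[1], score


# Made-up rows that only mimic the documented layouts.
complex_text = (
    "HuMAP3_ID,ComplexConfidence,Uniprot_ACCs,genenames\n"
    "example_1,1,ACC_A ACC_B ACC_C,GENE_A GENE_B GENE_C\n"
    "example_2,5,ACC_D ACC_E,GENE_D GENE_E\n"
)
pair_text = "ACC_A\tACC_B\t0.97\nACC_D\tACC_E\t0.12\n"

strong = list(read_complex_map(io.StringIO(complex_text), max_confidence=3))
likely = list(read_scored_pairs(io.StringIO(pair_text), min_score=0.5))
```

With real data, replace the `io.StringIO` objects with open file handles; for the gzip-compressed network file, `gzip.open(path, "rt")` returns a handle that works the same way.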
Test and training data
- Train Complexes (Uniprot)
- Description: List of training complexes used in the protein complex discovery pipeline
- Format: protein_id, protein_id, protein_id ... (one complex per line)
- Test Complexes (Uniprot)
- Description: List of test complexes used in the protein complex discovery pipeline
- Format: protein_id, protein_id, protein_id ... (one complex per line)
- Train Positive PPIs (Uniprot)
- Description: List of train positive PPIs used in the protein complex discovery pipeline
- Format: protein_id, protein_id
- Train Negative PPIs (Uniprot)
- Description: List of train negative PPIs used in the protein complex discovery pipeline
- Format: protein_id, protein_id
- Test Positive PPIs (Uniprot)
- Description: List of test positive PPIs used in the protein complex discovery pipeline
- Format: protein_id, protein_id
- Test Negative PPIs (Uniprot)
- Description: List of test negative PPIs used in the protein complex discovery pipeline
- Format: protein_id, protein_id
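The train and test files above are plain lists of identifiers, one complex or one pair per line. The sketch below shows one way to load them. Because the Format lines do not make the exact delimiter certain, it splits on commas and whitespace alike, which is an assumption, and it stores each pair as an unordered `frozenset` so that `A, B` and `B, A` compare equal. All function names and identifiers are hypothetical.

```python
import re

# Accept commas, spaces, or tabs between identifiers (assumed, not documented).
_SEPARATOR = re.compile(r"[,\s]+")


def parse_members(line):
    """Split one line into its protein identifiers."""
    return [token for token in _SEPARATOR.split(line.strip()) if token]


def read_complexes(lines):
    """Return one frozenset of protein IDs per non-empty line."""
    complexes = []
    for line in lines:
        members = parse_members(line)
        if members:
            complexes.append(frozenset(members))
    return complexes


def read_pairs(lines):
    """Return a set of unordered protein pairs, one pair per line."""
    pairs = set()
    for line in lines:
        members = parse_members(line)
        if not members:
            continue  # blank line
        if len(members) != 2:
            raise ValueError(f"expected two identifiers, got: {line!r}")
        pairs.add(frozenset(members))
    return pairs


# Made-up identifiers for illustration only.
train_complexes = read_complexes(["ACC_A, ACC_B, ACC_C", "ACC_D ACC_E", ""])
train_positive = read_pairs(["ACC_A, ACC_B", "ACC_B ACC_A", "ACC_D,ACC_E"])
train_negative = read_pairs(["ACC_A, ACC_D"])

# A quick sanity check one might run: no pair is labelled both ways.
overlap = train_positive & train_negative
```

Storing pairs as sets makes membership tests and overlap checks between the positive and negative lists, or between train and test, a single set operation.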
Feature Matrix
- Feature Matrix (Uniprot gzip)
- Description: Table of features from integrated datasets for pairs of proteins (Uniprot ACCs). Also includes Weighted Matrix Model features
- Format: protein_id,protein_id,[features]
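A feature matrix of protein pairs can be large, so it may be preferable to stream the gzip file row by row rather than load it whole. The sketch below assumes a comma-separated file whose first row is a header and whose empty cells mean a missing value; neither is confirmed on this page. The file name, column names, and values in the demonstration are invented.

```python
import csv
import gzip
import os
import tempfile


def iter_feature_rows(path):
    """Stream (protein_id, protein_id, features) from a gzip-compressed CSV.

    features maps each column name after the first two to a float,
    or to None when the cell is empty.
    """
    with gzip.open(path, "rt", newline="") as handle:
        reader = csv.reader(handle)
        header = next(reader)  # assumed header row
        feature_names = header[2:]
        for row in reader:
            values = [float(cell) if cell else None for cell in row[2:]]
            yield row[0], row[1], dict(zip(feature_names, values))


# Demonstration on a tiny made-up file written to a temporary directory.
with tempfile.TemporaryDirectory() as tmp_dir:
    demo_path = os.path.join(tmp_dir, "features_demo.csv.gz")
    with gzip.open(demo_path, "wt", newline="") as out:
        out.write("id1,id2,feature_a,feature_b\n")
        out.write("ACC_A,ACC_B,0.5,\n")
        out.write("ACC_A,ACC_C,,2\n")
    rows = list(iter_feature_rows(demo_path))
```

Because `iter_feature_rows` is a generator, a caller can filter or aggregate pairs in a loop without holding the full table in memory.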
Code
License
- CC0 (+BY)
- Data associated with this website are free to download and share. They are governed by the Creative Commons Zero license, which means that they are part of the public domain and may be used for any purpose. If you make extensive use of data from this data set, please credit the authors and, when appropriate, the authors of the source data (see the About page for references).