Sourced from the CFDE-CC Documentation Wiki
This table must not be manually created. Users should skip this, and all other tables marked "Built by script" in this summary, preparing only the rest of their datapackage's TSV files (those marked "Prepared by submitter") for submission. Once the "Prepared by submitter" tables are ready, users should then use the C2M2 submission prep script to automatically proteinrate this table (and the other "Built by script" tables) using the information in the "Prepared by submitter" tables.
The protein.tsv table will have as many rows as the number of unique UniProt terms appearing in the protein
column of biosample_protein.tsv and collection_protein
Field | Field Description | Required? | Field Value Type | Extra Info |
---|---|---|---|---|
id | A UniProt Knowledgebase (UniProtKB) protein accession (AC) | Required | string | Example: Q6GZX4 |
name | The UniProt entry name (formerly "ID") of this protein | Required | string | Example: 001R_FRG3G |
description | A description of this protein | Optional | string | |
synonyms | A list of alternate names for this protein | Optional | array | |
organism | OPTIONAL: An NCBI Taxonomy Database ID identifying this protein's source organism (e.g. 'NCBI:txid9606') | string |