collection_protein.tsv

Sourced from the CFDE-CC Documentation Wiki

Association between a collection and a UniProt term

For collections with protein metadata, this table will have one row for each protein associated with each such collection.

All fields are required: this table can be empty (header-row only), but any non-header rows must leave no fields blank.

Some examples:

  • If you don't have any collections associated with proteins, this table should be left empty.
  • If you have exactly one protein associated with each collection, this table will have as many rows as collection.tsv.
  • If you have five proteins associated with each collection, this table will have five times as many rows as collection.tsv.
  • If some but not all of your collections are associated with one or more proteins, this table will contain one row for each protein assigned to each such collection (and the resulting row count will not have any obvious relationship to the number of rows in collection.tsv, which is both expected and fine in such a case).
FieldField DescriptionRequired?Field Value TypeExtra Info
collection_id_namespaceIdentifier namespace for this collectionRequiredstringThis will be the value of id_namespace in the row in collection.tsv corresponding to the collection referenced in this row. If your program has not registered multiple CFDE identifier namespaces, this will be exactly the same value for all rows.
collection_local_idThe ID of this collectionRequiredstringThis will be the value of local_id in the row in collection.tsv corresponding to the collection referenced in this row.
proteinA UniProt Knowledgebase (UniProtKB) protein accession (AC)RequiredstringExample: Q6GZX4

Return to C2M2 Documentation