Quantify site specific iodination using PSM level data
This Jupyter notebook quantifies iodination events on histidine (H) and tyrosine (Y) residues in peptide-spectrum match (PSM) data.
- Loads PSM data from a TSV file
- Filters and processes peptides with modifications
- Identifies and quantifies iodination on H/Y residues
- Calculates frequency and percentage of iodination per residue
- Python 3.x
- pandas
-
Prepare your PSM data
- Ensure you have a
psm.tsvfile with columns:Peptide,Modified Peptide,Protein Start,Protein End,Assigned Modifications,Entry Name.
- Ensure you have a
-
Set the file path
- Edit the
filepathvariable in the notebook to point to yourpsm.tsvfile:filepath = "/path/to/your/psm.tsv"
- Edit the
-
Run all cells
- Execute each cell in order. The notebook will:
- Load and clean the data
- Extract and process modifications
- Quantify iodination events
- Output frequency and percentage tables
- Execute each cell in order. The notebook will:
-
Customize protein entry
- To analyze a specific protein, change the
entry_nameargument in:Replaceiodo_quant(merge_psm, "MYG_HORSE")
"MYG_HORSE"with your protein of interest.
- To analyze a specific protein, change the
- Tables showing iodination frequency and percentage for H/Y residues in the selected protein.
# Set file path
filepath = "/Users/nithesh/Documents/Iodo_script/20240227_FPI_apomyoglobin_NvD_plus-minusHis-correctDB/2024_02_25_KB_NoFAIMS_MV_004_A2/psm.tsv"
# Run quantification
iodo_quant_results = iodo_quant(merge_psm, "MYG_HORSE")
total_pep_appearances(iodo_quant_results, merge_psm, psm_clean)