In this part of the homework, you will be using online resources and 3D visualization software to answer questions about proteins.
pilA protein nanowire from geobacter sulfurreducens [https://www.uniprot.org/uniprotkb/Q74D23/entry#structure]
3D Structure of pilA from Uniprot:

Briefly describe the protein you selected and why you selected it.
Identify the amino acid sequence of your protein.
How long is it? What is the most frequent amino acid? You can use this notebook to count most frequent amino acid - https://colab.research.google.com/drive/1vlAU_Y84lb04e4Nnaf1axU8nQA6_QBP1?usp=sharing
from collections import Counter
# Protein sequence provided by the user
protein_sequence = "MANYPHTPTQAAKRRKETLMLQKLRNRKGFTLIELLIVVAIIGILAAIAIPQFSAYRVKAYNSAASSDLRNLKTALESAFADDQTYPPES"
# Removing spaces for proper analysis
cleaned_sequence = protein_sequence.replace(" ", "")
# Count the frequency of each amino acid
amino_acid_count = Counter(cleaned_sequence)
# Find the most common amino acid
most_common_amino_acid = amino_acid_count.most_common(1)
print(f"The most common amino acid is: {most_common_amino_acid[0][0]}, which appears {most_common_amino_acid[0][1]} times.")
The most common amino acid is: A, which appears 14 times.How many protein sequence homologs are there for your protein?Hint: Use the pBLAST tool to search for homologs and ClustalOmega to align and visualize them. Tutorial Here

Does your protein belong to any protein family?
Identify the structure page of your protein in RCSB
Open the structure of your protein in any 3D molecule visualization software:
fetch 7TGG



PyMOL>cmd.show("spheres" ,"All")
PyMOL>cmd.show("sticks" ,"All")
PyMOL>cmd.set("sphere_scale", 0.3, "All")
PyMOL>cmd.set("stick_radius", 0.2, "All")



cmd.color("yellow", "resn ALA+VAL+LEU+ILE+MET+PHE+TRP+PRO") # Hydrophobic residues
cmd.color("blue", "resn ARG+LYS+HIS") # Positively charged (basic) residues
cmd.color("red", "resn ASP+GLU") # Negatively charged (acidic) residues
cmd.color("green", "resn ASN+GLN+SER+THR+TYR") # Polar, uncharged residues
cmd.color("cyan", "resn GLY") # Glycine (special case)
cmd.color("magenta", "resn CYS") # Cysteine (sulfur-containing)

<aside> <img src="/icons/snippet_lightgray.svg" alt="/icons/snippet_lightgray.svg" width="40px" /> Resources: https://colab.research.google.com/drive/1hXStRY9VCyw52n17uWdWQBj__IcR2ztK?usp=sharing#scrollTo=38gFJBazNdzJ
</aside>