Skip to content
Snippets Groups Projects
Commit 6fd6a91f authored by Blaise Li's avatar Blaise Li
Browse files

Adding an example of usage of SeqIO.

parent ad4915a7
No related branches found
No related tags found
No related merge requests found
Pipeline #10532 passed
#!/usr/bin/env python3
"""Example of use of SeqIO from biopython
Here, we parse a fasta file, put the records in a list, search for a motif in
the sequence of the first record, and create a subsequence around this motif.
"""
from Bio import SeqIO
records = list(SeqIO.parse(fasta_filename, "fasta"))
records
# [SeqRecord(seq=Seq('GCCTCGGCCTCTGCATAAATAAAAAAAATTAGTCAGCCATGGGGCGGAGAATGG...GCG', SingleLetterAlphabet()), id='gi|965480|gb|J02400.1|SV4CG', name='gi|965480|gb|J02400.1|SV4CG', description='gi|965480|gb|J02400.1|SV4CG Simian virus 40 complete genome', dbxrefs=[])]
sequence = records[0].seq
sequence
# Seq('GCCTCGGCCTCTGCATAAATAAAAAAAATTAGTCAGCCATGGGGCGGAGAATGG...GCG', SingleLetterAlphabet())
motif = "TAAAT"
sequence.find(motif)
# 15
motif_pos = sequence.find(motif)
subseq_start = motif_pos - 10
subseq_end = motif_pos + len(motif) + 11
subseq = sequence[subseq_start:subseq_end]
subseq
# Seq('GGCCTCTGCATAAATAAAAAAAATTA', SingleLetterAlphabet())
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment