CSC 691: Bioinformatics (Introduction to bioinformatics algorithms)

Course Description

Bioinformatics is a rapidly evolving field that studies biological systems and biological data (such as DNA/protein sequences, macromolecular structures and functional genomics data) using analytic theory and practical tools of computer science, mathematics and statistics. The topics include concepts of molecular genetics, biological databases, database searching, sequence alignments, phylogenetic trees, structure prediction, and microarray data analysis.

Textbook

Introduction to Bioinformatics, 4th edition. Arthur M. Lesk. http://global.oup.com/uk/orc/xedition/leskbioinf4exe/

Course Schedule   (Tentative; slides will be uploaded and more topics will be added)

Project:  Assembling ORFs from metagenomic sequences

Task 1:  Data acquisition (Due March 21st, in-class presentation)

  • Find a metagenomic project of your choice. Prefer a project that includes annotation information, since the annotations will help with testing later.
  • Download the data and read in the files (FASTA or FASTQ). You may use your preferred language. Bioinformatics libraries and toolboxes exist for most major scripting and compiled languages.
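
As a starting point for reading the files, here is a minimal sketch in Python that uses only the standard library. The function names read_fasta and read_fastq are illustrative, not part of any course-provided code, and the FASTQ reader assumes the common four-lines-per-record layout (sequence and quality not wrapped across lines). A library parser is likely a better choice for real data.

```python
from typing import Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)  # sequence may span several lines
    if header is not None:
        yield header, "".join(chunks)


def read_fastq(lines: Iterable[str]) -> Iterator[Tuple[str, str, str]]:
    """Yield (header, sequence, quality) triples.

    Assumption: each record is exactly four lines (@header, sequence,
    +, quality), which is the usual layout for short-read data.
    """
    it = (line.strip() for line in lines)
    while True:
        head = next(it, None)
        if head is None:
            return  # end of input
        if not head:
            continue  # skip blank lines between records
        seq = next(it, "")
        plus = next(it, "")
        qual = next(it, "")
        if not head.startswith("@") or not plus.startswith("+"):
            raise ValueError("malformed FASTQ record near: " + head)
        if len(seq) != len(qual):
            raise ValueError("sequence/quality length mismatch in: " + head)
        yield head[1:], seq, qual
```

Both functions accept any iterable of lines, so an open file handle can be passed directly, for example read_fasta(open(path)) where path is the location of your downloaded file.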

Task 2:  ORF extraction. (Due April 4th)

  • Write a program that extracts complete and incomplete ORFs from the metagenomic reads in your project. You may use ORFExtractor or implement your own ORF extractor.
  • If you use ORFExtractor, you will need to modify the code so that it saves complete and incomplete ORFs separately.
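
If you implement your own extractor, the following Python sketch shows one possible approach. It is not ORFExtractor's method, and the definitions are assumptions made for illustration: a complete ORF runs from the first ATG after a stop codon (or after the read start) through the next in-frame stop codon (TAA, TAG, TGA); an incomplete ORF is one cut off by a read edge, so it lacks a start codon, a stop codon, or both. The names orfs_in_frame, extract_orfs, and min_codons are hypothetical.

```python
from typing import List, Tuple

STOPS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons
START = "ATG"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement (characters other than ACGT are kept)."""
    return seq.translate(_COMPLEMENT)[::-1]


def orfs_in_frame(seq: str, min_codons: int = 1) -> Tuple[List[str], List[str]]:
    """Return (complete, incomplete) ORFs read in frame 0 of seq.

    Trailing nucleotides that do not fill a codon are ignored.
    """
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    complete, incomplete = [], []
    segment = []          # codons seen since the last stop codon
    at_read_start = True  # True until the first stop codon is passed
    for codon in codons + [None]:  # None marks the end of the read
        if codon is not None and codon not in STOPS:
            segment.append(codon)
            continue
        has_stop = codon is not None
        start = segment.index(START) if START in segment else None
        if has_stop and start is not None:
            body, bucket = segment[start:], complete      # ATG ... stop
        elif start is not None:
            body, bucket = segment[start:], incomplete    # stop is missing
        elif at_read_start:
            body, bucket = segment, incomplete            # start is missing
        else:
            body, bucket = [], incomplete                 # no ORF here
        if body and len(body) >= min_codons:
            bucket.append("".join(body) + (codon or ""))
        segment = []
        at_read_start = False
    return complete, incomplete


def extract_orfs(read: str, min_codons: int = 1) -> Tuple[List[str], List[str]]:
    """Collect complete and incomplete ORFs over all six reading frames."""
    read = read.upper()
    complete, incomplete = [], []
    for strand in (read, reverse_complement(read)):
        for offset in range(3):
            c, i = orfs_in_frame(strand[offset:], min_codons)
            complete.extend(c)
            incomplete.extend(i)
    return complete, incomplete
```

Because extract_orfs returns the two groups as separate lists, each list can be written to its own output file. In practice min_codons would be set well above 1 to filter out short spurious ORFs; the right threshold depends on your read lengths.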

Task 3:  Assembling the ORFs

Task 4:  Testing and comparison

Task 5:  Documentation
 

Attached files:
Attachment                                        Size
ch02-genome-organization.ppt                      626.5 KB
ch04-archives-and-information-retrieval.ppt       943.5 KB
ch01-introduction.pptx                            2.26 MB
ch05-alignment-and-phlogenetictrees.ppt           2.67 MB