Here are some useful links for the biocomputer scientist who likes efficient and fast programming language:
- Beginning Python for Bioinformatics
- Ruby Bioinformatics code
- Beginning Bioinformatics for Perl Programmers
As computing skills aren’t the sole competence required for Bioinformatics, I’d like recommend some more theoretical or pragmatic lectures, such as:
- M. Moorhouse and P. Barry. Bioinformatics, Biocomputing and Perl. Wiley, 2004.
- C. Gibas and P. Jambeck. Developing Bioinformatics Computer Skills. O’Reilly, 2001.
Additional material related to Moorhouse & Barry’s book can be found on the companion website. Vincent Zoonekynd also wrote some interesting notes, oriented toward the algorithmic approach to Bioinformatics. Two excellent books were also wrote about Perl and Bioinformatics (see also Tisdall’s article Beginning Bioinformatics), namely :
- J. Tisdall. Beginning Perl for Bioinformatics. O’Reilly, 2001.
- J. Tisdall. Mastering Perl for Bioinformatics. O’Reilly, 2003.
Working with such kind of data involves dedicated visualization techniques, in particular for viewing molecules or proteins in 3D. Fortunately, there are a lot of solutions, among which:
- RasMol and OpenRasMol, the standard toolkit for molecular graphics visualization
- emboss, an integrated package including several computational tools (sequence alignment, nucleotide sequence pattern analysis, etc.)
- PyMOL, a molecular visualization system
- CLC Combined Workbench, Mac OS X only (not free!), provides an integrated solution to popular analyses: assembly for DNA sequence data, molecular cloning, advanced RNA structure prediction and editing, integrated 3D molecular view, etc.
More general software or libraries can be used for that purpose, of course. For example, you can read M.J. Morton’s article, 3-D Data Visualization on Mac OS X, to get an idea of how the open source VTK software system might help in building large-scale project for complex data visualization. For a more exhaustive list of available software, please have a look at the Open Directory Project. However, a growing list of open source solutions can be found on the Bioconductor project webpage. Interestingly, it is interfaced with the open source R statistical software package.