Bioinformatics with Mac OS X


Doing Bioinformatics on a Mac OS X isn’t really difficult given the number of software that currently support this platform.

Here are some useful links for the biocomputer scientist who likes efficient and fast programming language:

The BioPerl project contains a lot of scripts, as well as the Biopython project.

As computing skills aren’t the sole competence required for Bioinformatics, I’d like recommend some more theoretical or pragmatic lectures, such as:

  • M. Moorhouse and P. Barry. Bioinformatics, Biocomputing and Perl. Wiley, 2004.
  • C. Gibas and P. Jambeck. Developing Bioinformatics Computer Skills. O’Reilly, 2001.

Additional material related to Moorhouse & Barry’s book can be found on the companion website. Vincent Zoonekynd also wrote some interesting notes, oriented toward the algorithmic approach to Bioinformatics. Two excellent books were also wrote about Perl and Bioinformatics (see also Tisdall’s article Beginning Bioinformatics), namely :

  • J. Tisdall. Beginning Perl for Bioinformatics. O’Reilly, 2001.
  • J. Tisdall. Mastering Perl for Bioinformatics. O’Reilly, 2003.

Working with such kind of data involves dedicated visualization techniques, in particular for viewing molecules or proteins in 3D. Fortunately, there are a lot of solutions, among which:

  • RasMol and OpenRasMol, the standard toolkit for molecular graphics visualization
  • emboss, an integrated package including several computational tools (sequence alignment, nucleotide sequence pattern analysis, etc.)
  • PyMOL, a molecular visualization system
  • CLC Combined Workbench, Mac OS X only (not free!), provides an integrated solution to popular analyses: assembly for DNA sequence data, molecular cloning, advanced RNA structure prediction and editing, integrated 3D molecular view, etc.

Finally, as you are also likely to do some Proteins or DNA sequence analysis, you will need additional tools like Fasta or BLAST.

More general software or libraries can be used for that purpose, of course. For example, you can read M.J. Morton’s article, 3-D Data Visualization on Mac OS X, to get an idea of how the open source VTK software system might help in building large-scale project for complex data visualization. For a more exhaustive list of available software, please have a look at the Open Directory Project. However, a growing list of open source solutions can be found on the Bioconductor project webpage. Interestingly, it is interfaced with the open source R statistical software package.


Articles with the same tag(s):

El Capitan
Why I am still using Emacs
Tmux and OS X
OS X Yosemite
Some useful Mac Apps for data scientists
Collecting email usage statistics from mu
From Beamer to Deckset
Fixing some critical keyboard shortcuts in OS X terminal
A modular configuration for Emacs
Common lisp on Mavericks