Main Menu
Login Form
| Sequence editors for biologists on Linux |
|
|
| Friday, 05 October 2007 09:10 | |||||||||||||||||||||||||||||||||||||||||||||||||||
I migrated a year ago from Windows to Ubuntu "Edgy Eft" Linux which quickly became my main operating system. It's there that I got the problem that each migrant gets at one point: How do I replace those specific softwares which I so badly need for work? I was well accustomed (addicted?) to Vector NTi (Developed by Invitrogen), the reference for DNA/Protein sequences management and edition softwares in the Windows environment. Here, I write about a software which is a good alternative to Vector NTi for basic needs, and works in the Linux, Windows and Mac OS environments, please let me introduce CLC Free Workbench. Vector NTi is a nice piece of software for biological sequences edition. A free version with basic functionalities is even proposed to students. This software only runs on Windows and was the last reason I was keeping Windows with a dual boot configuration. Of course, I don't have to stress too much how rebooting only for one specific program can be inefficient and annoying over time. A first workaround was offered by VMware Server, a virtualization software which is able to run Windows from its partition in Linux. This allowed me to just switch from one system to the other through a simple window switch... Sounds perfect?... Well, that was just no good! VMware has to boot and run Windows on the same resources as Linux resulting in a mutual slowdown, and giving a software (a wonderful software by the way, but programmed by humans nonetheless) direct write permissions to your raw partition is really hard for the nerves.... After several months of struggle, I digged a little further in an alternative solution which had not worked in some first trials because of my inexperience with Linux: CLC Free Workbench. This review is about this nice biological sequence editor developed by CLC Bio which has nearly all the functions I need: A nice and easy graphical interface, a simple database to manage the sequences of genes and plasmids and basic notions of molecular biology (ORF finding, restriction analysis, etc...). CLC Free Workbench 4.0.2CLC Free Workbench is a free and basic version of the CLC Combined Workbench by CLC Bio. It was developed in Java and therefore runs on either Windows, MacOS X or any Linux system on which the Java interpreter is installed. InstallationSun Java Development Kit installationCLC Free Workbench was developed in Java and may therefore be run on either Windows, MacOS X, or any Linux system on which the Java interpreter is installed. So first things first, one needs to install Sun's Java packages on his linux distribution (Here described for Kubuntu "Gutsy Gibbon") before attempting anything else. This is made easy with the following command in a console: $ sudo apt-get install sun-java6-jdk A quick check that the Sun Java platform is activated by default is recommended: $ sudo update-alternatives --config java If it is not, instructions are given to activate it. CLC Free Workbench 4.0.2 installationCLC Free Workbench may be downloaded on CLC Bio's website. The installer comes as a single .sh file which must be made executable before it may be launched as root to be granted with access to the system folders: $ chmod a+x CLCFreeWorkbench_4_0_2.sh A wizard guides the user through easy intallation steps. Accept the invitation to start CLC Free Workbench to accept the license (you are still root). One small trap: do not start right away to use the software as everything will be saved in root's folder and with root privileges. You may relaunch it later as a simple user through the installed shortcuts or through the command: $ clcfreewb4 FunctionsI'll detail here the CLC Free Workbench's functions for DNA sequences. It also manages RNA and protein sequences, but does not feature specific functions for those that are not proposed for DNA. LayoutDNA sequences may be displayed in two modes: a sequence and a map mode.
The software gives a nice look and feel and offers many display parameters to improve readability according to anyone's standards. My only disappointment was that linear DNA fragments may only be viewed as circular in map mode... Strange... Restriction analysesThe main tool for molecular biologists will certainly be restriction analysis... which CLC Free Workbench does definitely well! It comes with a preloaded and quite comprehensive database of restriction enzymes which may be sorted by supplier, overhang types, "palindromicity" and popularity. Custom lists may be saved to improve work efficiency. Restriction sites are shown in the display pane with their names and overhangs while one will find the total number of sites for each enzymes in the settings pane.
![]() Restriction analysis
I guess the only preferences missing are a conditional display depending on the number of sites present in the considered sequence and highlighting of unique cutters in the display pane. Alignements and philogenyAlignements and philogenic trees are two basic features for molecular biologists. CLC Free Workbench's main weakness might reside here. Only simple tools are offered with only few options. The parameters for alignements are the traditional "gap opening" and "gap extension" and "gap closing" costs with two levels of accuracy. The user may choose neither a specific algorithm nor a specific score matrix. In conclusion, CLC Free Workbench's alignement tool and its depending phylogenic tree builber suffice for basic analyses, but might not be suitable for more careful bioinformatic analyses.
ORF findingThe open reading frame (ORF) finding tool is also an essential. It is included in CLC Free Workbench and its output may be displayed in both layouts and saved in the "annotations" list of each sequence. The algorithm was updated in version 4.0.2 to correct a bug in ORF finding for circular DNA sequences. ConclusionI've now been working with CLC Free Workbench for six months and I am still happy with it. Although it features relatively basic functions, I found a software that was free and matched my needs for everyday's work as a molecular biologist. CLC Combined WorkbenchAs I mentioned above CLC Free Workbench is in fact a free teaser to advertise for CLC Bio's software suite, CLC Combined Workbench. Although I did not test it, the Combined Workbench contains functions for BLAST searches, primers design, chromatogram viewing, 3D molecular modeling and more... It seems really nice! I would definitely have purchased it if it was sold for a few hundred dollars. However the price is still prohibiting for me: 2250 USD for an academic license (4500 USD for a comercial license). The price may be lowered to 1500 USD (3000 USD for a commercial license) by purchasing only one of the three sets of tools (for DNA, RNA and protein sequences) which are comprised in CLC Combined Workbench... Well, anyway I need features from at least two of them... GeneiousTo my knowledge, no free comparable GUI biological sequence manager is available for Linux at present time. Note that I precise "free" as another Java-based software exists, I named Geneious (Developed by Biomatters Ltd). I was thinking at first of writing a comparison between the two softwares, but the free version of Geneious does not compare to CLC Free Workbench. It is merely a demo version with a 14 days trial which is not suitable for the basic needs of a biologist. This may come from the fact that the marketing strategies behind the two suites are somewhat different: While CLC Combined Workbench is relatively expensive, its free version is well equiped. On the other hand, the free Geneious version consists only of a demo, but the full software is relatively less expensive (249 USD for students, 495 USD for academics and 995 USD for commercials).
Only registered users can write comments!
Powered by !JoomlaComment 4.0alpha3
!joomlacomment 4.0 Copyright (C) 2009 Compojoom.com . All rights reserved."
|
|||||||||||||||||||||||||||||||||||||||||||||||||||
| Last Updated on Friday, 03 July 2009 21:47 | |||||||||||||||||||||||||||||||||||||||||||||||||||






