Conda is one of the package management tools in GNU-Linux and OS X with packages bundled in. On windows, it still has to catch up esp. in bioinformatics area. As many bioinformaticians/bioinformatics scientists work with GNU-Linux, dual boot systems are commonly used. For laptops, it is pain and power management in GNU-Linux is not optimized as much as in Windows. Windows 10 creators update allows the user to run GNU-Linux side by side without third party software which is kind of very interesting and useful for many people who use linux regularly.
For those who are very new to using linux on windows, first you need to enable windows subsystem for linux in windows 10. There are several online resources (blogs/websites) detailing how to activate windows subsystem for linux. Briefly, you need to do following things:
- You need to enable developer mode in settings ( Settings > Update and Security > For developers)
- Enable windows subsystem for linux (control panel > Programs and features > Turn windows features on or off)
- Run "lxrun /install". This would download ubuntu by default and installs the ubuntu linux in user folder (linux distribution would be installed to : C:\Users\<user-id>\AppData\Local\lxss). This also would ask for user account creation.
Above steps activate windows subsystem for linux (WSL) and install ubuntu. Now that ubuntu is installed (xenial as of 20th Jan, 2018), launch bash shell. Bash shell would be available as option in start menu ("Bash on Ubuntu on Windows"). Click on it. It would launch bash shell and try to run few commands such as ls, echo $PATH etc. Windows partitions are available on /mnt/. So user can access windows user desktop (path would like this: /mnt/c/Users/<user_id>/Desktop).
If every thing works, now install conda environment. Conda is different from Pip in that conda installs other packages as well, in addition to python packages. Conda is a big distribution. For installing bioinformatics you would not need entire installation. Instead download miniconda from here. Download linux version and for any serious bioinformatics work, I would suggest 64 bit installer and python version is 3.6. Now that miniconda installer is downloaded, launch bash shell (in windows) and navigate to the folder where installer is downloaded. Run following command: sh Miniconda3-latest-Linux-x86_64.sh. This should install , miniconda. Now that miniconda is installed, try running "$ which conda" (without $ prompt). This would print some thing like this: /home/<user_id>/.miniconda3/bin/conda. Now that conda is available, you need to add bioconda channel to conda.
Execute following lines (copy/pasted from :https://bioconda.github.io/) in following order:
- conda config --add channels r
- conda config --add channels defaults
- conda config --add channels conda-forge
- conda config --add channels bioconda
Now that bioconda channels are available, in conda, you can install almost all of the bioinformatics packages in bioconda, using conda. You can view all the packages from here: https://bioconda.github.io/recipes.html.
for installing any package, run: conda install <package name> (for eg. conda install bwa). This would resolve package dependencies and installs bwa with all the dependencies. If you want to know, if some package is available or not, use "conda search <package_name>" (for eg. conda search hisat2 command searches for hisat2 and prints out the results). But make sure bioconda channel is added to conda for searching bioconda packages.
Now that you have installed bioinformatics in ubuntu within windows, you have best of the both worlds.