Skip to main content

Public

Test Datasets

Abbreviation Description Link Notes
ECOLI-MC E. coli K-12, strain MG1655, standard genomic DNA prepared from culture browse Can be also found as "E. coli reference, lane 6" here and as "EMBL-EBI Sequence Read Archive, ascension number ERA000206" here.
ECOLI-SC E. coli K-12, strain MG1655, single cell MDA browse Can be also found as "E. coli first single cell MDA, lane 1" here.
SAR324 Deltaproteobacteria, single cell MDA, lane 1 browse Can be also found as "Deltaproteobacteria, single cell MDA, lane 1" here.

 

SPAdes Manual

SPAdes stands for St. Petersburg genome assembler. It is intended for both single cell and standard (multicell) assemblies. This manual will help you to install and run SPAdes.

We recommend to run SPAdes with pre-processing (error correction) and postprocessing (contig refinement) steps.

In our experience, the error correction tools BayesHammer and Quake work well for multicell datasets. However, for single cell datasets, we recommend BayesHammer rather than Quake; Quake was not designed for single-cell datasets, and produces inferior results. The performance of SPAdes on single-cell datasets deteriorates significantly without running BayesHammer. 

While SPAdes produces accurate assemblies, we recommend running NGS-Refine after SPAdes to further reduce the number of small errors (single nucleotide substitutions and small indels).

BayesHammer and NGS-Refine will be released after the papers describing these tools are accepted. Meanwhile, if you need to run these tools please contact SPAdes support (spades_support at spbau.ru).

 

Getting SPAdes
The latest version of the source code can be downloaded from here. The following code shows how to download and unpack the archived le directly from the command line.
wget http://bioinf.spbau.ru/sites/default/files/spades.zip
unzip spades.zip
 
Requirements: packages
The list of packages required for using SPAdes is given below. The command
sudo ./install_prerequirements
installs all these packages automatically if you are using apt (advanced package tool). If your operating
system does not support apt-get command you need to install the following packages manually.
 
package description recommended version
gcc++-4.4 GNU C Compiler 4.4
python2.6-dev Python 2.6
cmake make system  2.6
cmake-curses-gui curses based user interface for cmake 2.6
liblog4cxx10-dev logging library for C++  
libboost1.42-all-dev Boost C++ libraries 1.42
zlib-bin compression library  
 
Requirements: RAM
It is recommended to run SPAdes on a 64-bit linux system. E.g., on a multi-cell E. coli dataset SPAdes uses about 700Mb of RAM, while on a single-cell E. coli datset SPAdes needs about 6Gb of RAM.
 
Compiling
When all the required packages are installed just run
./prepare_cfg
in the root directory. This collects all dependencies and runs cmake.
 
Preparing input data
SPAdes requires paired end reads to be in separate les. Additionally, SPAdes can use unpaired reads that normally appear after discarding one read of the paired read during error correction step. Thus input reads should be arranged into four les: paired reads left parts, paired reads right parts, unpaired reads which originally were left parts, and unpaired reads which originally were right parts. The first two les should contain the same number of reads, while there are no requirements on the number of reads in the last two files (any of them can even be empty). Files are expected to be in fasta or fastq formats and can be compressed.
 
In fi le configs/debruijn/datasets.info add a new entry according to the following self-explaining pattern (recall that parts of lines starting from semicolon are comments). Note that this fi le may contain any number of such entries.
 

ECOLI_IS220_QUAKE

{

first             E.coli/s_6_1.fastq.gz ; paired left

second            E.coli/s_6_2.fastq.gz ; paired right

single_first      E.coli/s_6_1.single.fastq.gz ; unpaired left (optional)

single_second     E.coli/s_6_2.single.fastq.gz ; unpaired right (optional)

RL                100 ; read length

single_cell       false ; true if input data was obtained

                        ; with mda (single cell) technology

reference_genome  E.coli/MG1655-K12.fasta.gz ; optional

}

 

Note that you do not need to specify the insert size and its deviation as SPAdes computes them itself.

 

Running SPAdes

To run SPAdes type
./spades.py config.info
 
By default (i.e., if no confi g fi le is given) SPAdes uses the fi le spades_config.info. Running ./spades.py just after downloading and compiling it runs SPAdes on the test dataset (the rst 1Kb of E. coli) that is provided together with the source code of SPAdes.
Below we fi rst give an example of a con fig fi le and then explain its contents in detail.
 
iterative_K       21 33 55
paired_mode       true
dataset           ECOLI_IS220_QUAKE_1K
input_dir         ./data/input/
output_dir        ./data/debruijn/
measure_quality   true
output_to_console true
 
  • iterative_K allows to set several k-mer sizes. Informally, smaller values of k make graph more connected, but at the same time more tangled, while higher values of k may defragment the graph, but allow to resolve short repeats. See the paper for more details.
  • paired_mode turns on/o the repeat resolver.
  • dataset is the name of the dataset as it is given in configs/debruijn/datasets.info (see subsection 5.2).
  • input_dir is the directory where the corresponding dataset is stored.
  • output_dir is the output directory.
  • measure_quality flag allows to call quality estimation tool after the assembly is performed (the tool computes usual metrics like N50, genome coverage, number of misassemblies, etc).
  • output_to_console flag controls outputting log messages to the console.

 

Understanding the output

 

Results can be found in data/debruijn/DATASET_NAME/DATE_TIME. The speci c folder is given at the end of the log. Also, there is a folder containing statistics on di erent metrics (like N50) of the resulting contigs.
 
All the resulting information can be found here: ./data/debruijn/SAUREUS_JCVI_BH/build_02 .07_19 .05.56/
* Resulting contigs are called final_contigs.fasta
* Assessment of their quality is in quality_results
Thank you for using SPAdes!
== Assembling finished . Log can be found here :
./data/debruijn/SAUREUS_JCVI_BH/build_02 .07_19 .05.56/spades.log

Practical Info

  Санкт-Петербург  
  • Venue
    The conference is held in ...
     
  • Getting to ...
    Internet
    WiFi settings
  • Useful links
    Currency rate.
    Weather.
    Places of interest on Saint-Petersburg.com, on wikitravel.
     
  • Where to eat
    Below is an incomplete list of cafes and restaurants close to ....

    There are much more restaurants in the city center. There are lots of them between the subway stations Ploschad Vosstanyia, Nevskiy prospekt and Sennaya ploschad.

    Просмотреть Restraunts near PDMI на карте большего размера
    name address open hours notes
    Ideal'naya Chashka Vladimirsky Prospekt, 1 7:00-23:00 cafe
    Dostoevsky Vladimirsky prospect, 9   Russian and French cuisine
    Tarkhun Karavannaya, 14 12:00-23:00 Caucasian cuisine
    Chainaya lozhka Nevskiy, 44 09:00-22:00 cafe
    Elki-Palki Nevskiy, 88 24/7 Bistro

  • Transportation
    • Underground
      After you enter the St. Petersburg underground you can unlimitedly move inside, paying only once, on entrance. A single-trip token costs 25 rubles and goes into a special slot on a subway turnstile. Another option is a smart card with some preset number of trips; to pass, you should bring it to the white circle located on top of the turnstile. If you need more trips, you can extend your card in the ticket window at any time (25 rubles/trip, a little cheaper if you buy many trips).
      Please, do not take photos or record videos in the underground: it is prohibited and may cause trouble with the police. It is a good idea to stick to the right side of the moving stairway – those in a hurry come rushing down the left side. The doors are automatic, so in case of a crowd, watch out not to get your clothes stuck.
       
    • Municipal routes
      In municipal buses (numbers without letters T or K), trams, and trolley-buses you should pay a ticket-collector (he/she will most often wear a red arm-band and orange jacket). One ticket costs 21 rubles.
      NB: You do not need to punch the ticket.
      NB: This transport has only regular stops and you can not get out wherever you want, so do not try to find a stop-button or pull the emergency brake.
      NB: It does not matter which door you use to come in or out except buses without ticket-collector: in such situation you should pay a driver and leave through front door (you won't get mistaken because in these buses only the front door opens at first, and the other doors open when passengers have already left the bus).
       
    • Commercial routes
      In commercial buses (marked by letters T or K) you should also pay a ticket-collector, while in minibuses you pay the driver when you enter. At some (bigger) buses you pay the driver when you exit. The price depends on the line (usually 30 rubles), and it is always clearly marked on a label.
       
    • Taxi
      Taxi can be ordered by calling these numbers:
      +7 (812) 324 77 77
      +7 (812) 700 00 00
      +7 (812) 600 00 00
  • Phone calls
    You may get a temporary local mobile phone number for 200-300 roubles - you need only your passport for this. It can be done in any communication service, for instance there is one called "Евросеть" (EVROSET') near the Ploshad Muzhsestva subway station. As for the service providers, Megafon and MTS provide slightly better coverage (no difference in the city, but far-away suburbs may get better reception), while Beeline and TELE2 are cheaper.
     
  • Places of interest
    There are a lot of places of interest in St. Petersburg. Only a few examples are given below.

    Kazan Cathedral St Isaac's Cathedral The Peter and Paul Fortress
    The State Hermitage Museum The Mariinsky Opera and Ballet Theater The Bronze Horseman

  • The most important Don'ts
    Please,
    • do not drink unboiled water from the tap;
    • do not buy anything (especially alcohol) in the street;
    • do not exchange currency with private dealers;
    • do not walk alone late at the night in unpopulated streets;
    • do not cross the road in inappropriate places (the drivers are absolutely not polite) and be careful even when a crossing is marked.

Dates & Deadlines

Deadline for submissions: January 20, 2012 at 23:59 CET

Notification of acceptance: XXX

Camera-ready (?): XXX

Conference dates: XXX

Syndicate content