New paper published in Nucleic Acids Research, Web Server Issue

Our latest service update article titled The EMBL-EBI search and sequence analysis tools APIs in 2019 has been published in Nucleic Acids Research, Web Server Issue.

In this paper, we describe the various enhancements made recently to EBI Saerch and Job Dispatcher services. Additionally, examples of integration between the resources are provided, which aim to demonstrate the ‘Software as a Service’ (SaaS) capabilities of the APIs.

Visual Abstract

For those not so familiar with Job Dispatcher and EBI Search, the abstract should give a good idea what these are about:

The EMBL-EBI provides free access to popular bioinformatics sequence analysis applications as well as to a full-featured text search engine with powerful cross-referencing and data retrieval capabilities. Access to these services is provided via user-friendly web interfaces and via established RESTful and SOAP Web Services APIs (https://www.ebi.ac.uk/seqdb/confluence/display/JDSAT/EMBL-EBI+Web+Services+APIs+-+Data+Retrieval). Both systems have been developed with the same core principles that allow them to integrate an ever-increasing volume of biological data, making them an integral part of many popular data resources provided at the EMBL-EBI. Here, we describe the latest improvements made to the frameworks which enhance the interconnectivity between public EMBL-EBI resources and ultimately enhance biological data discoverability, accessibility, interoperability and reusability.

New and updated bioinformatics tools available through Job Dispatcher in 2019. The OpenAPI user interface for these tools is available from: https://www.ebi.ac.uk/Tools/common/tools/help

Category Tools
Multiple Sequence Alignment Clustal Omega, Kalign, MAFFT, MUSCLE, T-Coffee, MView and WebPRANK
Pairwise Sequence Alignment Needle, Stretcher, Water, Matcher, LALIGN, and GeneWise
Phylogeny Analysis Simple Phylogeny
[Protein Functional]Analysis (https://www.ebi.ac.uk/Tools/pfa/) InterProScan 5, PfamScan, Phobius, Pratt, RADAR, HMMER3 phmmer and HMMER3 hmmscan
RNA Analysis Infernal cmscan and MapMi
Sequence Format Conversion Seqret and MView
Sequence Operation Seqcksum
[Sequence Similarity]Search (https://www.ebi.ac.uk/Tools/sss/) NCBI BLAST+, PSI-BLAST, FASTA, SSEARCH, FASTM/S/F, GGSEARCH, GLSEARCH, PSI-Search and PSI-Search2
Sequence Statistics SAPS, Pepinfo, Pepstats, Pepwindow, Cpgplot, Newcpgreport, Isochore, Dotmatcher, Dottup, Dotpath and Polydot
Sequence Translation Transeq, Sixpack, Backtranseq and Backtranambig

New and Updated Data resources available through Job Dispatcher in 2019.

Category Datasets
UniProtKB protein sequences UniProtKB, SwissProt, SwissProt Isoforms, TrEMBL, UniProtKB Taxonomic Subsets (13 subgroups, including: bacteria, archaea, eukaryota, etc.), Reference Proteomes, Representative Proteomes (15, 35, 55, 75), UniProt Reference (UniRef 50, 90 and 100), UniParc, Unimes and UniProtKB-PDB
Patent protein sequences EPO, JPO, KIPO, UPSPTO
Structures protein sequences PDBe and PSI structure targets
Protein families Pfam, TIGRFAM, Superfamily, Gene3D, PIRSF and TreeFam
Other protein sequences Enzyme Portal, IntAct, IPD-IMGT/HLA, IPD-KIR, IPD-MHC, MEROPS (MP, MPEP and MPRO), ChEMBL and Quest for Orthologs
ENA nucleotide sequences ENA sequence releases and updates for Coding, Non-coding, Barcode, Geospatial and others (10 subgroups, including: Expressed Sequence Tag, Genome Survey Sequence, etc.)
Ensembl Genomes sequences Genomes from Bacteria, Fungi, Plants, Metazoa and Protists
Structures of nucleotide sequences PDBe
Other nucleotide sequences IMGT/LIGM-DB, IMGT/HLA (CDS and genomic), IPD-KIR (CDS and genomic) and IPD-MHC (CDS and genomic)