1. Introduction
Phylogeny.fr has been designed to provide a high performance platform that transparently chains programs relevant to phylogenetic analysis in a comprehensive and flexible pipeline. Although phylogenetic aficionados will be able to find most of their favorite tools and run sophisticated analyzes, the primary philosophy of Phylogeny.fr is to assist biologists with no experience in phylogeny in analyzing their data in a robust way.
"One Click" mode
Targets users that do not wish to deal with program and parameter selection. The pipeline is set up to run and connect programs recognized for their accuracy and speed.
Advanced mode
Proposes the succession of the same programs but users can choose the steps to perform (multiple sequence alignment, phylogenetic reconstruction, tree drawing) and customize options.
"A la carte" mode
Offers the possibility of running and testing more alignment and phylogeny programs such as MAFFT, Muscle, T-Coffee, RAxML, Iq-Tree, and others with complete control.
Alternatively, users can run each program independently in the Tools section.
For each mode and tool, an optional email address can be submitted to receive a notification when the analysis completes. This address is retained only for the duration of the job and deleted as soon as the job completion confirmation email has been sent. No sensitive data is therefore stored on the server.
2. Phylogeny Analysis
2.1 "One Click" mode
This is the "default" mode which proposes a pipeline already set up to run and connect programs recognized for their accuracy and speed to reconstruct a robust phylogenetic tree from a set of sequences:
- MAFFT has been chosen for rapid and robust multiple sequence alignment.
- ClipKit can optionally be used for alignment curation (trimming unreliable regions).
- FastTree (Fast mode) or IQ-TREE (Accurate mode) can be selected for phylogeny.
You have the ability to toggle the curation step (ClipKit) to eliminate poorly aligned positions and highly divergent regions before the tree inference step. At the end of the analysis, an interactive and exportable rendition of the generated tree will be available.
Default Parameters
For transparency, below is the exhaustive list of the parameters used by the tools behind the scenes during a One Click run:
| Parameter | Default Value |
|---|---|
| Mafft (Alignment) | |
| Alignment strategy | Default (FFT-NS-2) |
| ClipKit (Alignment Curation) | |
| Mode | Gappy (removes columns with high gap frequency) |
| Gaps threshold | 0.4 (40%) |
| FastTree (Fast Phylogeny) | |
| Evolutionary Model | Default (JTT+CAT for proteins, JC+CAT for DNA) |
| Branch Support | SH-like local supports |
| IQ-TREE (Accurate Phylogeny) | |
| Model Selection | Auto (ModelFinder) |
| Branch Support | Ultrafast Bootstrap (5000 replicates) |
| Burn-in iterations | 200 |
| PhyloTree Tool (Tree Utilities) | |
| Rooting | Midpoint Rooting |
2.2 Advanced mode
The Advanced mode follows the same robust pipeline philosophy as "One Click" mode, utilizing a predefined succession of programs (MAFFT, ClipKit, and IQ-TREE). However, it provides you with complete control over the parameters for each step of the analysis.
2.3 "A la carte" mode
The "A la carte" mode provides the ultimate flexibility for building your phylogenetic analysis pipeline. This mode allows you to handpick a tool for each step of the analysis, giving complete control over the methodology.
Two-Phase Workflow
The A la carte mode operates in two distinct phases:
- Pipeline Design: Select the tools to include in the analysis. A visual pipeline at the top of the page shows the current selection in real-time.
- Configuration & Launch: After confirming the tool selection, configure the settings for each tool individually, upload the input FASTA file, and launch the analysis.
How It Works
- Navigate to the A la carte page from the main menu.
- Click on the tool tiles to select your preferred tool for each step (Alignment, Curation, Phylogeny). Selected tools are highlighted with a checkmark.
- Click "Continue to Configuration" to proceed.
- Upload your FASTA file or paste your sequences directly.
- Configure each selected tool's parameters using the provided forms.
- Optionally enter your email to receive a notification when the analysis completes.
- Click "Submit Analysis" to start the pipeline.
3. BLAST Explorer
The BLAST Explorer is used to search for sequences similar to a given query within major biological databases. Beyond standard BLAST searches, our platform automatically builds a phylogenetic tree from the top hits, enabling you to explore the evolutionary relationships among homologous sequences interactively.
3.1 Submitting a query
To run a BLAST search, a single sequence in FASTA format has to be provided either as a file or by copying and pasting it directly into the text box.
BLAST Programs
Users have to select the appropriate program based on the query and target database types:
| Program | Query | Database | Use case |
|---|---|---|---|
| blastp | Protein | Protein | Find homologous proteins |
| blastn | Nucleotide | Nucleotide | Find similar DNA/RNA sequences |
| blastx | Nucleotide | Protein | Translate DNA and search proteins |
| tblastn | Protein | Nucleotide | Search protein against translated DNA |
Available Databases
A variety of sequence databases are available:
| Database | Type | Description |
|---|---|---|
| Protein Databases | ||
| nr_cluster | Protein | NCBI non-redundant protein sequences (clustered) |
| SwissProt | Protein | Curated, high-quality UniProt entries |
| UniRef90 | Protein | UniProt clustered at 90% identity |
| PDB | Protein | Protein Data Bank sequences with known 3D structures |
| RefSeq Viral Protein | Protein | NCBI curated viral protein sequences |
| Nucleotide Databases | ||
| Core Nucleotide | Nucleotide | NCBI core nucleotide collection |
| PDB | Nucleotide | PDB nucleotide sequences |
| RefSeq Viral Genomic | Nucleotide | NCBI curated viral genomic sequences |
E-value Threshold
The E-value (Expect value) represents the number of hits expected by chance. A lower E-value indicates a more significant match:
- 1.e-5 (default): Good balance between sensitivity and specificity.
- 1.e-10 to 1.e-30: More stringent, returns only highly confident hits.
- 0.1 to 1: More permissive, may include distant homologs.
Optional Clustering
MMseqs2 can be used to cluster all results. For instance, as the number of returned sequences is limited, this can avoid the over-representation of one kind of organism.
3.2 Interactive results
Once the BLAST job completes, an interactive visualization is displayed, combining a phylogenetic tree with a floating Toolbox panel dedicated to filtering and exporting results.
Phylogenetic Tree Display
The tree displays the query sequence alongside the top BLAST hits, making it possible to visualize evolutionary relationships at a glance:
- Query highlighting: The input sequence is visually emphasized in the tree.
- Taxonomy coloring: Sequences are colored according to taxonomic groups (with adjustable depth).
- Interactive selection: Tree leaves can be clicked directly to select or deselect sequences.
- External links: Sequence IDs provide access to the corresponding original database entries.
The Toolbox Panel
The floating Toolbox serves as the main control center for exploring BLAST results. It can be moved freely on the screen and provides two distinct modes accessible through tabs.
Tree View Tab
This tab focuses on the top hits displayed in the phylogenetic tree (typically the highest-scoring sequences). It is intended for precise, small-scale selection:
- Selection counter: Displays the number of currently selected sequences relative to the total number of tree leaves.
- Quick select buttons: Allows all tree sequences to be selected or deselected instantly.
- Bitscore filter pills: Filters sequences according to score ranges (<40, 40–50, 50–80, 80–200, >200) using color-coded filters.
- Taxonomy depth slider: Adjusts the displayed taxonomic classification level.
- Gap-free indicator: Displays the percentage of alignment positions without gaps for the current selection, providing an indicator of alignment quality.
All Hits Tab
This tab operates on all BLAST hits, limited to 500, not only those displayed in the tree. It is intended for large-scale filtering across the complete result set:
- E-value threshold: Dynamically filters hits according to E-value (from 1e-5 to 1e-100).
- Histogram filters: Interactive histograms allow sequence selection based on:
- Score (bitscore): Filtering by alignment score strength.
- Similarity (% identity): Filtering by sequence similarity percentage.
- Coverage (% query coverage): Filtering by query coverage percentage.
- Taxonomy filter: A hierarchical taxonomy tree enables selection of sequences belonging to specific taxonomic groups (for example, only Bacteria or only Vertebrates).
- Select in table: A selection tool based on the BLAST table results.
- Reset selection: Clears all filters and restores the initial state.
Exporting & Pipeline Integration
Both tabs provide export options for the selected sequences:
- Download FASTA: Export selected sequences as a FASTA file for external analysis.
- Send to Pipeline: Directly transfer selected sequences to the One Click or Advanced phylogeny pipelines. The sequences are automatically loaded into the analysis form.
Taxonomy Legend
Below the tree, a dynamic legend displays the taxonomic groups present in the results with their associated colors. The groups shown depend on the taxonomy depth setting in the Toolbox.
4. Tools & Versions
Below is the list of all bioinformatics tools integrated in the platform, along with their current version.
| Tool | Description | Version | Max Sequences |
|---|---|---|---|
| Blast | Basic Local Alignment Search Tool | 2.16.0 | 1 |
| MMseqs2 | Ultra-fast and sensitive sequence search and clustering | sse2 | — |
| Mafft | Multiple sequence alignment program for large datasets | 7.526 | 200 |
| Muscle | Fast and accurate multiple sequence alignment | 5.3 | 200 |
| Clustal Omega | Fast multiple sequence alignment | 1.2.4 | 200 |
| TCoffee | Advanced multiple sequence alignment program | 13.46.0.919e8c6b | 200 |
| GBlocks | Alignment curation — eliminates poorly aligned and divergent regions | 0.91b | 200 |
| ClipKit | Alignment curation using smart-gap trimming | 2.11.4 | 200 |
| BMGE | Alignment curation using gap trimming | 2.0 | 200 |
| FastTree | Approximately-maximum-likelihood phylogenetic tree inference | 2.2.0 | 200 |
| IQ-Tree | Maximum-likelihood phylogenetic inference with automatic model selection | 2.4.0 | 200 |
| RAxML-NG | Randomized Axelerated Maximum Likelihood phylogenetic inference | 2.0.0 | 200 |
| MrBayes | Bayesian inference of phylogenetic trees | 3.2.7a | 50 |
| PhyML | Maximum-likelihood phylogenetic inference | 3.3.20241207 | 50 |
| Fastphylo | Fast tools for phylogenetics — distance computation and neighbor-joining | 1.0.1 | — |
| BioNJ | Neighbor-joining phylogenetic inference | 1.0 | 50 |
New to the platform?
Take the interactive guided tour — 2 minutes to cover all the key features.