1. Introduction

Phylogeny.fr has been designed to provide a high performance platform that transparently chains programs relevant to phylogenetic analysis in a comprehensive and flexible pipeline. Although phylogenetic aficionados will be able to find most of their favorite tools and run sophisticated analyzes, the primary philosophy of Phylogeny.fr is to assist biologists with no experience in phylogeny in analyzing their data in a robust way.

Note: The Phylogeny.fr platform offers a phylogeny pipeline which can be executed through three main modes, designed to fit your specific expertise level and analysis requirements.

"One Click" mode

Targets users that do not wish to deal with program and parameter selection. The pipeline is set up to run and connect programs recognized for their accuracy and speed.

Advanced mode

Proposes the succession of the same programs but users can choose the steps to perform (multiple sequence alignment, phylogenetic reconstruction, tree drawing) and customize options.

"A la carte" mode

Offers the possibility of running and testing more alignment and phylogeny programs such as MAFFT, Muscle, T-Coffee, RAxML, Iq-Tree, and others with complete control.

Alternatively, users can run each program independently in the Tools section.

For each mode and tool, an optional email address can be submitted to receive a notification when the analysis completes. This address is retained only for the duration of the job and deleted as soon as the job completion confirmation email has been sent. No sensitive data is therefore stored on the server.

2. Phylogeny Analysis

2.1 "One Click" mode

This is the "default" mode which proposes a pipeline already set up to run and connect programs recognized for their accuracy and speed to reconstruct a robust phylogenetic tree from a set of sequences:

MAFFT has been chosen for rapid and robust multiple sequence alignment.
ClipKit can optionally be used for alignment curation (trimming unreliable regions).
FastTree (Fast mode) or IQ-TREE (Accurate mode) can be selected for phylogeny.

You only have to upload a FASTA file or paste some sequences, toggle on the alignment curation if needed, and select an execution mode. The system will handle everything else for you, automatically formatting data between steps and applying appropriate parameters.

You have the ability to toggle the curation step (ClipKit) to eliminate poorly aligned positions and highly divergent regions before the tree inference step. At the end of the analysis, an interactive and exportable rendition of the generated tree will be available.

Default Parameters

For transparency, below is the exhaustive list of the parameters used by the tools behind the scenes during a One Click run:

Parameter	Default Value
Mafft (Alignment)
Alignment strategy	Default (FFT-NS-2)
ClipKit (Alignment Curation)
Mode	Gappy (removes columns with high gap frequency)
Gaps threshold	0.4 (40%)
FastTree (Fast Phylogeny)
Evolutionary Model	Default (JTT+CAT for proteins, JC+CAT for DNA)
Branch Support	SH-like local supports
IQ-TREE (Accurate Phylogeny)
Model Selection	Auto (ModelFinder)
Branch Support	Ultrafast Bootstrap (5000 replicates)
Burn-in iterations	200
PhyloTree Tool (Tree Utilities)
Rooting	Midpoint Rooting

2.2 Advanced mode

The Advanced mode follows the same robust pipeline philosophy as "One Click" mode, utilizing a predefined succession of programs (MAFFT, ClipKit, and IQ-TREE). However, it provides you with complete control over the parameters for each step of the analysis.

Tip: If you are unsure about a specific parameter, the Advanced mode forms provide descriptive tooltips for each option to help you make scientifically sound and appropriate choices.

2.3 "A la carte" mode

The "A la carte" mode provides the ultimate flexibility for building your phylogenetic analysis pipeline. This mode allows you to handpick a tool for each step of the analysis, giving complete control over the methodology.

Ideal if: you want to compare different tools, test specific methodologies, or have particular requirements that the standard pipelines do not cover.

Two-Phase Workflow

The A la carte mode operates in two distinct phases:

Pipeline Design: Select the tools to include in the analysis. A visual pipeline at the top of the page shows the current selection in real-time.
Configuration & Launch: After confirming the tool selection, configure the settings for each tool individually, upload the input FASTA file, and launch the analysis.

Flexible Selection: It is not required to select a tool for every step. For example, you can run only an alignment tool, or skip the curation step entirely. However, at least one tool must be selected to proceed.

How It Works

Navigate to the A la carte page from the main menu.
Click on the tool tiles to select your preferred tool for each step (Alignment, Curation, Phylogeny). Selected tools are highlighted with a checkmark.
Click "Continue to Configuration" to proceed.
Upload your FASTA file or paste your sequences directly.
Configure each selected tool's parameters using the provided forms.
Optionally enter your email to receive a notification when the analysis completes.
Click "Submit Analysis" to start the pipeline.

3. BLAST Explorer

The BLAST Explorer is used to search for sequences similar to a given query within major biological databases. Beyond standard BLAST searches, our platform automatically builds a phylogenetic tree from the top hits, enabling you to explore the evolutionary relationships among homologous sequences interactively.

Unique Feature: Unlike traditional BLAST services, Phylogeny.fr automatically aligns your query with a maximum of 100 best hits using Mafft and reconstructs a phylogenetic tree using FastTree, giving you instant evolutionary context.

3.1 Submitting a query

To run a BLAST search, a single sequence in FASTA format has to be provided either as a file or by copying and pasting it directly into the text box.

BLAST Programs

Users have to select the appropriate program based on the query and target database types:

Program	Query	Database	Use case
blastp	Protein	Protein	Find homologous proteins
blastn	Nucleotide	Nucleotide	Find similar DNA/RNA sequences
blastx	Nucleotide	Protein	Translate DNA and search proteins
tblastn	Protein	Nucleotide	Search protein against translated DNA

Available Databases

A variety of sequence databases are available:

Database	Type	Description
Protein Databases
nr_cluster	Protein	NCBI non-redundant protein sequences (clustered)
SwissProt	Protein	Curated, high-quality UniProt entries
UniRef90	Protein	UniProt clustered at 90% identity
PDB	Protein	Protein Data Bank sequences with known 3D structures
RefSeq Viral Protein	Protein	NCBI curated viral protein sequences
Nucleotide Databases
Core Nucleotide	Nucleotide	NCBI core nucleotide collection
PDB	Nucleotide	PDB nucleotide sequences
RefSeq Viral Genomic	Nucleotide	NCBI curated viral genomic sequences

E-value Threshold

The E-value (Expect value) represents the number of hits expected by chance. A lower E-value indicates a more significant match:

1.e-5 (default): Good balance between sensitivity and specificity.
1.e-10 to 1.e-30: More stringent, returns only highly confident hits.
0.1 to 1: More permissive, may include distant homologs.

Tip: If your initial search returns no hits, try increasing the E-value threshold or switching to a broader database like nr_cluster.

Optional Clustering

MMseqs2 can be used to cluster all results. For instance, as the number of returned sequences is limited, this can avoid the over-representation of one kind of organism.

3.2 Interactive results

Once the BLAST job completes, an interactive visualization is displayed, combining a phylogenetic tree with a floating Toolbox panel dedicated to filtering and exporting results.

Phylogenetic Tree Display

The tree displays the query sequence alongside the top BLAST hits, making it possible to visualize evolutionary relationships at a glance:

Query highlighting: The input sequence is visually emphasized in the tree.
Taxonomy coloring: Sequences are colored according to taxonomic groups (with adjustable depth).
Interactive selection: Tree leaves can be clicked directly to select or deselect sequences.
External links: Sequence IDs provide access to the corresponding original database entries.

The Toolbox Panel

The floating Toolbox serves as the main control center for exploring BLAST results. It can be moved freely on the screen and provides two distinct modes accessible through tabs.

Tree View Tab

This tab focuses on the top hits displayed in the phylogenetic tree (typically the highest-scoring sequences). It is intended for precise, small-scale selection:

Selection counter: Displays the number of currently selected sequences relative to the total number of tree leaves.
Quick select buttons: Allows all tree sequences to be selected or deselected instantly.
Bitscore filter pills: Filters sequences according to score ranges (<40, 40–50, 50–80, 80–200, >200) using color-coded filters.
Taxonomy depth slider: Adjusts the displayed taxonomic classification level.
Gap-free indicator: Displays the percentage of alignment positions without gaps for the current selection, providing an indicator of alignment quality.

All Hits Tab

This tab operates on all BLAST hits, limited to 500, not only those displayed in the tree. It is intended for large-scale filtering across the complete result set:

E-value threshold: Dynamically filters hits according to E-value (from 1e-5 to 1e-100).
Histogram filters: Interactive histograms allow sequence selection based on:
- Score (bitscore): Filtering by alignment score strength.
- Similarity (% identity): Filtering by sequence similarity percentage.
- Coverage (% query coverage): Filtering by query coverage percentage.
Taxonomy filter: A hierarchical taxonomy tree enables selection of sequences belonging to specific taxonomic groups (for example, only Bacteria or only Vertebrates).
Select in table: A selection tool based on the BLAST table results.
Reset selection: Clears all filters and restores the initial state.

Tip: Use the All Hits tab when you need to work with sequences not included in the tree visualization. The histogram filters let you identify and select sequences based on precise score ranges.

Exporting & Pipeline Integration

Both tabs provide export options for the selected sequences:

Download FASTA: Export selected sequences as a FASTA file for external analysis.
Send to Pipeline: Directly transfer selected sequences to the One Click or Advanced phylogeny pipelines. The sequences are automatically loaded into the analysis form.

Taxonomy Legend

Below the tree, a dynamic legend displays the taxonomic groups present in the results with their associated colors. The groups shown depend on the taxonomy depth setting in the Toolbox.

4. Tools & Versions

Below is the list of all bioinformatics tools integrated in the platform, along with their current version.

Tool	Description	Version	Max Sequences
Blast	Basic Local Alignment Search Tool	2.16.0	1
MMseqs2	Ultra-fast and sensitive sequence search and clustering	sse2	—
Mafft	Multiple sequence alignment program for large datasets	7.526	200
Muscle	Fast and accurate multiple sequence alignment	5.3	200
Clustal Omega	Fast multiple sequence alignment	1.2.4	200
TCoffee	Advanced multiple sequence alignment program	13.46.0.919e8c6b	200
GBlocks	Alignment curation — eliminates poorly aligned and divergent regions	0.91b	200
ClipKit	Alignment curation using smart-gap trimming	2.11.4	200
BMGE	Alignment curation using gap trimming	2.0	200
FastTree	Approximately-maximum-likelihood phylogenetic tree inference	2.2.0	200
IQ-Tree	Maximum-likelihood phylogenetic inference with automatic model selection	2.4.0	200
RAxML-NG	Randomized Axelerated Maximum Likelihood phylogenetic inference	2.0.0	200
MrBayes	Bayesian inference of phylogenetic trees	3.2.7a	50
PhyML	Maximum-likelihood phylogenetic inference	3.3.20241207	50
Fastphylo	Fast tools for phylogenetics — distance computation and neighbor-joining	1.0.1	—
BioNJ	Neighbor-joining phylogenetic inference	1.0	50

New to the platform?

Take the interactive guided tour — 2 minutes to cover all the key features.

phylogeny.fr