Tutorial

Database Introduction

Our research group established a Carbapenem-resistant Enterobacteriaceae (CRE) network to investigate the epidemiology of CRE in China starting from 2014.
There were 2 stages to this study: first, from 2012 to 2013, we collected 150 CRE isolates from 16 tertiary hospitals, and second, starting in 2014, we expanded the collection area.
The number of participating hospitals increased from 25 in 2014 to 65 in 2016.
Peking University People’s Hospital was the lead unit in this project and was responsible for the collection, identification, and sorting of isolates.

We downloaded 846 Klebsiella pneumoniae isolates form refseq database, did the kleborate type, and incorporated them into this database

After comparing with the strains in this database, we will output the virulence genes and antimicrobial genes carried by the strains closest to the isolate. However, since the isolate itself may carry different virulence or antimicrobial genes from the strains in the database, for example, a antimicrobial plasmid is obtained.

For the sake of insurance, this database will also compare the submitted data with the VFDB database and the resfinder database, and output the virulence genes and antimicrobial genes of the isolate sequence for comparison.
If it is a nosocomial outbreak, the evolution of the strain can be briefly viewed from this.

Webserver Introduction

The purpose of PathoTracker is to solve how to find the isolate closest to the newly sequencing data and timely detect the outbreak of the isolates in the large database.

If the sequencing samples to be detected in the same area over a period of time, by comparing with the PathoTracker, the results of the output neighboring isolates are similar (here, the closest isolates are basically the same with sample), the output virulence genes and antibiotic resistance genes are basically the same, and it can be considered that there may have an isolate outbreak.

At the same time, this service can also trace the source of samples and the geographical and phylogenetic evolution. In particular, PathoTracker incorporates not only isolates from China but also isolates submitted worldwide in public databases, at which point its possible to trace the evolution of isolates in both spatial as well as temporal dimensions.

PathoTracker relies on CRE network. For all the self-collected isolates, we measured the minimum inhibitory concentration (MIC) of the 23 antimicrobials by utilizing broth microdilution method.

The 23 antimicrobials were as follows:
        meropenem (MEM), imipenem (IMP), ertapenem (ETP), cefoxitin (FOX), cefepime (FEP),
        ceftazidime (CAZ) ,ceftazidime/avibactam (CAZ‑AVI), ceftazidime/clavulanic acid (CLC), cefotaxime (CTX),
        ceftriaxone (CRO), Cefoperazone/sulbactam (CSL), Piperacillin/tazobactam (TZP), amikacin (AMK), ciprofloxacin (CIP),
        levofloxacin (LVX), meromycin (MIN), colistin (COL), tigecycline (TGC), chloramphenicol (CHL),
        fosfomycin (FOS), aztreonam (ATM), aztreonam/avibactam (ATM-AVI), cefotaxime/clavulanic acid (CTC)

The advantage of this service is the ability to report genomic and susceptibility information for isolates close to the sample. Such that the clinic is capable for dramatically reducing the time required for pure culture, give patients timely precise medication, symptomatic treatment, rescue patients at risk, and restrain abuse of antibiotics, based on genomic information and self-collected susceptibility information from PathoTracker service report.

Example file format and email address

For preassembled partial, complete genomes, the input sequence must be in FASTA format.
    The allowed alphabet (not case sensitive) is the following:
        For DNA sequences: A C G T and N (unknown)

PathoTracker is only available for Klebsiella pneumoniae right now. More pathogens will be added..

Click Example sequence format to download sample file and view the format

Select_threshold


Please make sure you fill in the correct email address, the software will take some time to run, we will send the result file to your email.

Limit parameters for comparsion

All of the following parameters are the limits you wish to set when comparing with public database data
    You can choose threshold for minimum % identity (virulence genes)
    You can choose threshold for minimum % coverage (virulence genes)
    You can choose threshold for minimum % identity (antimicrobial genes)
    You can choose threshold for minimum % coverage (antimicrobial genes)

For each limitation, You can choose 100% 90% 80% 60% level

Select_threshold
Select_threshold

Submit data type

Please select the type of data to submit

For metagenomic data type:
     fasta format is support
    Only 1 fasta file can be uploaded

For unassembled TGS sequencing reads
        We recommend use "seqtk seq -A filename-kpn-nouniq.fq > filename.fasta" to get fasta
        Only 1 fasta file can be uploaded
If "Culture isolate (NGS)" is chosen:
        Only 1 assembled fasta file can be uploaded

Select_dadatype1 Select_dadatype2


For mNGS raw data (fastq), we will provide clean the data, remove its human reads in the future. For NGS raw data (fastq), we will provide clean the data, sequence assembly in the future.

We strongly recommend that for Metagenomic next-generation sequencing (TGS) data, it is better to have the reads of Klebsiella pneumoniae greater than about 1000. The results report will show how many reads Klebsiella pneumoniae has.
The more reads, the more accurate the results will be.

Result

The results of the PathoTracker will be sent to you by email.
Since it takes time to generate the result, please make sure you fill in your email address correctly so that we can send you the results.
The results of the PathoTracker will also be displayed on the results page.


The reads matching results will be sent to your email

PathoTracker_result1


The result of PathoTracker

PathoTracker_result2 PathoTracker_result3


The result of sequences comparing with resfinder database and VFDB database

PathoTracker_result4 PathoTracker_result5

Webserver Introduction

Species detection can achieve online species detection through Centrifuge .

Submission

The data type should be:
    Metagenomic next-generation sequencing Third generation sequencing(TGS) remove host background

Click Example sequence format to download sample file and view the format

Select_threshold


Please make sure you fill in the correct email address, the software will take some time to run, we will send the result file to your email.

Result

The results of the PathoTracker will be sent to you by email.
Since it takes time to generate the result, please make sure you fill in your email address correctly so that we can send you the results.
The results of the PathoTracker will also be displayed on the results page.

We clean the data, remove its human origin and detect species
The reads matching results will be sent to your email

PathoTracker_result1

Webserver Introduction

Phylogenetic tree enables uploading assembled sequencing data, constructing the core genome of these sequences through Roary , and building phylogenetic tree of the core genome through iqtree2 .

Isolate sequences will be annotated using Prokka
Roary will be used to calculate the core genome for phylogenetic analysis, which takes annotated assemblies in GFF3 format (produced by Prokka)
Iqtree2 will be used to infer phylogenetic trees by maximum likelihood in core genome alignment


Since the isolate genome is large magnitude, so it takes a long time to obtain the genome sequence for constructing the phylogenetic tree by conventional mapping and find SNP or MEGA, PathoTracker uses Roary to extract the core genome.

Example file format and email address

For preassembled partial, complete genomes, the input sequence must be in FASTA format.
    The allowed alphabet (not case sensitive) is the following:
        For DNA sequences: A C G T and N (unknown)

Click Example sequence format to download sample file and view the format

Please make sure you fill in the correct email address, the software will take some time to run, we will send the result file to your email.

Tree number

Phylogenetic trees require a minimum of 4 sequences, and this webserver currently can handle a maximum of 6 sequences

Submit data type

Please select all fasta files at the same time before uploading them

submit_data_tree

Tree result

The results of the phylogenetic tree construction will be sent to you by email.
Since it takes time to generate the phylogenetic tree, please make sure you fill in your email address correctly so that we can send you the results.
The results of the phylogenetic tree will also be displayed on the results page.

tree_result

Statement

We welcome you to join the CRE network. If you do not want to keep your data in the database after using this database, please tick "Do not keep data" when submitting the data, and we will delete the data after the data analysis is completed.

If you would like to keep your data and join us, please select yes

If you select yes, the city is mandatory

Select_save_data


We recommend that you use Chrome, Safari, Firefox, 360 browsers to successfully select your country. If you would like to join us but do not have one of these browsers, please do not hesitate to contact us by email, thank you.

Select_country