.. _query_lineage:
Step 1 - get taxonomic information
==================================
Given a species name or taxonomic ID, the query species lineage information is
extracted with the help of the `ete3 python toolkit `_
and the `NCBI taxonomy `_ (Huerta-Cepas et al., 2016).
This information is needed alongside with the taxonomic classifications for all species used in the OrthoFinder comparison.
.. note::
The NCBI taxonomy database needs to be downloaded using the package `ete3`.
A parsed version of it will be stored at your home directory: `~/.etetoolkit/taxa.sqlite`.
`oggmap` provides a command line function to download or upgrade the NCBI taxonomy database (see :ref:`cmd_ncbitax`).
How to get taxonomic information about your query species
---------------------------------------------------------
.. note::
The oggmap submodule `qlin `_
helps to get taxonomic information for you with the `qlin.get_qlin() `_ function.
The function will output the following list:
- query name `query_lineage[0]`
- query taxID `query_lineage[1]`
- query lineage `query_lineage[2]`
- query lineage dictionary `query_lineage[3]`
- query lineage zip `query_lineage[4]`
- query lineage names `query_lineage[5]`
- reverse query lineage `query_lineage[6]`
- query kingdom `query_lineage[7]`
Please download the notebooks from `here `_
or please click below to view the content.
.. toctree::
../notebooks/query_lineage