get_unknown_taxids

get_unknown_taxids(data, taxid_col='taxid')

Get a list of taxids from a data frame which are not in Lifemap data.

Parameters

Name Type Description Default
data pl.DataFrame | pd.DataFrame Pandas or polars dataframe with original data. required
taxid_col str Name of the column storing taxonomy ids, by default “taxid”. 'taxid'

Returns

Name Type Description
list Missing taxids

See also

get_duplicated_taxids : function to get a list of duplicated taxids.

Examples

>>> from pylifemap import get_unknown_taxids
>>> import polars as pl
>>> d = pl.DataFrame({"taxid_values": [33154, 33090, 2, -14, 1], "value": [10, 5, 100, 1, 2]})
>>> get_unknown_taxids(d, taxid_col="taxid_values")
[-14, 1]