get_unknown_taxids
get_unknown_taxids(data, taxid_col='taxid')
Get a list of the taxids in a data frame that are not found in Lifemap data.
Parameters
| Name | Type | Description | Default |
|---|---|---|---|
| data | pl.DataFrame \| pd.DataFrame | Pandas or polars dataframe with original data. | required |
| taxid_col | str | Name of the column storing taxonomy ids. | 'taxid' |
Returns
| Name | Type | Description |
|---|---|---|
| | list | Missing taxids. |
See also
get_duplicated_taxids : function to get a list of duplicated taxids.
Examples
>>> from pylifemap import get_unknown_taxids
>>> import polars as pl
>>> d = pl.DataFrame({"taxid_values": [33154, 33090, 2, -14, 1], "value": [10, 5, 100, 1, 2]})
>>> get_unknown_taxids(d, taxid_col="taxid_values")
[-14, 1]
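Conceptually, the check amounts to a set difference between the taxids in the data and the taxids known to Lifemap. The sketch below illustrates that idea in plain Python so it runs without pylifemap or a Lifemap download. It is not the library's implementation: `unknown_taxids_sketch` is a hypothetical helper, `known_taxids` is a small made-up stand-in for the Lifemap reference data, and the ordering and de-duplication of the result are assumptions.

```python
def unknown_taxids_sketch(taxids, known_taxids):
    """Return taxids absent from known_taxids, in first-seen order, without repeats.

    Hypothetical helper for illustration; not part of pylifemap.
    """
    known = set(known_taxids)
    seen = set()
    missing = []
    for taxid in taxids:
        if taxid not in known and taxid not in seen:
            seen.add(taxid)
            missing.append(taxid)
    return missing


# Made-up reference set standing in for the Lifemap data (assumption).
known_taxids = {2, 33090, 33154}
taxids = [33154, 33090, 2, -14, 1]

missing = unknown_taxids_sketch(taxids, known_taxids)

# One possible follow-up: keep only the taxids that were found.
missing_set = set(missing)
kept = [taxid for taxid in taxids if taxid not in missing_set]
```

With the real function, the returned list could be used the same way, for instance to filter the unknown rows out of the data frame before passing it on.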