mgkit.mappings.pandas_map module¶
Module that contains mapping operations on pandas data structures
- 
mgkit.mappings.pandas_map.calc_coefficient_of_variation(dataframe)[source]¶
- Calculate coefficient of variation for a DataFrame. Uses formula from Wikipedia - The formula used is \(\left (1 + \frac {1}{4n} \right ) * c_{v}\) where \(c_{v} = \frac {s}{\bar{x}}\) 
- 
mgkit.mappings.pandas_map.concatenate_and_rename_tables(dataframes, roots)[source]¶
- Concatenates a list of - pandas.DataFrameinstances and renames the columns prepending a string to each column in each table from a list of prefixes.- Parameters
- dataframes (iterable) – iterable of DataFrame instances 
- roots (iterable) – list of prefixes to append to the column names of each DataFrame 
 
- Return DataFrame
- returns a DataFrame instance 
 - Todo - move to pandas_utils? 
 
- 
mgkit.mappings.pandas_map.group_dataframe_by_mapping(dataframe, mapping, root_taxon, name_dict=None)[source]¶
- Return a - pandas.DataFramefiltered by mapping and root taxon, the values for each column is averaged over all genes mapping to a category.
- 
mgkit.mappings.pandas_map.make_stat_table(dataframes, roots)[source]¶
- Produces a - pandas.DataFramethat summarise the supplied DataFrames. The stats include mean, stdev and coefficient of variation for each root taxon.- Parameters
- dataframes (iterable) – iterable of DataFrame instances 
- roots (iterable) – list of root taxa to which each table belongs 
 
- Return DataFrame
- returns a DataFrame instance