Metadata-Version: 2.1
Name: multivis
Version: 0.5.5
Summary: MultiVis is a data visualisation package that produces both static and interactive visualisations targeted towards the Omics community.
Home-page: https://github.com/brettChapman/multivis
Author: Brett Chapman
Author-email: brett.chapman@murdoch.edu.au, brett.chapman78@gmail.com
License: MIT License
Platform: UNKNOWN
Requires-Python: >=3.5
Description-Content-Type: text/markdown
License-File: LICENSE.txt
Requires-Dist: numpy (==1.20.2)
Requires-Dist: openpyxl (==2.6.1)
Requires-Dist: pandas (==1.2.5)
Requires-Dist: matplotlib (==3.4.2)
Requires-Dist: seaborn (==0.11.1)
Requires-Dist: networkx (==2.4.0)
Requires-Dist: statsmodels (==0.12.2)
Requires-Dist: scikits-bootstrap (==1.0.1)
Requires-Dist: scipy (==1.6.3)
Requires-Dist: scikit-learn (==0.24.2)
Requires-Dist: tqdm (==4.36.1)
Requires-Dist: xlrd (==1.2.0)

# MultiVis
The MultiVis package contains the necessary tools for visualisation of multivariate data.

## Installation

### Dependencies
multivis requires:
- Python (>=3.5)
- NumPy (==1.20.2)
- OpenPyXL (==2.6.1)
- Pandas (==1.2.5)
- Matplotlib (==3.4.2)
- Seaborn (==0.11.1)
- Networkx (==2.4.0)
- statsmodels (==0.12.2)
- scikits-bootstrap (==1.0.1)
- SciPy (==1.6.3)
- Scikit-learn (==0.24.2)
- tqdm (==4.36.1)
- xlrd (==1.2.0)

### User installation
The recommend way to install multivis and dependencies is to using ``conda``:
```console
conda install -c brett.chapman multivis
```
or ``pip``:
```console
pip install multivis
```
Alternatively, to install directly from github:
```console
pip install https://github.com/brettChapman/multivis/archive/master.zip
```

### API
For further detail on the usage refer to the docstring.

#### multivis
- [Edge](https://github.com/brettChapman/multivis/blob/master/multivis/Edge.py): Builds nodes and edges and is the base class for the Network class.
	- [init_parameters](https://github.com/brettChapman/multivis/blob/master/multivis/Edge.py#L34-L51)
		- [peaktable] : Pandas dataframe containing peak data. Must contain 'Name' and 'Label'.
		- [datatable] : Pandas dataframe matrix containing scores.
		- [pvalues] : Pandas dataframe matrix containing score/similarity pvalues (if available, otherwise set to None).
	- [methods](https://github.com/brettChapman/multivis/blob/master/multivis/Edge.py#L53-L148)
		- [set_params] : Set parameters
			- [filter_type] : The value type to filter the data on (default: 'pvalue')
			- [hard_threshold] : Value to filter the data on (default: 0.005)
			- [withinBlocks] : Include scores within blocks if building multi-block network (default: False)
			- [sign] : The sign of the score/similarity to filter on ('pos', 'neg' or 'both') (default: 'both')
		
		- [help] : Print this help text
	
		- [build] : Builds the nodes and edges.
		- [getNodes] : Returns a Pandas dataframe of all nodes.
		- [getEdges] : Returns a Pandas dataframe of all edges.        	

- [Network](https://github.com/brettChapman/multivis/blob/master/multivis/Network.py): Builds nodes and edges, with added NetworkX functionality. Inherits from Edge.
	- [init_parameters](https://github.com/brettChapman/multivis/blob/master/multivis/Network.py#L33-L37)
		- [peaktable] : Pandas dataframe containing peak data. Must contain 'Name' and 'Label'.
		- [datatable] : Pandas dataframe matrix containing scores.
		- [pvalues] : Pandas dataframe matrix containing score/similarity pvalues.
	- [methods](https://github.com/brettChapman/multivis/blob/master/multivis/Network.py#L39-L62)
		- [set_params] : Set parameters
			- [filter_type] : The value type to filter the data on (default: 'pvalue')
			- [hard_threshold] : Value to filter the data on (default: 0.005)
			- [link_type] : The value type to represent links in the network (default: 'score')
			- [withinBlocks] : Include scores within blocks if building multi-block network (default: False)
			- [sign] : The sign of the score/similarity to filter on ('pos', 'neg' or 'both') (default: 'both')

		- [help] : Print this help text
					
		- [build] : Builds nodes, edges and NetworkX graph.
		- [getNetworkx] : Returns a NetworkX graph.
		- [getLinkType] : Returns the link type parameter used in building the network.

- [edgeBundle](https://github.com/brettChapman/multivis/blob/master/multivis/edgeBundle.py): Produces an interactive hierarchical edge bundle in D3.js, from nodes and edges.
	- [init_parameters](https://github.com/brettChapman/multivis/blob/master/multivis/edgeBundle.py#L51-56)
		- [nodes] : Pandas dataframe containing nodes generated from Edge.
		- [edges] : Pandas dataframe containing edges generated from Edge.
	- [methods](https://github.com/brettChapman/multivis/blob/master/multivis/edgeBundle.py#L58-L216)
		- [set_params] : Set parameters
			- [html_file] : Name to save the HTML file as (default: 'hEdgeBundle.html')
			- [innerRadiusOffset] : Sets the inner radius based on the offset value from the canvas width/diameter (default: 120)
			- [blockSeparation] : Value to set the distance between different segmented blocks (default: 1)
			- [linkFadeOpacity] : The link fade opacity when hovering over/clicking nodes (default: 0.05)
			- [mouseOver] : Setting to 'True' swaps from clicking to hovering over nodes to select them (default: True)
			- [fontSize] : The font size in pixels set for each node (default: 10)
			- [backgroundColor] : Set the background colour of the plot (default: 'white')
			- [foregroundColor] : Set the foreground colour of the plot (default: 'black')
			- [node_data] : Peak Table column names to include in the mouse over information (default: 'Name' and 'Label')			
			- [nodeColorScale] : The scale to use for colouring the nodes ("linear", "reverse_linear", "log", "reverse_log", "square", "reverse_square", "area", "reverse_area", "volume", "reverse_volume", "ordinal", "reverse_ordinal") (default: 'linear')
			- [node_color_column] : The Peak Table column to use for node colours (default: None sets to black)
			- [node_cmap] : Set the CMAP colour palette to use for colouring the nodes (default: 'brg')
			- [edgeColorScale] : The scale to use for colouring the edges, if edge_color_value is 'pvalue' ("linear", "reverse_linear", "log", "reverse_log", "square", "reverse_square", "area", "reverse_area", "volume", "reverse_volume", "ordinal", "reverse_ordinal") (default: 'linear')
			- [edge_color_value] : Set the values to colour the edges by. Either 'sign', 'score' or 'pvalue' (default: 'score')
			- [edge_cmap] : Set the CMAP colour palette to use for colouring the edges (default: 'brg')
			- [addArcs] : Setting to 'True' adds arcs around the edge bundle for each block (default: False)
			- [arcRadiusOffset] : Sets the arc radius offset from the inner radius (default: 20)
			- [extendArcAngle] : Sets the angle value to add to each end of the arc (default: 2)
			- [arc_cmap] : Set the CMAP colour palette to use for colouring the arcs (default: 'Set1')

		- [help] : Print this help text

		- [build] : Generates the JavaScript embedded HTML code, writes to a HTML file and opens it in a browser.
		- [buildDashboard] : Generates the JavaScript embedded HTML code in a dashboard format, writes to a HTML file and opens it in a browser.
		
- [plotNetwork](https://github.com/brettChapman/multivis/blob/master/multivis/plotNetwork.py): Produces a static spring-embedded network from a NetworkX graph.
	- [init_parameters](https://github.com/brettChapman/multivis/blob/master/multivis/plotNetwork.py#L47-L51)
		- [g] : NetworkX graph.
	- [methods](https://github.com/brettChapman/multivis/blob/master/multivis/plotNetwork.py#L53-L211)
		- [set_params] : Set parameters
			- [imageFileName] : The image file name to save to (default: 'networkPlot.jpg')
			- [edgeLabels] : Setting to 'True' labels all edges with the score/similarity value (default: True)
			- [saveImage] : Setting to 'True' will save the image to file (default: True)
			- [layout] : Set the NetworkX layout type ('circular', 'kamada_kawai', 'random', 'spring', 'spectral') (default: 'spring')
			- [transparent] : Setting to 'True' will make the background transparent (default: False)
			- [dpi] : The number of Dots Per Inch (DPI) for the image (default: 200)
			- [figSize] : The figure size as a tuple (width,height) (default: (30,20))
			- [node_cmap] : The CMAP colour palette to use for nodes (default: 'brg')
			- [colorScale] : The node colour scale to apply ("linear", "reverse_linear", "log", "reverse_log", "square", "reverse_square", "area", "reverse_area", "volume", "reverse_volume", "ordinal", "reverse_ordinal") (default: 'linear')
			- [node_color_column] : The Peak Table column to use for node colours (default: None sets to black)
			- [sizeScale] : The node size scale to apply ("linear", "reverse_linear", "log", "reverse_log", "square", "reverse_square", "area", "reverse_area", "volume", "reverse_volume", "ordinal", "reverse_ordinal") (default: 'reverse_linear')
			- [size_range] : The node size scale range to apply. Tuple of length 2. Minimum size to maximum size (default: (150,2000))
			- [sizing_column] : The node sizing column to use (default: sizes all nodes to 1)	
			- [alpha] :  Node opacity value (default: 0.5)
			- [nodeLabels] : Setting to 'True' will label the nodes (default: True)
			- [fontSize] : The font size set for each node (default: 15)
			- [keepSingletons] : Setting to 'True' will keep any single nodes not connected by edges in the NetworkX graph (default: True)
			- [column] : Column from Peak Table to filter on (default: no filtering)
			- [threshold] : Value to filter on (default: no filtering)
			- [operator] : The comparison operator to use when filtering (default: '>')
			- [sign] : The sign of the score to filter on ('pos', 'neg' or 'both') (default: 'pos')
		
		- [help] : Print this help text

		- [build] : Generates and displays the NetworkX graph.

- [springNetwork](https://github.com/brettChapman/multivis/blob/master/multivis/springNetwork.py): Interactive spring-embedded network which inherits data from the NetworkX graph.
	- [init_parameters](https://github.com/brettChapman/multivis/blob/master/multivis/springNetwork.py#L48-L52)
		- [g] : NetworkX graph.
	- [methods](https://github.com/brettChapman/multivis/blob/master/multivis/springNetwork.py#L54-L187)
		- [set_params] : Set parameters
			- [node_size_scale] : dictionary(Peak Table column name as index: dictionary('scale': ("linear", "reverse_linear", "log", "reverse_log", "square", "reverse_square", "area", "reverse_area", "volume", "reverse_volume", "ordinal", "reverse_ordinal")
                	                                                                            'range': a number array of length 2 - minimum size to maximum size)) (default: sizes all nodes to 10 with no dropdown menu)
			- [node_color_scale] : dictionary(Peak Table column name as index: dictionary('scale': ("linear", "reverse_linear", "log", "reverse_log", "square", "reverse_square", "area", "reverse_area", "volume", "reverse_volume", "ordinal", "reverse_ordinal") (default: colours all nodes to 'black')
			- [html_file] : Name to save the HTML file as (default: 'springNetwork.html')
			- [backgroundColor] : Set the background colour of the plot (default: 'white')
			- [foregroundColor] : Set the foreground colour of the plot (default: 'black')
			- [chargeStrength] : The charge strength of the spring-embedded network (force between nodes) (default: -120)
			- [groupByBlock] : Setting to 'True' will group nodes by 'Block' if present in the data (default: False)
			- [groupFociStrength] : Set the strength of foci for each group (default: 0.2)
			- [intraGroupStrength] : Set the strength between each group (default: 0.01)
			- [groupLayoutTemplate] : Set the layout template to use for grouping (default: 'treemap')
			- [node_text_size] : The text size for each node (default: 15)
			- [fix_nodes] : Setting to 'True' will fix nodes in place when manually moved (default: False)
			- [displayLabel] : Setting to 'True' will set the node labels to the 'Label' column, otherwise it will set the labels to the 'Name' column from the Peak Table (default: False)
			- [node_data] : Peak Table column names to include in the mouse over information (default: 'Name' and 'Label')
			- [link_type] : The link type used in building the network (default: 'score')
			- [link_width] : The width of the links (default: 0.5)
			- [pos_score_color] : Colour value for positive scores. Can be HTML/CSS name, hex code, and (R,G,B) tuples (default: 'red')
			- [neg_score_color] : Colour value for negative scores. Can be HTML/CSS name, hex code, and (R,G,B) tuples (default: 'black')
		
		- [help] : Print this help text

		- [build] : Generates the JavaScript embedded HTML code and writes to a HTML file and opens it in a browser.
		- [buildDashboard] : Generates the JavaScript embedded HTML code in a dashboard format, writes to a HTML file and opens it in a browser.

- [clustermap](https://github.com/brettChapman//multivis/blob/master/multivis/clustermap.py): Produces a Hierarchical Clustered Heatmap.
	- [init_parameters](https://github.com/brettChapman//multivis/blob/master/multivis/clustermap.py#L49-L57)
		- [scores] : Pandas dataframe scores.
        	- [row_linkage] : Precomputed linkage matrix for the rows from a linkage clustered distance/similarities matrix
        	- [col_linkage] : Precomputed linkage matrix for the columns from a linkage clustered distance/similarities matrix
	- [methods](https://github.com/brettChapman//multivis/blob/master/multivis/clustermap.py#L59-L401)
		- [set_params] : Set parameters
			- [xLabels] : A Pandas Series for labelling the X axis
			- [yLabels] : A Pandas Series for labelling the Y axis
			- [imageFileName] : The image file name to save to (default: 'clusterMap.png')
			- [saveImage] : Setting to 'True' will save the image to file (default: True)
			- [dpi] : The number of Dots Per Inch (DPI) for the image (default: 200)
			- [figSize] : The figure size as a tuple (width,height) (default: (80,70))
			- [dendrogram_ratio_shift] : The ratio to shift the position of the dendrogram in relation to the heatmap (default: 0.0)
			- [dendrogram_line_width] : The line width of the dendrograms (default: 1.5)
			- [background_colour] : Set the background colour (default: 'white')
			- [transparent] : Setting to 'True' will ignore background_colour and make the background transparent (default: False)
			- [fontSize] : The font size for all text (default: 30)
			- [heatmap_annotation] : Annotate the heatmap with values (default: False)
			- [heatmap_cmap] : The CMAP colour palette to use for the heatmap (default: 'RdYlGn')
			- [cluster_cmap] : The CMAP colour palette to use for the branch separation of clusters in the dendrogram (default: 'Set1')
			- [rowColorCluster] : Setting to 'True' will display a colour bar for the clustered rows (default: False)
			- [colColorCluster] : Setting to 'True' will display a colour bar for the clustered columns (default: False)
			- [row_color_threshold] : The colouring threshold for the row dendrogram (default: 10)
			- [col_color_threshold] : The colouring threshold for the column dendrogram (default: 10)
		
		- [help] : Print this help text

		- [build] : Generates and displays the Hierarchical Clustered Heatmap (HCH).

- [polarDendrogram](https://github.com/brettChapman/multivis/blob/master/multivis/polarDendrogram.py): Polar dendrogram
	- [init_parameters](https://github.com/brettChapman/multivis/blob/master/multivis/polarDendrogram.py#L55-L59)
		- [dn] : Dendrogram dictionary labelled by Peak Table index
	- [methods](https://github.com/brettChapman/multivis/blob/master/multivis/polarDendrogram.py#61-L322)
		- set_params : Set parameters
			- [imageFileName] : The image file name to save to (default: 'polarDendrogram.png')
			- [saveImage] : Setting to 'True' will save the image to file (default: True)
			- [branch_scale] : The branch distance scale to apply ('linear', 'log', 'square') (default: 'linear')
			- [gap] : The gap size within the polar dendrogram (default: 0.1)
			- [grid] : Setting to 'True' will overlay a grid (default: False)
			- [style] : Set the matplotlib style (see https://matplotlib.org/stable/tutorials/introductory/customizing.html) (default: 'seaborn-white')
			- [transparent] : Setting to 'True' will make the background of all plots transparent (default: False)
			- [dpi] : The number of Dots Per Inch (DPI) for the image (default: 200)
			- [figSize] : The figure size as a tuple (width,height) (default: (10,10))
			- [fontSize] : The font size for all text (default: 15)
			- [PeakTable] : The Peak Table Pandas dataframe (default: empty dataframe)
			- [DataTable] : The Data Table Pandas dataframe (default: empty dataframe)
			- [group_column_name] : The group column name used in the datatable (e.g. 'Class') (default: None)
			- [textColorScale] : The scale to use for colouring the text ("linear", "reverse_linear", "log", "reverse_log", "square", "reverse_square", "area", "reverse_area", "volume", "reverse_volume", "ordinal", "reverse_ordinal") (default: 'linear')
			- [text_color_column] : The colour column to use from Peak Table (Can be colour or numerical values such as 'pvalue') (default: 'black')
			- [label_column] : The label column to use from Peak Table (default: use original Peak Table index from cartesian dendrogram)
			- [text_cmap] : The CMAP colour palette to use (default: 'brg')

		- [getClusterPlots] : Generates plots of mean/median peak area over the group/class variable for each cluster from the polar dendrogram
			- [plot_type] : The type of plot to plot. Either mean peak area or median peak area (default: 'mean')
			- [column_numbers] : The number of columns to display in the plots (default: 4)
                	- [log_data] : Perform a log ('natural', base 2 or base 10) on all data (default: (True, 2))
                	- [scale_data] : Scale the data to unit variance (default: True)
                	- [impute_data] : Impute any missing values using KNN impute with a set number of nearest neighbours (default: (True, 3))
                	- [figSize] : The figure size as a tuple (width,height) (default: (15,10))
                	- [x_axis_rotation] : Rotate the x axis labels this number of degrees (default: 0)
                	- [ci] : The bootstrapped confidence interval. Can also be standard deviation ("sd") (default: 95)
                	- [saveImage] : Setting to 'True' will save the image to file (default: True)
                	- [imageFileName] : The image file name to save to (default: 'clusterPlots.png')
                	- [dpi] : The number of Dots Per Inch (DPI) for the image (default: 200)

		- [help] : Print this help text
		
		- [build] : Generates and displays the Polar dendrogram.

- [pca](https://github.com/brettChapman/multivis/blob/master/multivis/pca.py): Creates a Principal Component Analysis (PCA) scores and loadings biplot.
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/pca.py#L7)
		- [data] : array-like matrix, shape (n_samples, n_features)
		- [imageFileName] : The image file name to save to (default: 'PCA.png')
		- [saveImage] : Setting to 'True' will save the image to file (default: True)
		- [dpi] : The number of Dots Per Inch (DPI) for the image (default: 200)
		- [pcx] : The first component (default: 1)
		- [pcy] : The second component (default: 2)
		- [group_label] : Labels to assign to each group/class in the PCA plot (default: None)
		- [sample_label] : Labels to assign to each sample in the PCA plot (default: None)
		- [peak_label] : Labels to assign to each peak in the loadings biplot (default: None)
		- [markerSize] : The size of each marker (default: 100)
		- [fontSize] : The font size for all text (default: 12)
		- [figSize] : The figure size as a tuple (width,height) (default: (20,10))
		- [background_colour] : Set the background colour (default: 'white')
		- [grid] : Setting to 'True' will overlay a grid (default: True)
		- [transparent] : Setting to 'True' will ignore background_colour and make the background transparent (default: False)
		- [cmap] : The CMAP colour palette to use (default: 'Set1')

- [pcaLoadings](https://github.com/brettChapman/multivis/blob/master/multivis/pcaLoadings.py): Creates a lollipop plot of PCA components with bootstrapped confidence intervals.
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/pcaLoadings.py#L10)
		- [data] : array-like, shape (n_samples, n_features)
		- [peak_label] : A list of peaks to plot
		- [imageFileName] : The image file name to save to (default: 'PCA_loadings.png')
		- [saveImage] : Setting to 'True' will save the image to file (default: True)
        	- [dpi] : The number of Dots Per Inch (DPI) for the image (default: 200)
        	- [pc_num] : The principal component to plot (default: 1)
        	- [boot_num] : The number of bootstrap samples to use to calculate confidence internals (default: 500)
        	- [alpha] : The alpha value for the bootstrapped confidence intervals (default: 0.05)
        	- [fontSize] : The font size for all text (default: 30)
        	- [markerSize] : The size of each marker (default: 100)
        	- [figSize] : The figure size as a tuple (width,height) (default: (40,40))
        	- [transparent] : Setting to 'True' will make the background transparent (default: False)

- [pcoa](https://github.com/brettChapman/multivis/blob/master/multivis/pcoa.py): Creates a Principal Coordinate Analysis (PCoA) plot.
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/pcoa.py#L8)
		- [similarities] : array-like matrix, shape (n_samples, n_features)
		- [imageFileName] : The image file name to save to (default: 'PCOA.png')
		- [saveImage] : Setting to 'True' will save the image to file (default: True)
		- [dpi] : The number of Dots Per Inch (DPI) for the image (default: 200)
		- [n_components] : Number of components (default: 2)
		- [max_iter] : Maximum number of iterations of the SMACOF algorithm (default: 300)
		- [eps] : Relative tolerance with respect to stress at which to declare convergence (default: 1e-3)
		- [seed] : Seed number used by the random number generator for the RandomState instance (default: 3)
		- [group_label] : Labels to assign to each group/class (default: None)
		- [peak_label] : Labels to assign to each peak (default: None)
		- [markerSize] : The size of each marker (default: 100)
		- [fontSize] : The font size for all text (default: 12)
		- [figSize] : The figure size as a tuple (width,height) (default: (20,10))
		- [background_colour] : Set the background colour (default: 'white')
		- [grid] : Setting to 'True' will overlay a grid (default: True)
		- [transparent] : Setting to 'True' will ignore background_colour and make the background transparent (default: False)
		- [cmap] : The CMAP colour palette to use (default: 'Set1')

#### multivis.utils

- [loadData](https://github.com/brettChapman/multivis/blob/master/multivis/utils/loadData.py): Loads and validates the Data and Peak sheet from an excel file.
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/utils/loadData.py#L6)
		- [filename] : The name of the excel file (.xlsx file) e.g. 'Data.xlsx'.
		- [DataSheet] : The name of the data sheet in the file e.g. 'Data'. The data sheet must contain an 'Idx', 'SampleID', and 'Class' column.
		- [PeakSheet] : The name of the peak sheet in the file e.g. 'Peak'. The peak sheet must contain an 'Idx', 'Name', and 'Label' column.
	- [Returns](https://github.com/brettChapman/multivis/blob/master/multivis/utils/loadData.py#L52)
		- DataTable: Pandas dataFrame
		- PeakTable: Pandas dataFrame

- [groups2blocks](https://github.com/brettChapman/multivis/blob/master/multivis/utils/groups2blocks.py): Slices the data by group/class name into blocks for later identification of multi-block associations and places the data into a dictionary indexed by group/class name.
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/utils/groups2blocks.py#L7)
		- [PeakTable] : Pandas dataframe containing the feature/peak data. Must contain 'Name' and 'Label'.
		- [DataTable] : Pandas dataframe matrix containing values. The data must contain a column separating out the different groups in the data (e.g. Class)
		- [group_column_name] : The group column name used in the datatable (e.g. Class)
	- [Returns](https://github.com/brettChapman/multivis/blob/master/multivis/utils/groups2blocks.p#57)
		- [DataBlocks] : A dictionary containing DataTables indexed by group names
		- [PeakBlocks] : A dictionary containing PeakTables indexed by group names

- [mergeBlocks](https://github.com/brettChapman/multivis/blob/master/multivis/utils/mergeBlocks.py): Merges multiply different Data Tables and Peak Tables from dictionaries into a single Peak Table and Data Table (used for multi-block/multi-omics data preparation). The 'Name' column needs to be unique across all blocks. Automatically annotates the merged Peak Table with a 'Block' column and consolidates any statistical results generated from the multivis.utils.statistics package in relation to each block.
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/utils/mergeBlocks.py#L5)
		- [peak_blocks] : A dictionary of Pandas Peak Table dataframes from different datasets indexed by dataset type.
		- [data_blocks] : A dictionary of Pandas Data Table dataframes from different datasets indexed by dataset type.
		- [mergeType] : The type of merging to perform. Either by 'SampleID' or 'Index'.
	- [Returns](https://github.com/brettChapman/multivis/blob/master/multivis/utils/mergeBlocks.p#91)
		- [DataTable] : Merged Pandas dataFrame
		- [PeakTable] : Merged Pandas dataFrame (with any statistical results generated by multivis.utils.statistics consolidated into each block)

- [scaleData](https://github.com/brettChapman/multivis/blob/master/multivis/utils/scaleData.py): Scales data in forward or reverse order based on different scaling options.
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/utils/scaleData.py#L6)
		- [data] : A 1D numpy array of values
		- [scale] : The scaling option chosen to apply to the data ("linear", "reverse_linear", "log", "reverse_log", "square", "reverse_square", "area", "reverse_area", "volume", "reverse_volume", "ordinal", "reverse_ordinal")
		- [min] : The minimum value for scaling
		- [max] : The maximum value for scaling
	- [Returns](https://github.com/brettChapman/multivis/blob/master/multivis/utils/scaler.py#L81)
		- [scaled_data] : A scaled 1D numpy array

- [scaler](https://github.com/brettChapman/multivis/blob/master/multivis/utils/scaler.py): Scales a series of values in a 1D numpy array or pandas dataframe matrix based on different scaling functions
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/utils/scaler.py#L9)
		- [data] : A pandas dataframe matrix or 1D numpy array of numerical values
		- [type] : The scaler type to apply based on sklearn preprocessing functions (default: "standard")
		- [newMin] : The minimum value for scaling (default: 1)
		- [newMax] : The maximum value for scaling (default: 10)
	- [Returns](https://github.com/brettChapman/multivis/blob/master/multivis/utils/scaler.py#L39)
		- [scaled_data] : A scaled pandas dataframe matrix or 1D numpy array of numerical values

- [imputeData](https://github.com/brettChapman/multivis/blob/master/multivis/utils/imputeData.py): Imputes data given a pandas dataframe of values
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/utils/imputeData.py#L5)
		- [data] : A pandas dataframe of values
		- [k] : The number of nearest neighbours
	- [Returns](https://github.com/brettChapman/multivis/blob/master/multivis/utils/imputeData.py#L25)
		- [data_filled] : Imputed data

- [statistics](https://github.com/brettChapman/multivis/blob/master/multivis/utils/statistics.py): Generate a table of parametric or non-parametric statistics and merges them with the Peak Table (node table).
	- [init_parameters](https://github.com/brettChapman/multivis/blob/master/multivis/utils/statistics.py#L51-L64)
		- [peaktable] : Pandas dataframe containing peak data. Must contain 'Name' and 'Label'.
            	- [datatable] : Pandas dataframe matrix containing values for statistical analysis
	- [methods](https://github.com/brettChapman/multivis/blob/master/multivis/utils/statistics.py#L66-L482)
		- [set_params] : Set parameters
			- [parametric] : Perform parametric statistical analysis, assuming the data is normally distributed (default: True)
                	- [log_data] : Perform a log ('natural', base 2 or base 10) on all data prior to statistical analysis (default: (False, 2))
                	- [scale_data] : Scale the data to unit variance (default: False)
                	- [impute_data] : Impute any missing values using KNN impute with a set number of nearest neighbours (default: (False, 3))
			- [group_column_name] : The group column name used in the datatable (default: None)
                	- [control_group_name] : The control group name in the datatable, if available (default: None)
                	- [group_alpha_CI] : The alpha value for group confidence intervals (default: 0.05)
                	- [fold_change_alpha_CI] : The alpha value for mean/median fold change confidence intervals (default: 0.05)
                	- [pca_alpha_CI] : The alpha value for the PCA confidence intervals (default: 0.05)
                	- [total_missing] : Calculate the total missing values per feature (Default: False)
                	- [group_missing] : Calculate the missing values per feature per group (if group_column_name not None) (Default: False)
               		- [pca_loadings] : Calculate PC1 and PC2 loadings for each feature (Default: True)
                	- [normality_test] : Determine normal distribution across whole dataset using Shapiro-Wilk test (pvalues < 0.05 ~ non-normal distribution) (default: True)
                	- [group_normality_test] : Determine normal distribution across each group (if group_column_name not None) using Shapiro-Wilk test (pvalues < 0.05 ~ non-normal distribution) (default: True)
                	- [group_mean_CI] : Determine the mean with bootstrapped CI across each group (if parametric = True and group_column_name not None) (default: True)
                	- [group_median_CI] : Determine the median with bootstrapped CI across each group (if parametric = False and group_column_name not None) (default: True)
               		- [mean_fold_change] : Calculate the mean fold change with bootstrapped confidence intervals (if parametric = True, group_column_name not None and control_group_name not None) (default: False)
			- [median_fold_change] : Calculate the median fold change with bootstrapped confidence intervals (if parametric = False, group_column_name not None and control_group_name not None) (default: False)
                	- [levene_twoGroup] : Test null hypothesis that control group and each of the other groups come from populations with equal variances (if group_column_name not None and control_group_name not None) (default: False)
                	- [levene_allGroup] : Test null hypothesis that all groups come from populations with equal variances (if group_column_name not None) (default: False)
                	- [oneway_Anova_test] : Test null hypothesis that all groups have the same population mean, with included Benjamini-Hochberg FDR (if parametric = True and group_column_name not None) (default: False)
                	- [kruskal_wallis_test] : Test null hypothesis that population median of all groups are equal, with included Benjamini-Hochberg FDR (if parametric = False and group_column_name not None) (default: False)
                	- [ttest_oneGroup] : Calculate the T-test for the mean across all the data (one group), with included Benjamini-Hochberg FDR (if parametric = True, group_column_name is None or there is only 1 group in the data) (default: False)
                	- [ttest_twoGroup] : Calculate the T-test for the mean of two groups, with one group being the control group, with included Benjamini-Hochberg FDR (if parametric = True, group_column_name not None and control_group_name not None) (default: False)
                	- [mann_whitney_u_test] : Compute the Mann-Whitney rank test on two groups, with one being the control group, with included Benjamini-Hochberg FDR (if parametric = False, group_column_name not None and control_group_name not None) (default: False)

		- [help] : Print this help text
		
		- [calculate] : Performs the statistical calculations and outputs the Peak Table (node table) with the results appended.

- [corrAnalysis](https://github.com/brettChapman/multivis/blob/master/multivis/utils/corrAnalysis.py): Correlation analysis on a matrix of values with Pearson, Spearman or Kendall's Tau.
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/utils/corrAnalysis.py#L7)
		- [df_data] : A Pandas dataframe matrix of values
		- [correlationType] : The correlation type to apply. Either 'Pearson', 'Spearman' or 'KendallTau'
	- [Returns](https://github.com/brettChapman/multivis/blob/master/multivis/utils/corrAnalysis.py#L60)
		- [df_corr] : Pandas dataframe matrix of all correlation coefficients
		- [df_pval] : Pandas dataframe matrix of all correlation pvalues

- [cluster](https://github.com/brettChapman/multivis/blob/master/multivis/utils/cluster.py): Clusters data using a linkage cluster method. If the data is correlated the correlations are first preprocessed, then clustered, otherwise a distance metric is applied to non-correlated data before clustering.
	- [parameters](https://github.com/brettChapman/multivis/blob/master/multivis/utils/cluster.py#L7)
		- [matrix] : A Pandas dataframe matrix of scores
		- [transpose_non_correlated] : Setting to 'True' will transpose the matrix if it is not correlated data
		- [is_correlated] : Setting to 'True' will treat the matrix as if it contains correlation coefficients
		- [distance_metric] : Set the distance metric. Used if the matrix does not contain correlation coefficients.
		- [linkage_method] : Set the linkage method for the clustering.
	- [Returns](https://github.com/brettChapman/multivis/blob/master/multivis/utils/cluster.py#L47)
		- [matrix] : The original matrix, transposed if transpose_non_correlated is 'True' and is_correlated is 'False'.
		- [row_linkage] : linkage matrix for the rows from a linkage clustered distance/similarities matrix
		- [col_linkage] : linkage matrix for the columns from a linkage clustered distance/similarities matrix

### License
Multivis is licensed under the MIT license.

### Authors
- Brett Chapman
- https://scholar.google.com.au/citations?user=A_wYNAQAAAAJ&hl=en

### Correspondence
Dr. Brett Chapman, Post-doctoral Research Fellow at the Western Crop Genetics Alliance, Murdoch University.
E-mail: brett.chapman@murdoch.edu.au, brett.chapman78@gmail.com

### Citation
If you would like to cite MultiVis in a scientific publication, please cite this GitHub page until a citation to a publication becomes available.


