DNN
- pipeline.code.dnn.analyzer.calculate_rank(params=None, model=None, epoch=0, model_dir=None)[source]
Calculate cross-species embedding correlation ranks between human and mouse regions.
This function extracts embeddings for human and mouse transcriptional data using a trained neural network model, computes the correlation matrix between region-level embeddings, and calculates the average rank of prior homologous human regions for each mouse region.
- Parameters:
params (object) – Configuration object containing paths to human/mouse transcriptional data and homology mapping file, typically accessed via params.data_files.
model (torch.nn.Module) – Trained deep learning model used to extract embeddings from the input data.
epoch (int, optional) – Epoch index used.
model_dir (str, optional) – Directory path to save the embedding results.
- Returns:
The average rank of prior homologous human regions for each mouse region in terms of correlation with the extracted embeddings. Lower values indicate better cross-species alignment.
- Return type:
float
- pipeline.code.dnn.analyzer.transform_human_mouse(params)[source]
Transform human and mouse data using trained embedding models.
This function loads trained neural network models for a number of repeated experiments, transforms human and mouse transcriptional data into embeddings, and saves them as CSV files.
- Parameters:
params (object) –
- Configuration object containing:
data_files: dictionary with paths to human/mouse data and model save directories
repeat_n: number of repeated models
model hyperparameters: input_units, hidden_units1/2/3, output_units
use_bestrank / use_bestvalid: flags for model selection
- Returns:
Saves the generated embedding CSV files.
- Return type:
None
- pipeline.code.dnn.analyzer.independent_test(params)[source]
Evaluate the trained model on an independent test dataset.
Loads the saved models from repeated training runs, runs inference on the test data, saves predictions and confusion matrix to CSV files.
- Parameters:
params (object) –
Configuration object with the following attributes:
- data_filesdict
Paths to independent test data and saved model weights. Expected keys include ‘independent_test_path’, ‘independent_data_path’, ‘independent_s_path’.
- repeat_nint
Number of repeated training runs/models to evaluate.
- input_units, hidden_units1, hidden_units2, hidden_units3, output_unitsint
Model architecture hyperparameters.
- use_bestrank, use_bestvalidbool
Flags to select which checkpoint to load for evaluation.
- Returns:
Saves prediction results and confusion matrices as CSV files.
- Return type:
None
- class pipeline.code.dnn.pipeline.DnnPipeline(config)[source]
Bases:
objectDeep Neural Network (DNN) model training and evaluation pipeline.
This class implements a full pipeline for training, cross-species transformation, and independent test of DNN models on transcriptional data.
- Parameters:
config (object) – Configuration object containing paths, parameters, and flags to control the pipeline.
- cfg
Configuration object.
- Type:
object
- logger
Logger instance for tracking pipeline progress and information.
- Type:
logging.Logger