write_clustering_results¶
- tsml_eval.utils.results_writing.write_clustering_results(cluster_predictions, cluster_probabilities, class_labels, clusterer_name, dataset_name, file_path, full_path=True, first_line_clusterer_name=None, split=None, resample_id=None, time_unit='N/A', first_line_comment=None, parameter_info='No Parameter Info', clustering_accuracy=-1, fit_time=-1, predict_time=-1, benchmark_time=-1, memory_usage=-1, n_classes=-1, n_clusters=-1)[source]¶
Write the predictions for a clustering experiment in the format used by tsml.
- Parameters:
- cluster_predictionsnp.array
The predicted values to write to file. Must be the same length as labels.
- cluster_probabilitiesnp.ndarray
Estimated cluster probabilities. These are written after the predicted values for each case.
- class_labelsnp.array
The actual class values written to file with the predicted values. If no label is available for a case, a NaN value should be substituted.
- clusterer_namestr
Name of the clusterer that made the predictions. Written to file and can determine file structure if full_path is False.
- dataset_namestr
Name of the problem the clusterer was built on.
- file_pathstr
Path to write the results file to or the directory to build the default file structure if full_path is False.
- full_pathboolean, default=True
If True, results are written directly to the directory passed in file_path. If False, then a standard file structure using the clusterer and dataset names is created and used to write the results file.
- first_line_clusterer_namestr or None, default=None
Alternative name for the clusterer to be written to the file. If None, the clusterer_name is used. Useful if full_path is False and extra information is wanted in the clusterer name (i.e. and alias and class name)
- splitstr or None, default=None
Either None, ‘TRAIN’ or ‘TEST’. Influences the result file name and first line of the file.
- resample_idint or None, default=None
Indicates what random seed was used to resample the data or used as a random_state for the clusterer.
- time_unitstr, default=”N/A”
The format used for timings in the file, i.e. ‘Seconds’, ‘Milliseconds’, ‘Nanoseconds’
- first_line_commentstr or None, default=None
Optional comment appended to the end of the first line, i.e. the file used to generate the results or a dictionary linking label indices to actual values.
- parameter_infostr, default=”No Parameter Info”
Unstructured estimator dependant information, i.e. estimator parameters or values from the model build.
- clustering_accuracyfloat, default=-1
The clustering accuracy of the predictions.
- fit_timeint, default=-1
The time taken to fit the clusterer.
- predict_timeint, default=-1
The time taken to predict the cluster labels.
- benchmark_timeint, default=-1
A benchmark time for the hardware used to scale other timings.
- memory_usageint, default=-1
The memory usage of the clusterer.
- n_classesint, default=-1
The number of classes in the dataset.
- n_clustersint, default=-1
The number of clusters founds by the clusterer.