dimcat.data package
Subpackages
- dimcat.data.catalogs package
- Submodules
- dimcat.data.catalogs.base module
DimcatCatalog, DimcatCatalog.PickleSchema, DimcatCatalog.Schema, DimcatCatalog.add_package(), DimcatCatalog.add_resource(), DimcatCatalog.basepath, DimcatCatalog.check_feature_availability(), DimcatCatalog.copy(), DimcatCatalog.extend(), DimcatCatalog.extend_package(), DimcatCatalog.get_package(), DimcatCatalog.get_package_by_name(), DimcatCatalog.get_resource_by_config(), DimcatCatalog.get_resource_by_name(), DimcatCatalog.get_resources_by_regex(), DimcatCatalog.get_resources_by_type(), DimcatCatalog.has_package(), DimcatCatalog.iter_resources(), DimcatCatalog.make_new_package(), DimcatCatalog.package_names, DimcatCatalog.packages, DimcatCatalog.replace_package(), DimcatCatalog.summary_dict()
- dimcat.data.catalogs.inputs module
- dimcat.data.catalogs.outputs module
- Module contents
- dimcat.data.datasets package
- Submodules
- dimcat.data.datasets.base module
Dataset, Dataset.PickleSchema, Dataset.Schema, Dataset.add_output(), Dataset.apply_step(), Dataset.check_feature_availability(), Dataset.copy(), Dataset.extract_feature(), Dataset.extractable_features, Dataset.from_catalogs(), Dataset.from_dataset(), Dataset.from_loader(), Dataset.from_package(), Dataset.get_feature(), Dataset.get_last_step(), Dataset.get_metadata(), Dataset.get_steps(), Dataset.inputs, Dataset.iter_features(), Dataset.load(), Dataset.load_feature(), Dataset.load_package(), Dataset.n_active_features, Dataset.n_features_available, Dataset.outputs, Dataset.pipeline, Dataset.reset_pipeline(), Dataset.summary_dict()
- dimcat.data.datasets.processed module
- Module contents
- dimcat.data.packages package
- Submodules
- dimcat.data.packages.base module
Package, Package.PickleSchema, Package.Schema, Package.add_resource(), Package.available_features, Package.basepath, Package.check_if_homogeneous(), Package.contained_features, Package.copy(), Package.create_and_add_resource(), Package.descriptor_exists, Package.descriptor_filename, Package.descriptor_is_complete, Package.extend(), Package.extract_feature(), Package.extractable_features, Package.filepath, Package.from_descriptor(), Package.from_descriptor_path(), Package.from_directory(), Package.from_filepaths(), Package.from_package(), Package.from_resources(), Package.get_descriptor_filename(), Package.get_descriptor_path(), Package.get_feature(), Package.get_metadata(), Package.get_piece_index(), Package.get_resource(), Package.get_resource_by_config(), Package.get_resource_by_name(), Package.get_resources_by_regex(), Package.get_resources_by_type(), Package.get_zip_filepath(), Package.get_zip_path(), Package.is_aligned, Package.is_empty, Package.is_fully_serialized, Package.is_partially_serialized, Package.is_paths_only, Package.iter_facets(), Package.iter_features(), Package.make_descriptor(), Package.n_resources, Package.normpath, Package.package_exists, Package.package_name, Package.replace_resource(), Package.resource_names, Package.resources, Package.status, Package.store_descriptor(), Package.summary_dict(), Package.validate(), Package.zip_file_exists
PackageMode, PackageSchema, PackageStatus, PathPackage
- dimcat.data.packages.dc module
- dimcat.data.packages.score module
- Module contents
- dimcat.data.resources package
- Submodules
- dimcat.data.resources.base module
FeatureName, FeatureName.Annotations, FeatureName.Articulation, FeatureName.BassNotes, FeatureName.CadenceLabels, FeatureName.DcmlAnnotations, FeatureName.Feature, FeatureName.HarmonyLabels, FeatureName.KeyAnnotations, FeatureName.Measures, FeatureName.Metadata, FeatureName.Notes, FeatureName.PhraseAnnotations, FeatureName.PhraseComponents, FeatureName.PhraseLabels
PathResource, Resource, Resource.ID, Resource.PickleSchema, Resource.Schema, Resource.basepath, Resource.copy(), Resource.copy_to_new_location(), Resource.corpus_name, Resource.descriptor_exists, Resource.descriptor_filename, Resource.detach_from_basepath(), Resource.detach_from_descriptor(), Resource.detach_from_filepath(), Resource.filepath, Resource.from_descriptor(), Resource.from_descriptor_path(), Resource.from_filepath(), Resource.from_resource(), Resource.from_resource_path(), Resource.get_corpus_name(), Resource.get_descriptor_filename(), Resource.get_descriptor_path(), Resource.get_filepath(), Resource.get_innerpath(), Resource.get_path_dict(), Resource.get_resource_name(), Resource.innerpath, Resource.is_empty, Resource.is_frozen, Resource.is_loaded, Resource.is_packaged, Resource.is_serialized, Resource.is_valid, Resource.is_zipped, Resource.make_descriptor(), Resource.normpath, Resource.resource, Resource.resource_exists, Resource.resource_name, Resource.set_basepath(), Resource.set_descriptor_filename(), Resource.status, Resource.store_descriptor(), Resource.to_dict(), Resource.validate()
ResourceSchema, ResourceStatus, reconcile_base_and_file(), resource_specs2resource()
- dimcat.data.resources.dc module
DimcatIndex, DimcatIndex.PickleSchema, DimcatIndex.Schema, DimcatIndex.copy(), DimcatIndex.filter(), DimcatIndex.from_dataframe(), DimcatIndex.from_grouping(), DimcatIndex.from_index(), DimcatIndex.from_resource(), DimcatIndex.from_tuples(), DimcatIndex.get_level_values_to_drop(), DimcatIndex.index, DimcatIndex.names, DimcatIndex.piece_level_position, DimcatIndex.sample(), DimcatIndex.to_resource()
DimcatResource, DimcatResource.Schema, DimcatResource.align_with_grouping(), DimcatResource.apply_slice_intervals(), DimcatResource.apply_step(), DimcatResource.column_schema, DimcatResource.dataframe, DimcatResource.default_groupby, DimcatResource.df, DimcatResource.extract_feature(), DimcatResource.extractable_features, DimcatResource.field_names, DimcatResource.filter_index_level(), DimcatResource.format, DimcatResource.format_dataframe(), DimcatResource.formatted_column, DimcatResource.from_dataframe(), DimcatResource.from_descriptor(), DimcatResource.from_descriptor_path(), DimcatResource.from_filepath(), DimcatResource.from_index(), DimcatResource.from_resource(), DimcatResource.from_resource_and_dataframe(), DimcatResource.from_resource_path(), DimcatResource.get_dataframe(), DimcatResource.get_default_analysis(), DimcatResource.get_default_column_names(), DimcatResource.get_grouping_levels(), DimcatResource.get_index(), DimcatResource.get_interval_index(), DimcatResource.get_level_names(), DimcatResource.get_normpath(), DimcatResource.get_piece_index(), DimcatResource.get_slice_intervals(), DimcatResource.get_time_spans(), DimcatResource.has_distinct_formatted_column, DimcatResource.innerpath, DimcatResource.is_empty, DimcatResource.is_loaded, DimcatResource.is_valid, DimcatResource.join_on_index(), DimcatResource.load(), DimcatResource.make_bar_plot(), DimcatResource.make_bubble_plot(), DimcatResource.make_pie_chart(), DimcatResource.metadata, DimcatResource.plot(), DimcatResource.plot_grouped(), DimcatResource.set_basepath(), DimcatResource.set_dataframe(), DimcatResource.store_dataframe(), DimcatResource.store_resource(), DimcatResource.subselect(), DimcatResource.summary_dict(), DimcatResource.update_default_groupby(), DimcatResource.validate(), DimcatResource.value_column
Feature, IndexField, PieceIndex, Playthrough, SliceIntervals, UnitOfAnalysis
- dimcat.data.resources.facets module
AnnotationsFacet, ControlsFacet, EventsFacet, Facet, MuseScoreChords, MuseScoreFacet, MuseScoreFacetName, MuseScoreHarmonies, MuseScoreMeasures, MuseScoreNotes, StructureFacet, add_chord_tone_intervals(), add_chord_tone_scale_degrees(), chord_tones2interval_structure(), extend_cadence_feature(), extend_harmony_feature(), extend_keys_feature(), get_index_intervals_for_phrases(), make_chord_col(), make_raw_phrase_df(), make_take_mask_and_index()
- dimcat.data.resources.features module
AUXILIARY_DCML_ANNOTATIONS_COLUMNS, Annotations, Articulation, BassNotes, BassNotesFormat, CadenceLabelFormat, CadenceLabels, DcmlAnnotations, HARMONY_CONVENIENCE_COLUMNS, HarmonyLabels, HarmonyLabelsFormat, KEY_CONVENIENCE_COLUMNS, KeyAnnotations, Measures, Metadata, Notes, NotesFormat, PhraseAnnotations, PhraseComponentName, PhraseComponents, PhraseLabels, condense_components(), condense_phrases(), extend_bass_notes_feature(), extend_notes_feature(), make_sequence_non_repeating(), merge_tied_notes(), tuple_contains()
- dimcat.data.resources.results module
CadenceCounts, Counts, CulledPrevalenceMatrix, CulledRelativePrevalenceMatrix, Durations, GroupwisePrevalenceMatrix, InverseDocumentFrequencyFlavor, NgramTable, NgramTable.compute_information_gain(), NgramTable.format, NgramTable.get_default_analysis(), NgramTable.get_grouping_levels(), NgramTable.get_transitions(), NgramTable.make_bigram_df(), NgramTable.make_bigram_table(), NgramTable.make_bigram_tuples(), NgramTable.make_ngram_df(), NgramTable.make_ngram_table(), NgramTable.make_ngram_tuples(), NgramTable.make_ranking_table(), NgramTable.ngram_levels, NgramTable.plot(), NgramTable.plot_grouped()
NgramTableFormat, NgramTuples, PhraseData, PhraseDataFormat, PrevalenceMatrix, PrevalenceMatrix.absolute, PrevalenceMatrix.document_frequencies(), PrevalenceMatrix.document_frequency(), PrevalenceMatrix.document_prevalence(), PrevalenceMatrix.get_culled_matrix(), PrevalenceMatrix.get_groupwise_prevalence(), PrevalenceMatrix.get_relative_prevalence(), PrevalenceMatrix.inverse_document_frequencies(), PrevalenceMatrix.inverse_document_frequency(), PrevalenceMatrix.is_absolute, PrevalenceMatrix.is_complete, PrevalenceMatrix.make_bar_plot(), PrevalenceMatrix.make_bubble_plot(), PrevalenceMatrix.make_pie_chart(), PrevalenceMatrix.make_ranking_table(), PrevalenceMatrix.n_documents, PrevalenceMatrix.n_types, PrevalenceMatrix.overall_prevalence, PrevalenceMatrix.plot(), PrevalenceMatrix.plot_grouped(), PrevalenceMatrix.relative, PrevalenceMatrix.tf_idf(), PrevalenceMatrix.type_count, PrevalenceMatrix.type_prevalence(), PrevalenceMatrix.z_scores
RelativePrevalenceMatrix, Result, Result.Schema, Result.analyzed_resource, Result.combine_results(), Result.compute_entropy(), Result.dimension_column, Result.feature_columns, Result.formatted_column, Result.get_grouping_levels(), Result.is_combination, Result.make_bar_plot(), Result.make_bubble_plot(), Result.make_pie_chart(), Result.make_ranking_table(), Result.metadata, Result.plot(), Result.plot_grouped(), Result.uses_line_of_fifths_colors, Result.value_column, Result.x_column, Result.y_column
ResultName, ResultName.CadenceCounts, ResultName.Counts, ResultName.CulledPrevalenceMatrix, ResultName.CulledRelativePrevalenceMatrix, ResultName.Durations, ResultName.GroupwisePrevalenceMatrix, ResultName.NgramTable, ResultName.NgramTuples, ResultName.PhraseData, ResultName.PrevalenceMatrix, ResultName.RelativePrevalenceMatrix, ResultName.Result, ResultName.Transitions
TerminalSymbol, Transitions, compute_entropy_of_observations(), compute_entropy_of_occurrences(), compute_entropy_of_probabilities(), logarithm_function(), make_heatmaps_from_transitions(), prepare_transitions(), turn_proportions_into_percentage_strings()
- dimcat.data.resources.utils module
align_with_grouping(), append_index_levels(), apply_playthrough(), apply_slice_intervals_to_resource_df(), boolean_is_minor_column_to_mode(), check_configs_against_allowed_configs(), check_qstamp_columns(), condense_dataframe_by_groups(), condense_pedal_points(), drop_duplicated_ultima_rows(), drop_rows_with_missing_values(), ensure_level_named_piece(), feature_specs2config(), features_argument2config_list(), fl_fields2pandas_params(), get_corpus_display_name(), get_existing_normpath(), get_time_spans_from_resource_df(), infer_piece_col_position(), infer_schema_from_df(), insert_index_level(), join_df_on_index(), load_fl_resource(), load_index_from_fl_resource(), make_adjacency_groups(), make_adjacency_mask(), make_boolean_mask_from_set_of_tuples(), make_frictionless_schema_descriptor(), make_group_start_mask(), make_groups_lasts_mask(), make_groupwise_range_index_from_groups(), make_index_from_grouping_dict(), make_multiindex_for_unstack(), make_phrase_start_mask(), make_range_index_from_boolean_mask(), make_regrouped_stage_index(), make_tsv_resource(), merge_columns_into_one(), merge_ties(), nan_eq(), overlapping_chunk_per_interval_cutoff_direct(), regroup_phrase_stages(), resolve_columns_argument(), resolve_join_str_argument(), resolve_levels_argument(), resolve_recognized_piece_columns_argument(), safe_row_tuple(), store_json(), str2inttuple(), str2pd_interval(), subselect_multiindex_from_df(), transform_phrase_data(), transpose_notes_to_c(), tuple2str(), update_duration_qb(), value2bool()
- Module contents
Submodules
dimcat.data.base module
- class dimcat.data.base.AbsolutePathStr
Bases: str
This is just a string, but if it includes the HOME directory, it is represented with a leading ‘~’.
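The behaviour described for AbsolutePathStr can be illustrated with a minimal sketch. The class name `TildePathStr` and its `home` attribute are hypothetical and chosen for illustration only; this is not dimcat's implementation, merely one way a `str` subclass could abbreviate the HOME directory in its representation while keeping the full path as its value.

```python
import os


class TildePathStr(str):
    """Hypothetical stand-in for AbsolutePathStr; not dimcat's actual code."""

    # Assumption: HOME is looked up once; kept as a class attribute so it can be overridden.
    home = os.path.expanduser("~")

    def __repr__(self) -> str:
        home = self.home
        # Abbreviate only when the string is HOME itself or a path below it.
        if self == home or self.startswith(home + os.sep):
            return repr("~" + self[len(home):])
        return super().__repr__()


path = TildePathStr(os.path.join(TildePathStr.home, "corpora"))
print(repr(path))              # abbreviated representation
print(os.path.basename(path))  # the value itself is still the full path
```

Because only `__repr__` is overridden in this sketch, the object can be passed to any function expecting a regular string.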
- class dimcat.data.base.Data(basepath: Optional[str] = None)
Bases: DimcatObject
This base class unites all classes containing data in some way or another.
- class PickleSchema(*, only: Optional[Union[Sequence[str], AbstractSet[str]]] = None, exclude: Union[Sequence[str], AbstractSet[str]] = (), many: Optional[bool] = None, load_only: Union[Sequence[str], AbstractSet[str]] = (), dump_only: Union[Sequence[str], AbstractSet[str]] = (), partial: Optional[Union[bool, Sequence[str], AbstractSet[str]]] = None, unknown: Optional[Literal['exclude', 'include', 'raise']] = None)
Bases: Schema
When serializing data objects, the basepath is used as the storage location, but it is not included in the descriptor, in accordance with the frictionless specification.
- exclude: set[Any] | MutableSet[Any]
- unknown: types.UnknownOption
- class Schema(*, only: Optional[Union[Sequence[str], AbstractSet[str]]] = None, exclude: Union[Sequence[str], AbstractSet[str]] = (), many: Optional[bool] = None, load_only: Union[Sequence[str], AbstractSet[str]] = (), dump_only: Union[Sequence[str], AbstractSet[str]] = (), partial: Optional[Union[bool, Sequence[str], AbstractSet[str]]] = None, unknown: Optional[Literal['exclude', 'include', 'raise']] = None)
Bases: Schema
- exclude: set[Any] | MutableSet[Any]
- unknown: types.UnknownOption
- get_basepath(set_default_if_missing: bool = False) -> str
Get the basepath of the resource. If none has been specified, the default basepath is returned. If set_default_if_missing is set to True and no basepath has been set (e.g. during initialization), the basepath is permanently set to the default basepath.
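The fallback semantics of get_basepath() can be sketched as follows. `BasepathHolder` and its `default_basepath` argument are hypothetical names used for illustration; where dimcat actually obtains its default basepath is not stated here, so the sketch simply takes it as a constructor argument.

```python
from typing import Optional


class BasepathHolder:
    """Hypothetical illustration of the documented fallback; not dimcat's Data class."""

    def __init__(self, basepath: Optional[str] = None, default_basepath: str = "/tmp/default"):
        self._basepath = basepath
        self._default_basepath = default_basepath  # assumption: the default is known up front

    def get_basepath(self, set_default_if_missing: bool = False) -> str:
        if self._basepath is not None:
            return self._basepath
        if set_default_if_missing:
            # Persist the default so that later calls see an explicitly set basepath.
            self._basepath = self._default_basepath
        return self._default_basepath


holder = BasepathHolder()
holder.get_basepath()                             # returns the default, stores nothing
holder.get_basepath(set_default_if_missing=True)  # returns the default and stores it
```

The distinction matters whenever code later checks whether a basepath was set explicitly: only the second call changes the object's state.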
- class property pickle_schema
Returns the (instantiated) PickleSchema singleton object for this class. It is different from the ‘normal’ Schema in that it stores the tabular data to disk and returns the path to its descriptor.
- to_config(pickle=False) -> DimcatConfig
If pickle is set to True,
- static treat_new_basepath(basepath: str, filepath=None, other_logger=None) -> AbsolutePathStr
- dimcat.data.base.resolve_path(path) -> Optional[AbsolutePathStr]
Resolves ‘~’ to the HOME directory and turns path into an absolute path. This is an identical copy of the function in dimcat.utils.
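A rough equivalent of the documented behaviour of resolve_path() can be written with the standard library alone. The function name `resolve_path_sketch` is hypothetical, and returning None for a None input is an assumption suggested by the Optional return type, not something the documentation confirms.

```python
import os
from typing import Optional


def resolve_path_sketch(path) -> Optional[str]:
    """Hypothetical re-implementation for illustration; returns a plain str."""
    if path is None:
        # Assumption: a missing path is passed through as None.
        return None
    # Expand a leading '~' first, then make the result absolute.
    return os.path.abspath(os.path.expanduser(str(path)))


print(resolve_path_sketch("~/corpora"))
print(resolve_path_sketch("relative/dir"))
```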
dimcat.data.utils module
- dimcat.data.utils.check_descriptor_filename_argument(descriptor_filename) -> str
Check that descriptor_filename is a filename (not a path) and warn if it doesn’t have the extension .json or .yaml.
- Parameters:
descriptor_filename –
- Raises:
ValueError – If the descriptor_filename is absolute.
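The documented contract (raise for absolute paths, warn for unexpected extensions, return the filename) could look roughly like the sketch below. The name `check_descriptor_filename_sketch` and the message texts are hypothetical; the real function may perform additional checks that are not described here.

```python
import os
import warnings


def check_descriptor_filename_sketch(descriptor_filename: str) -> str:
    """Hypothetical illustration of the documented checks; not dimcat's code."""
    if os.path.isabs(descriptor_filename):
        raise ValueError(
            f"descriptor_filename needs to be a filename, got an absolute path: {descriptor_filename!r}"
        )
    _, extension = os.path.splitext(descriptor_filename)
    if extension not in (".json", ".yaml"):
        # Only a warning: the filename is still returned to the caller.
        warnings.warn(f"{descriptor_filename!r} does not end in .json or .yaml")
    return descriptor_filename


print(check_descriptor_filename_sketch("datapackage.json"))
```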
- dimcat.data.utils.make_fl_resource(name: Optional[str] = None, **options) -> Resource
Creates a frictionless.Resource by passing the **options to the constructor.
- dimcat.data.utils.make_rel_path(path: str, start: str)
Like os.path.relpath() but ensures that path is contained within start.
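The containment guarantee can be sketched on top of os.path.relpath(). The name `make_rel_path_sketch` is hypothetical, and raising a ValueError when path lies outside start is an assumption: the documentation only says that containment is ensured, not how a violation is reported.

```python
import os


def make_rel_path_sketch(path: str, start: str) -> str:
    """Hypothetical illustration; not dimcat's implementation."""
    rel_path = os.path.relpath(path, start)
    # A relative path that climbs upwards means that path is not inside start.
    if rel_path == os.pardir or rel_path.startswith(os.pardir + os.sep):
        # Assumption: violations are reported with a ValueError.
        raise ValueError(f"{path!r} is not contained in {start!r}")
    return rel_path


root = os.path.join(os.sep, "data")
print(make_rel_path_sketch(os.path.join(root, "corpus", "notes.tsv"), root))
```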
- dimcat.data.utils.store_as_json_or_yaml(descriptor_dict: dict, descriptor_path: str, create_dirs: bool = True)