whylogs.core.statistics.schematracker

SchemaTracker Objects#

class SchemaTracker()

Track information about a column's schema and present datatypes

Paramters#

type_counts : dict If specified, a dictionary containing information about the counts of all data types.

track#

| track(item_type)

Track an item type

get_count#

| get_count(item_type)

Return the count of a given item type

infer_type#

| infer_type()

Generate a guess at what type the tracked values are.

Returns#

type_guess : object The guess tome. See InferredType.Type for candidates

merge#

| merge(other)

Merge another schema tracker with this and return a new one. Does not alter this object.

Parameters#

other : SchemaTracker

Returns#

merged : SchemaTracker Merged tracker

copy#

| copy()

Return a copy of this tracker

to_protobuf#

| to_protobuf()

Return the object serialized as a protobuf message

Returns#

message : SchemaMessage

from_protobuf#

| @staticmethod
| from_protobuf(message)

Load from a protobuf message

Returns#

schema_tracker : SchemaTracker

to_summary#

| to_summary()

Generate a summary of the statistics

Returns#

summary : SchemaSummary Protobuf summary message.