hipdf.core.dtypes.StructDtype#
25 min read time
- class hipdf.core.dtypes.StructDtype(fields)#
Bases:
_BaseDtypeType to represent a struct data.
Parameters#
- fieldsdict
A mapping of field names to dtypes, the dtypes can themselves be of
StructDtypetoo.
Attributes#
fields itemsize
Methods#
from_arrow to_arrow
Examples#
>>> import cudf >>> struct_dtype = cudf.StructDtype({"a": "int64", "b": "string"}) >>> struct_dtype StructDtype({'a': dtype('int64'), 'b': dtype('O')})
A nested
StructDtypecan also be constructed in the following way:>>> nested_struct_dtype = cudf.StructDtype({"dict_data": struct_dtype, "c": "uint8"}) >>> nested_struct_dtype StructDtype({'dict_data': StructDtype({'a': dtype('int64'), 'b': dtype('O')}), 'c': dtype('uint8')})
- __init__(fields)#
Methods
__init__(fields)Return the array type associated with this dtype.
construct_from_string(string)Construct this type from a string.
deserialize(header, frames)Generate an object from a serialized representation.
device_deserialize(header, frames)Perform device-side deserialization tasks.
Serialize data and metadata associated with device memory.
empty(shape)Construct an ExtensionArray of this dtype with the given shape.
from_arrow(typ)Convert a
pyarrow.StructTypetoStructDtype.host_deserialize(header, frames)Perform device-side deserialization tasks.
Serialize data and metadata associated with host memory.
is_dtype(dtype)Check if we match 'dtype'.
Generate an equivalent serializable representation of an object.
to_arrow()Convert a
StructDtypeto apyarrow.StructType.Attributes
Returns an ordered dict of column name and dtype key-value.
The Index subclass to return from Index.__new__ when this dtype is encountered.
A character code (one of 'biufcmMOSUV'), default 'O'
Default NA value to use for this type.
Ordered list of field names, or None if there are no fields.
The scalar type for the array, e.g.
int.- name = 'struct'#
- __init__(fields)#
- property fields#
Returns an ordered dict of column name and dtype key-value.
Examples#
>>> import cudf >>> struct_dtype = cudf.StructDtype({"a": "int64", "b": "string"}) >>> struct_dtype StructDtype({'a': dtype('int64'), 'b': dtype('O')}) >>> struct_dtype.fields {'a': dtype('int64'), 'b': dtype('O')}
- property type#
The scalar type for the array, e.g.
intIt’s expected
ExtensionArray[item]returns an instance ofExtensionDtype.typefor scalaritem, assuming that value is valid (not NA). NA values do not need to be instances of type.
- classmethod from_arrow(typ)#
Convert a
pyarrow.StructTypetoStructDtype.Examples#
>>> import cudf >>> import pyarrow as pa >>> pa_struct_type = pa.struct({'x': pa.int32(), 'y': pa.string()}) >>> pa_struct_type StructType(struct<x: int32, y: string>) >>> cudf.StructDtype.from_arrow(pa_struct_type) StructDtype({'x': dtype('int32'), 'y': dtype('O')})
- to_arrow()#
Convert a
StructDtypeto apyarrow.StructType.Examples#
>>> import cudf >>> struct_type = cudf.StructDtype({"x": "int32", "y": "string"}) >>> struct_type StructDtype({'x': dtype('int32'), 'y': dtype('O')}) >>> struct_type.to_arrow() StructType(struct<x: int32, y: string>)
- property itemsize#
- classmethod construct_array_type() type_t[ExtensionArray]#
Return the array type associated with this dtype.
Returns#
type
- classmethod construct_from_string(string: str) Self#
Construct this type from a string.
This is useful mainly for data types that accept parameters. For example, a period dtype accepts a frequency parameter that can be set as
period[h](where H means hourly frequency).By default, in the abstract class, just the name of the type is expected. But subclasses can overwrite this method to accept parameters.
Parameters#
- stringstr
The name of the type, for example
category.
Returns#
- ExtensionDtype
Instance of the dtype.
Raises#
- TypeError
If a class cannot be constructed from this ‘string’.
Examples#
For extension dtypes with arguments the following may be an adequate implementation.
>>> import re >>> @classmethod ... def construct_from_string(cls, string): ... pattern = re.compile(r"^my_type\[(?P<arg_name>.+)\]$") ... match = pattern.match(string) ... if match: ... return cls(**match.groupdict()) ... else: ... raise TypeError( ... f"Cannot construct a '{cls.__name__}' from '{string}'" ... )
- empty(shape: Shape) ExtensionArray#
Construct an ExtensionArray of this dtype with the given shape.
Analogous to numpy.empty.
Parameters#
shape : int or tuple[int]
Returns#
ExtensionArray
- index_class#
The Index subclass to return from Index.__new__ when this dtype is encountered.
- classmethod is_dtype(dtype: object) bool#
Check if we match ‘dtype’.
Parameters#
- dtypeobject
The object to check.
Returns#
bool
Notes#
The default implementation is True if
cls.construct_from_string(dtype)is an instance ofcls.dtypeis an object and is an instance ofclsdtypehas adtypeattribute, and any of the above conditions is true fordtype.dtype.
- property kind: str#
A character code (one of ‘biufcmMOSUV’), default ‘O’
This should match the NumPy dtype used when the array is converted to an ndarray, which is probably ‘O’ for object if the extension type cannot be represented as a built-in NumPy type.
See Also#
numpy.dtype.kind