What is polars.from_numpy() in Polars?

Polars is a fast DataFrame library written in Rust. It’s designed for high-performance operations on large tabular datasets and is often faster than pandas on such workloads.

The polars.from_numpy() function makes it easier to bring Polars into existing data processing workflows, particularly for users who already work with NumPy arrays in Python.

The `polars.from_numpy()` method

The polars.from_numpy() function builds a DataFrame from a NumPy ndarray by copying its data into the newly created DataFrame. Because the DataFrame holds its own copy of the data, modifications made to the DataFrame do not affect the original array.

Syntax

Parameters

data: It refers to the data stored as a NumPy ndarray.
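schema: It refers to the structure of the DataFrame, which includes the names of the columns and the data types associated with each column. The schema can be declared in several ways:
- As a dictionary of {name: type} pairs. If a type is None, it is inferred automatically.
- As a list of column names. In this case, the types are inferred automatically.
- As a list of (name, type) pairs. This is equivalent to the dictionary form.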
schema_overrides: A dictionary to specify or override types for one or more columns. It overrides any types inferred from the columns.
orient: This parameter specifies whether two-dimensional data is interpreted as columns or as rows.
- None: The orientation is inferred by matching the dimensions of the data against the schema.
- col: Each inner array (each entry along the first axis) is treated as a column of the DataFrame.
- row: Each inner array is treated as a row of the DataFrame.

Note: If orientation inference doesn't yield conclusive results, column orientation is used by default.

Return value

This function returns a Polars DataFrame built from the NumPy ndarray.

Code

Explanation

Lines 1–2: We import the polars and numpy libraries as pl and np, respectively.

Line 4: We create a 2D NumPy array, data, with three rows and three columns.

Line 6: The pl.from_numpy() function creates a DataFrame, df, out of the data array.

The schema parameter is set to ["row 1", "row 2", "row 3"], which specifies the column names for the DataFrame. In this case, the DataFrame will have columns named "row 1", "row 2", and "row 3".
The orient parameter is set to "col", which specifies that each inner array of the NumPy array (that is, each of its rows) is treated as a column in the DataFrame. This is why the columns are named after the rows of the array.

Line 7: We print the Polars DataFrame.
