Python pyspark DataFrame.swapaxes用法及代码示例

本文简要介绍 pyspark.pandas.DataFrame.swapaxes 的用法。

用法: DataFrame.swapaxes(i: Union[int, str], j: Union[int, str], copy: bool = True) → pyspark.pandas.frame.DataFrame

适当地交换轴和交换值轴。

注意

由于大数据的性质，这种方法基于昂贵的操作。在内部，它需要为每个值生成每一行，然后分组两次——这是一个巨大的操作。为防止误用，此方法具有“compute.max_rows”默认输入长度限制，并引发 ValueError。

>>> from pyspark.pandas.config import option_context
>>> with option_context('compute.max_rows', 1000):  
...     ps.DataFrame({'a': range(1001)}).swapaxes(i=0, j=1)
Traceback (most recent call last):
  ...
ValueError: Current DataFrame has more then the given limit 1000 rows.
Please set 'compute.max_rows' by using 'pyspark.pandas.config.set_option'
to retrieve to retrieve more than 1000 rows. Note that, before changing the
'compute.max_rows', this operation is considerably expensive.

参数：

i: {0 or ‘index’, 1 or ‘columns’}. The axis to swap.：
j: {0 or ‘index’, 1 or ‘columns’}. The axis to swap.：
copy：布尔值，默认为真。

DataFrame

例子：

>>> psdf = ps.DataFrame(
...     [[1, 2, 3], [4, 5, 6], [7, 8, 9]], index=['x', 'y', 'z'], columns=['a', 'b', 'c']
... )
>>> psdf
   a  b  c
x  1  2  3
y  4  5  6
z  7  8  9
>>> psdf.swapaxes(i=1, j=0)
   x  y  z
a  1  4  7
b  2  5  8
c  3  6  9
>>> psdf.swapaxes(i=1, j=1)
   a  b  c
x  1  2  3
y  4  5  6
z  7  8  9

相关用法

注：本文由纯净天空筛选整理自spark.apache.org大神的英文原创作品 pyspark.pandas.DataFrame.swapaxes。非经特殊声明，原始代码版权归原作者所有，本译文未经允许或授权，请勿转载或复制。

用法:

参数：

返回：

例子：