无法将组织模块导入PySpark集群

问题描述 投票:1回答:1

我正在尝试从组织模块导入FPGrowth,但是在安装组织模块时抛出错误。我还尝试将org.apache.spark替换为pyspark,仍然无法正常工作。

!pip install org
import org.apache.spark.ml.fpm.FPGrowth

下面是错误:

ERROR: Could not find a version that satisfies the requirement org (from versions: none)
ERROR: No matching distribution found for org
---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
<ipython-input-12-c730562e7076> in <module>
      1 get_ipython().system('pip install org')
----> 2 import org.apache.spark.ml.fpm.FPGrowth

ModuleNotFoundError: No module named 'org'
python apache-spark pyspark google-cloud-dataproc fpgrowth
1个回答
0
投票

要在PySpark中导入FPGrowth,您需要编写:

from pyspark.ml.fpm import FPGrowth

您可以在FPGrowth中找到有关如何使用Spark documentation的其他说明。

© www.soinside.com 2019 - 2024. All rights reserved.