我有这个sql查询,用于pyspark中的hiveql:
spark.sql('SELECT split(parse_url(page.viewed_page,"PATH"),"/")[1] as path FROM df')
我想翻译成如下功能查询:
df.select(split(parse_url(col('page.viewed_page'),'HOST')))
但是当我导入parse_url
函数时,我得到:
----> 1 from pyspark.sql.functions import split,parse_url
ImportError: cannot import name 'parse_url' from 'pyspark.sql.functions' (/usr/local/opt/apache-spark/libexec/python/pyspark/sql/functions.py)
您能指出我正确的方向来导入parse_url函数吗?
欢呼