尝试使用 python 在 Databricks 上将字符串列拆分为 4 列:A、B、C、D。
# Load CSV file
df = spark.read.format("csv").options(header='false').load("file path")
# Rename header
RawDataDF = RawDataDF.withColumnRenamed("_c0","raw")
#Attempt to split "raw" into 4 columns:
splitDF = RawDataDF.withColumn("split_raw_arr",split("raw"," "))
uDataDF= uDataDF.withColumn('Column A',splitDF.getItem(0))
uDataDF= uDataDF.withColumn('Column B',splitDF.getItem(1))
uDataDF= uDataDF.withColumn('Column C',splitDF.getItem(2))
uDataDF= uDataDF.withColumn('Column D',splitDF.getItem(3))
错误信息:
AttributeError: 'DataFrame' object has no attribute 'getItem'
感谢任何建议。