pyspark - PySpark 文档的 DataFrames df、df2、df3 等在哪里定义?
解决方案
它们在_test()
方法中定义Class GroupedData(...)
from pyspark.sql import Row
df4 = sc.parallelize([Row(course="dotNET", year=2012, earnings=10000),
Row(course="Java", year=2012, earnings=20000),
Row(course="dotNET", year=2012, earnings=5000),
Row(course="dotNET", year=2013, earnings=48000),
Row(course="Java", year=2013, earnings=30000)]).toDF()
推荐阅读
- sql - Removing Null rows from a UNION Query
- php - Best practices when querying backend from script
- python - Convert Tensor List for Image to PIL Format
- javascript - Bootstrap-5 JavaScript import inside javascript
- antlr4 - ANTLR4 - 命名函数参数
- piranha-cms - How to set a CheckBoxField initially checked when adding a new block in manager in Piranha CMS
- javascript - Show input from the second select option
- firebase - Firebase Stripe Extension - adding custom claims to user
- c++ - Construct an std:array from a smaller std::array
- javascript - Is there a way I can call have the info on a table in a function or class that can be called multiple times in HTML?