Is there an equivalent of Oracle's Substr function in PySpark?

Problem description

The following statement in PySpark raises an error:

    .select ('Substr(trim(Name), -3))')

The input data is RAMS 123 and the expected output is 123.

Tags: pyspark-sql

Solution


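PySpark columns have a substr(startPos, length) method. A negative startPos counts from the end of the string, so substr(-3, 3) returns the last three characters.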

myDf = sqlContext.createDataFrame([('Alex',"30"),("Hitesh R","34"),("David B","35")],("Name","Age"))

myDf.show()

+--------+---+
|    Name|Age|
+--------+---+
|    Alex| 30|
|Hitesh R| 34|
| David B| 35|
+--------+---+

myDf.select(myDf['Name'].substr(-3,3)).show()

+----------------------+                                                        
|substring(Name, -3, 3)|
+----------------------+
|                   lex|
|                   h R|
|                   d B|
+----------------------+

