apache-spark - 如何在火花中选择名称中带有引号的数据框中的列
问题描述
我尝试使用访问"accession" "database" "disease" "ec.code" "omics_type" "species"
列
fileDf.select("\"accession\"","\"database\"","\"connections\"")
但仍然出现错误
root
|-- "accession" "database" "disease" "ec.code" "omics_type" "species" "tissue" "citations.x" "coding.x" "ensembl.x" "go.x" "intact.x" "kegg.compound.x" "kegg.glycan.x" "kegg.pathway.x" "kegg.reaction.x" "metabolights.x" "ncbi.x" "pubchem.compound.x" "pubchem.substance.x" "reactome.x" "reanalysis.x" "rnacentral.x" "sgd.x" "sra.x" "uniprot.x" "ajs.connectivity.score" "citations.y" "coding.y" "ensembl.y" "go.y" "intact.y" "kegg.compound.y" "kegg.glycan.y" "kegg.pathway.y" "kegg.reaction.y" "metabolights.y" "ncbi.y" "pubchem.compound.y" "pubchem.substance.y" "reactome.y" "reanalysis.y" "rnacentral.y" "sgd.y" "sra.y" "uniprot.y" "connections": string (nullable = true)
Exception in thread "main" org.apache.spark.sql.AnalysisException: cannot resolve '`"accession"`' given input columns: ["accession" "database" "disease" "ec.code" "omics_type" "species" "tissue" "citations.x" "coding.x" "ensembl.x" "go.x" "intact.x" "kegg.compound.x" "kegg.glycan.x" "kegg.pathway.x" "kegg.reaction.x" "metabolights.x" "ncbi.x" "pubchem.compound.x" "pubchem.substance.x" "reactome.x" "reanalysis.x" "rnacentral.x" "sgd.x" "sra.x" "uniprot.x" "ajs.connectivity.score" "citations.y" "coding.y" "ensembl.y" "go.y" "intact.y" "kegg.compound.y" "kegg.glycan.y" "kegg.pathway.y" "kegg.reaction.y" "metabolights.y" "ncbi.y" "pubchem.compound.y" "pubchem.substance.y" "reactome.y" "reanalysis.y" "rnacentral.y" "sgd.y" "sra.y" "uniprot.y" "connections"];;
'Project ['"accession", '"database", '"connections"]
+- AnalysisBarrier
+- Relation["accession" "database" "disease" "ec.code" "omics_type" "species" "tissue" "citations.x" "coding.x" "ensembl.x" "go.x" "intact.x" "kegg.compound.x" "kegg.glycan.x" "kegg.pathway.x" "kegg.reaction.x" "metabolights.x" "ncbi.x" "pubchem.compound.x" "pubchem.substance.x" "reactome.x" "reanalysis.x" "rnacentral.x" "sgd.x" "sra.x" "uniprot.x" "ajs.connectivity.score" "citations.y" "coding.y" "ensembl.y" "go.y" "intact.y" "kegg.compound.y" "kegg.glycan.y" "kegg.pathway.y" "kegg.reaction.y" "metabolights.y" "ncbi.y" "pubchem.compound.y" "pubchem.substance.y" "reactome.y" "reanalysis.y" "rnacentral.y" "sgd.y" "sra.y" "uniprot.y" "connections"#10] csv
我应该如何在数据框中选择带有引号的列在火花中?
解决方案
推荐阅读
- javascript - 猫鼬`findById`不返回预期的文件
- c++ - C ++多索引列csv加载
- wordpress - 每月在 woocommerce 优惠券中重置每位用户的使用限制
- android - firestore getDocuments().size() != getDocumentChanges().size()
- kdb - 将时间戳四舍五入到最接近的 30 秒
- c++ - 消失的 SDL 矩形。如何使用新的附加形状更新窗口
- c# - 使用 CREATE_SUSPENDED 标志和 JobObject 从 CreateProcessAsUser 为 C# 进程设置 ProcessStartInfo
- reactjs - 如何将 BSC 代币发送到地址?
- ajax - 手动自定义 Ajax 加入购物车 一次性付款订阅
- typescript - 类型“{toggle():void}”上不存在属性“x”