How to select columns with quotes in their names from a DataFrame in Spark

Problem description

I tried to access the columns "accession", "database", "disease", "ec.code", "omics_type", "species" using:

fileDf.select("\"accession\"","\"database\"","\"connections\"")

but I still get an error:

root
 |-- "accession"    "database"  "disease"   "ec.code"   "omics_type"    "species"   "tissue"    "citations.x"   "coding.x"  "ensembl.x" "go.x"  "intact.x"  "kegg.compound.x"   "kegg.glycan.x" "kegg.pathway.x"    "kegg.reaction.x"   "metabolights.x"    "ncbi.x"    "pubchem.compound.x"    "pubchem.substance.x"   "reactome.x"    "reanalysis.x"  "rnacentral.x"  "sgd.x" "sra.x" "uniprot.x" "ajs.connectivity.score"    "citations.y"   "coding.y"  "ensembl.y" "go.y"  "intact.y"  "kegg.compound.y"   "kegg.glycan.y" "kegg.pathway.y"    "kegg.reaction.y"   "metabolights.y"    "ncbi.y"    "pubchem.compound.y"    "pubchem.substance.y"   "reactome.y"    "reanalysis.y"  "rnacentral.y"  "sgd.y" "sra.y" "uniprot.y" "connections": string (nullable = true)

Exception in thread "main" org.apache.spark.sql.AnalysisException: cannot resolve '`"accession"`' given input columns: ["accession" "database"  "disease"   "ec.code"   "omics_type"    "species"   "tissue"    "citations.x"   "coding.x"  "ensembl.x" "go.x"  "intact.x"  "kegg.compound.x"   "kegg.glycan.x" "kegg.pathway.x"    "kegg.reaction.x"   "metabolights.x"    "ncbi.x"    "pubchem.compound.x"    "pubchem.substance.x"   "reactome.x"    "reanalysis.x"  "rnacentral.x"  "sgd.x" "sra.x" "uniprot.x" "ajs.connectivity.score"    "citations.y"   "coding.y"  "ensembl.y" "go.y"  "intact.y"  "kegg.compound.y"   "kegg.glycan.y" "kegg.pathway.y"    "kegg.reaction.y"   "metabolights.y"    "ncbi.y"    "pubchem.compound.y"    "pubchem.substance.y"   "reactome.y"    "reanalysis.y"  "rnacentral.y"  "sgd.y" "sra.y" "uniprot.y" "connections"];;
'Project ['"accession", '"database", '"connections"]
+- AnalysisBarrier
      +- Relation["accession"   "database"  "disease"   "ec.code"   "omics_type"    "species"   "tissue"    "citations.x"   "coding.x"  "ensembl.x" "go.x"  "intact.x"  "kegg.compound.x"   "kegg.glycan.x" "kegg.pathway.x"    "kegg.reaction.x"   "metabolights.x"    "ncbi.x"    "pubchem.compound.x"    "pubchem.substance.x"   "reactome.x"    "reanalysis.x"  "rnacentral.x"  "sgd.x" "sra.x" "uniprot.x" "ajs.connectivity.score"    "citations.y"   "coding.y"  "ensembl.y" "go.y"  "intact.y"  "kegg.compound.y"   "kegg.glycan.y" "kegg.pathway.y"    "kegg.reaction.y"   "metabolights.y"    "ncbi.y"    "pubchem.compound.y"    "pubchem.substance.y"   "reactome.y"    "reanalysis.y"  "rnacentral.y"  "sgd.y" "sra.y" "uniprot.y" "connections"#10] csv

How should I select columns with quotes in their names from a DataFrame in Spark?

Tags: apache-spark, apache-spark-sql, apache-spark-dataset

Solution
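
The printed schema lists a single string column whose name is the entire header line, quotes and whitespace included. That suggests the quoting is not the real problem: the file was probably read with the default comma separator, so the header was never split into fields. The sketch below assumes the file is tab-separated (the spacing in the schema hints at this, but it is not confirmed) and uses a hypothetical path `data.tsv`. The `backtick` helper is also an illustration only, not part of Spark. With a matching separator, Spark's CSV reader treats `"` as the quote character by default, so the column names should come out without the surrounding quotes.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.col

object QuotedColumns {
  // Hypothetical helper: wrap a name in backticks so that a dot in it
  // (as in "ec.code") is read as part of the column name, not as field access.
  def backtick(name: String): String = s"`$name`"

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("quoted-columns")
      .master("local[*]")
      .getOrCreate()

    // Assumption: the input is tab-separated. "data.tsv" is a placeholder path.
    val path = args.headOption.getOrElse("data.tsv")

    val fileDf = spark.read
      .option("header", "true") // first line holds the column names
      .option("sep", "\t")      // split fields on tabs instead of commas
      .csv(path)

    // Should now show one field per column instead of one long name.
    fileDf.printSchema()

    // Plain names can be selected directly.
    fileDf.select("accession", "database", "connections").show(5)

    // Names containing a dot need backticks.
    fileDf.select(col(backtick("ec.code")), col(backtick("citations.x"))).show(5)

    spark.stop()
  }
}
```

If the separator turns out to be something other than a tab, only the `sep` option needs to change. Checking `printSchema()` first is the quickest way to confirm that the header was split as expected before selecting anything.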

