scala - Multi-project sbt assembly problem
Problem description
I am trying to create a project with two main classes, SparkConsumer and KafkaProducer. To do that I introduced a multi-project structure in the sbt build. The consumer and producer modules are separate projects, the core project holds utilities used by both the producer and the consumer, and root is the main project. Common settings and library dependencies are defined as well. However, for some reason the project does not compile: all of the sbt-assembly related settings are marked in red, even though plugins.sbt, which defines the sbt-assembly plugin, is located in the root project.
What could be the solution to this kind of problem?
The project structure looks like this:
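(Inferred from the build definition below; the exact source paths and the plugins.sbt contents are assumptions.)

.
├── build.sbt
├── project/
│   └── plugins.sbt        // e.g. addSbtPlugin("com.eed3si9n" % "sbt-assembly" % "0.14.6")
├── core/
├── consumer/              // consumer.SparkConsumer
└── producer/              // producer.KafkaCheckinsProducer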
Here is the build.sbt file:
lazy val overrides = Seq(
  "com.fasterxml.jackson.core" % "jackson-core" % "2.9.5",
  "com.fasterxml.jackson.core" % "jackson-databind" % "2.9.5",
  "com.fasterxml.jackson.module" % "jackson-module-scala_2.11" % "2.9.5"
)

lazy val commonSettings = Seq(
  name := "Demo",
  version := "0.1",
  scalaVersion := "2.11.8",
  resolvers += "Spark Packages Repo" at "http://dl.bintray.com/spark-packages/maven",
  dependencyOverrides += overrides
)

lazy val assemblySettings = Seq(
  assemblyMergeStrategy in assembly := {
    case PathList("org", "aopalliance", xs @ _*) => MergeStrategy.last
    case PathList("javax", "inject", xs @ _*) => MergeStrategy.last
    case PathList("javax", "servlet", xs @ _*) => MergeStrategy.last
    case PathList("javax", "activation", xs @ _*) => MergeStrategy.last
    case PathList("org", "apache", xs @ _*) => MergeStrategy.last
    case PathList("com", "google", xs @ _*) => MergeStrategy.last
    case PathList("com", "esotericsoftware", xs @ _*) => MergeStrategy.last
    case PathList("com", "codahale", xs @ _*) => MergeStrategy.last
    case PathList("com", "yammer", xs @ _*) => MergeStrategy.last
    case PathList("org", "slf4j", xs @ _*) => MergeStrategy.last
    case PathList("org", "neo4j", xs @ _*) => MergeStrategy.last
    case PathList("com", "typesafe", xs @ _*) => MergeStrategy.last
    case PathList("net", "jpountz", xs @ _*) => MergeStrategy.last
    case PathList("META-INF", xs @ _*) => MergeStrategy.discard
    case "about.html" => MergeStrategy.rename
    case "META-INF/ECLIPSEF.RSA" => MergeStrategy.last
    case "META-INF/mailcap" => MergeStrategy.last
    case "META-INF/mimetypes.default" => MergeStrategy.last
    case "plugin.properties" => MergeStrategy.last
    case "log4j.properties" => MergeStrategy.last
    case x =>
      val oldStrategy = (assemblyMergeStrategy in assembly).value
      oldStrategy(x)
  }
)

val sparkVersion = "2.2.0"

lazy val commonDependencies = Seq(
  "org.apache.kafka" %% "kafka" % "1.1.0",
  "org.apache.spark" %% "spark-core" % sparkVersion % "provided",
  "org.apache.spark" %% "spark-sql" % sparkVersion,
  "org.apache.spark" %% "spark-streaming" % sparkVersion,
  "org.apache.spark" %% "spark-streaming-kafka-0-10" % sparkVersion,
  "neo4j-contrib" % "neo4j-spark-connector" % "2.1.0-M4",
  "com.typesafe" % "config" % "1.3.0",
  "org.neo4j.driver" % "neo4j-java-driver" % "1.5.1",
  "com.opencsv" % "opencsv" % "4.1",
  "com.databricks" %% "spark-csv" % "1.5.0",
  "com.github.tototoshi" %% "scala-csv" % "1.3.5",
  "org.elasticsearch" %% "elasticsearch-spark-20" % "6.2.4"
)

lazy val root = (project in file("."))
  .settings(
    commonSettings,
    assemblySettings,
    libraryDependencies ++= commonDependencies,
    assemblyJarName in assembly := "demo_root.jar"
  )
  .aggregate(core, consumer, producer)

lazy val core = project
  .settings(
    commonSettings,
    assemblySettings,
    libraryDependencies ++= commonDependencies
  )

lazy val consumer = project
  .settings(
    commonSettings,
    assemblySettings,
    libraryDependencies ++= commonDependencies,
    mainClass in assembly := Some("consumer.SparkConsumer"),
    assemblyJarName in assembly := "demo_consumer.jar"
  )
  .dependsOn(core)

lazy val producer = project
  .settings(
    commonSettings,
    assemblySettings,
    libraryDependencies ++= commonDependencies,
    mainClass in assembly := Some("producer.KafkaCheckinsProducer"),
    assemblyJarName in assembly := "demo_producer.jar"
  )
  .dependsOn(core)
UPDATE: stack trace
[error] (producer / update) java.lang.IllegalArgumentException: a module is not authorized to depend on itself: demo#demo_2.11;0.1
[error] (consumer / update) java.lang.IllegalArgumentException: a module is not authorized to depend on itself: demo#demo_2.11;0.1
[error] (core / Compile / compileIncremental) Compilation failed
[error] (update) sbt.librarymanagement.ResolveException: unresolved dependency: org.apache.spark#spark-sql_2.12;2.2.0: not found
[error] unresolved dependency: org.apache.spark#spark-streaming_2.12;2.2.0: not found
[error] unresolved dependency: org.apache.spark#spark-streaming-kafka-0-10_2.12;2.2.0: not found
[error] unresolved dependency: com.databricks#spark-csv_2.12;1.5.0: not found
[error] unresolved dependency: org.elasticsearch#elasticsearch-spark-20_2.12;6.2.4: not found
[error] unresolved dependency: org.apache.spark#spark-core_2.12;2.2.0: not found
Solution
Unresolved dependency: org.apache.spark#spark-sql_2.12;2.2.0
Spark 2.2.0 requires Scala 2.11, see https://spark.apache.org/docs/2.2.0/. For some reason the scalaVersion from your commonSettings is not being applied; you may need to set a global scalaVersion to work around it.
Spark runs on Java 8+, Python 2.7+/3.4+ and R 3.1+. For the Scala API, Spark 2.2.0 uses Scala 2.11. You will need to use a compatible Scala version (2.11.x).
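A minimal sketch of what that could look like (the patch version is taken from the question's commonSettings) is a single top-level line in build.sbt:

// build.sbt – forces the same Scala version onto root, core, consumer and producer
scalaVersion in ThisBuild := "2.11.8"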
spark-sql and spark-streaming should also be marked as "provided".
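In commonDependencies that would look roughly like this (only the % "provided" suffix changes; spark-core already carries it):

"org.apache.spark" %% "spark-core" % sparkVersion % "provided",
"org.apache.spark" %% "spark-sql" % sparkVersion % "provided",
"org.apache.spark" %% "spark-streaming" % sparkVersion % "provided",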