Why do I get the error "Unable to find encoder for type stored in a Dataset" when encoding JSON with Scala case classes?

Disclaimer: this page is a translation of a popular StackOverflow question and its answers, provided under the CC BY-SA 4.0 license. If you reuse or share it, you must do so under the same license and attribute it to the original authors (not me), linking to the original: http://stackoverflow.com/questions/34715611/

Date: 2020-10-22 07:56:13  Source: igfitidea

Why is the error "Unable to find encoder for type stored in a Dataset" when encoding JSON using case classes?

scala, apache-spark, apache-spark-dataset, apache-spark-encoders

Asked by Milad Khajavi

I've written a Spark job:

import org.apache.spark.{SparkConf, SparkContext}

object SimpleApp {
  def main(args: Array[String]) {
    val conf = new SparkConf().setAppName("Simple Application").setMaster("local")
    val sc = new SparkContext(conf)
    val ctx = new org.apache.spark.sql.SQLContext(sc)
    import ctx.implicits._

    case class Person(age: Long, city: String, id: String, lname: String, name: String, sex: String)
    case class Person2(name: String, age: Long, city: String)

    val persons = ctx.read.json("/tmp/persons.json").as[Person]
    persons.printSchema()
  }
}

When I run the main function in the IDE, two errors occur:

Error:(15, 67) Unable to find encoder for type stored in a Dataset.  Primitive types (Int, String, etc) and Product types (case classes) are supported by importing sqlContext.implicits._  Support for serializing other types will be added in future releases.
    val persons = ctx.read.json("/tmp/persons.json").as[Person]
                                                                  ^

Error:(15, 67) not enough arguments for method as: (implicit evidence: org.apache.spark.sql.Encoder[Person])org.apache.spark.sql.Dataset[Person].
Unspecified value parameter evidence.
    val persons = ctx.read.json("/tmp/persons.json").as[Person]
                                                                  ^

But in the Spark shell I can run this job without any error. What is the problem?

Answered by Developer

The error message says that the Encoder is not able to handle the Person case class.

Error:(15, 67) Unable to find encoder for type stored in a Dataset.  Primitive types (Int, String, etc) and Product types (case classes) are supported by importing sqlContext.implicits._  Support for serializing other types will be added in future releases.

Move the declaration of the case class outside the scope of SimpleApp.

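For reference, a minimal sketch of that fix, reusing the code from the question (same class names, imports, and file path; only the case classes have moved to the top level, so the compiler can derive an Encoder[Person]):

import org.apache.spark.{SparkConf, SparkContext}

case class Person(age: Long, city: String, id: String, lname: String, name: String, sex: String)
case class Person2(name: String, age: Long, city: String)

object SimpleApp {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("Simple Application").setMaster("local")
    val sc = new SparkContext(conf)
    val ctx = new org.apache.spark.sql.SQLContext(sc)
    import ctx.implicits._ // supplies the implicit Encoder[Person] that .as[Person] needs

    val persons = ctx.read.json("/tmp/persons.json").as[Person]
    persons.printSchema()
  }
}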

Answered by Paul Leclercq

You get the same error if you add both sqlContext.implicits._ and spark.implicits._ in SimpleApp (the order doesn't matter).

Removing one or the other is the solution:

import org.apache.spark.sql.SparkSession

val spark = SparkSession
  .builder()
  .getOrCreate()

val sqlContext = spark.sqlContext
import sqlContext.implicits._ // sqlContext OR spark implicits
//import spark.implicits._    // sqlContext OR spark implicits

case class Person(age: Long, city: String)
val persons = spark.read.json("/tmp/persons.json").as[Person]

Tested with Spark 2.1.0

Interestingly, if you import the same object's implicits twice, there is no problem.
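
For illustration only (a hypothetical, self-contained snippet, not from the original answer): importing the same implicits object twice compiles, and the encoder still resolves.

import org.apache.spark.sql.SparkSession

object DoubleImportDemo {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("DoubleImportDemo").master("local").getOrCreate()
    import spark.implicits._
    import spark.implicits._ // duplicate import of the same object: no conflict

    Seq(1, 2, 3).toDS().show() // Encoder[Int] resolves fine
  }
}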

Answered by Santhoshm

@Milad Khajavi

Define the Person case classes outside object SimpleApp. Also, add import sqlContext.implicits._ inside the main() function.
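
A combined sketch of both suggestions, written in the SparkSession style of the previous answer (hypothetical layout; the class name and file path are taken from the question):

import org.apache.spark.sql.SparkSession

case class Person(age: Long, city: String, id: String, lname: String, name: String, sex: String)

object SimpleApp {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("Simple Application").master("local").getOrCreate()
    val sqlContext = spark.sqlContext
    import sqlContext.implicits._ // imported inside main(), as this answer suggests

    val persons = spark.read.json("/tmp/persons.json").as[Person]
    persons.printSchema()
  }
}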