Error message:

2021-08-06 15:18:30 : Job aborted due to stage failure: Task 0 in stage 0.0 failed 1 times, most recent failure: Lost task 0.0 in stage 0.0 (TID 0, localhost, executor driver): java.sql.BatchUpdateException: Incorrect string value: '\xF0\x9F\x93\xB1\xE7\x8F...' for column 'abc993' at row 7773

at com.gbase.jdbc.PreparedStatement.executeBatchedInserts(PreparedStatement.java:1825)

at com.gbase.jdbc.PreparedStatement.executeBatch(PreparedStatement.java:1450)

at org.apache.spark.sql.execution.datasources.jdbc.CntJdbcUtilsEnhance$.savePartitionWithCnt(CntJdbcUtilsEnhance.scala:117)

at org.apache.spark.sql.execution.datasources.jdbc.CntJdbcUtilsEnhance$$anonfun$saveTableWithCnt$1.apply(CntJdbcUtilsEnhance.scala:44)

at org.apache.spark.sql.execution.datasources.jdbc.CntJdbcUtilsEnhance$$anonfun$saveTableWithCnt$1.apply(CntJdbcUtilsEnhance.scala:44)

at org.apache.spark.rdd.RDD$$anonfun$foreachPartition$1$$anonfun$apply$29.apply(RDD.scala:935)

at org.apache.spark.rdd.RDD$$anonfun$foreachPartition$1$$anonfun$apply$29.apply(RDD.scala:935)

at org.apache.spark.SparkContext$$anonfun$runJob$5.apply(SparkContext.scala:2074)

at org.apache.spark.SparkContext$$anonfun$runJob$5.apply(SparkContext.scala:2074)

at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:87)

at org.apache.spark.scheduler.Task.run(Task.scala:109)

at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:345)

at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)

at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)

at java.lang.Thread.run(Thread.java:748). Driver stacktrace:
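
The escaped bytes quoted in the error message can be decoded to see which character is being rejected. A minimal Python sketch: only the four leading bytes are taken from the message above (the rest of the value is truncated in the log), and the variable names are illustrative.

```python
# First four bytes quoted in the error message; the remainder is truncated in the log.
raw = b"\xF0\x9F\x93\xB1"

ch = raw.decode("utf-8")            # decode the byte sequence as UTF-8
code_point = ord(ch)                # Unicode code point of the single decoded character
utf8_len = len(ch.encode("utf-8"))  # number of bytes the character needs in UTF-8

print(f"U+{code_point:04X}")        # U+1F4F1
print(utf8_len)                     # 4
print(code_point > 0xFFFF)          # True: outside the Basic Multilingual Plane
```

The character is U+1F4F1 (a mobile phone emoji), which takes four bytes in UTF-8. Character sets that store at most three bytes per character, such as MySQL's `utf8`, reject such values with an "Incorrect string value" error. Whether the GBase target column here is configured that way is an assumption, not something the log confirms.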

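Solution: this is a character set problem. The data in the Oracle source table may contain emoji or other special characters that the target column cannot store. Apply Oracle's character set conversion function CONVERT to the failing column abc993 in the extract query, then alias the result back to the original column name so the downstream column mapping stays unchanged. Note that CONVERT substitutes a replacement character for anything the destination character set cannot represent, so the emoji themselves are not preserved.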

CONVERT(ABC993, 'AL24UTFFSS') abc993
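
When several columns need the same treatment, the select list can be generated rather than written by hand. The following is a hedged sketch, not part of the original fix: the helper `build_select`, the table name `SRC_TABLE`, and the column `ID` are hypothetical; only `ABC993` and the `AL24UTFFSS` character set name come from the text above.

```python
def build_select(table, columns, convert_columns, charset="AL24UTFFSS"):
    """Build an Oracle SELECT that wraps selected columns in CONVERT(...).

    Each wrapped column is aliased back to its own name, so the result set
    keeps the same column names as the source table.
    """
    wrap = {c.upper() for c in convert_columns}  # case-insensitive column match
    parts = []
    for col in columns:
        if col.upper() in wrap:
            parts.append(f"CONVERT({col}, '{charset}') {col}")
        else:
            parts.append(col)
    return f"SELECT {', '.join(parts)} FROM {table}"


# Hypothetical table and ID column; ABC993 is the column named in the error.
query = build_select("SRC_TABLE", ["ID", "ABC993"], ["abc993"])
print(query)
# SELECT ID, CONVERT(ABC993, 'AL24UTFFSS') ABC993 FROM SRC_TABLE
```

The generated statement can then be used as the source query of the extract job in place of a plain `SELECT *`.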
