问题描述
我正在尝试通过 API 将 bigquery 数据导出到谷歌云存储桶.我从这里改编了一个代码片段https://cloud.google.com/bigquery/docs/exporting-data
I am trying to export bigquery data to google cloud storage bucket via the API. I adapted a code snippet from herehttps://cloud.google.com/bigquery/docs/exporting-data
Job job = table.extract(format, gcsUrl);
// Wait for the job to complete
try {
Job completedJob = job.waitFor(WaitForOption.checkEvery(1,
TimeUnit.SECONDS),
WaitForOption.timeout(3, TimeUnit.MINUTES));
if (completedJob != null && completedJob.getStatus().getError() == null) {
// Job completed successfully
} else {
// Handle error case
System.out.println(completedJob.getStatus().getError());
}
} catch (InterruptedException | TimeoutException e) {
// Handle interrupted wait
}
我已经用JSON"交换了格式,因为我的数据是嵌套的,不能用gs://mybucket/export_*.json"导出到 CSV 和 gcsUrl.但是错误消息告诉我以下问题:
I have exchanged format with "JSON" since my data is nested and can't be exported to CSV and the gcsUrl with "gs://mybucket/export_*.json". But the error messages tells me the following problem:
transfer not working BigQueryError{reason=invalid, location=null, message=Operation cannot be performed on a nested schema. Field: totals}
有什么建议吗?JSON 应该能够处理嵌套格式...
Any advice what to do? JSON should be able to handle a nested format...
推荐答案
参考destinationFormat 选项,您应该为 format
变量设置 "NEWLINE_DELIMITED_JSON"
以便导出为 JSON.
Referring to the destinationFormat option, you should set "NEWLINE_DELIMITED_JSON"
for the format
variable in order to export as JSON.
这篇关于将嵌套的 BigQuery 数据导出到云存储的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!