问题描述
专家您好,
要求是从Azure Blob中提取数据并在ADwaas中加载(Azure数据仓库)
我已经创建了Dataset和Pipeline,一切似乎都工作正常,并且可以将数据从Blob很好地加载到Azure Dwaas Table.
问题:
如果Azure存储帐户中不存在Blob->容器,然后管道失败,并显示类似
的错误找不到Blob"
如何克服这个问题,仅当存在blob时,我才需要启动管道执行(要引用输入数据集的blob),以使管道不会因源数据不可用而失败(blob)
Hello Experts,
Requirement is to Pull the data from Azure Blob and Load in ADwaas (Azure Data ware house)
I have created the Dataset and Pipeline ,All seems to be working fine and load data from Blob to Azure Dwaas Table very well.
Issue:
If the Blob is not present in the Azure Storage Account -> Container ,then Pipeline fails with error like
"Blob not found"
How to overcome this, I need to start the Execution of Pipeline only if blob exist(blob which Input Dataset is Refering),so that Pipeline does not fail due to unavailability of Source data(blob)
推荐答案
我猜您正在使用具有固定Blob名称的输入数据集.
I guess that you are using an input dataset with a fixed blob name.
如果您在容器而不是Blob上运行活动,则您的活动可能会通过. (对我来说,如果您指定了不存在的Blob名称,这似乎是正常行为.)
Your activity will probably pass if you would run the activity on a container instead of a blob. (To me it looks like a normal behaviour if you specify a blob name that does not exist).
或者,您可以编写一个自定义活动来处理文件.在自定义活动中,您可以检查blob存在.这是一些例子
Alternatively you can write a custom activity which processes the file. In the custom activity you can check for blob existance. Here are some examples
- https://docs.microsoft.com/zh-cn/azure/data-factory/data-factory-use-custom-activities
- https://github.com/Azure/Azure -DataFactory/tree/master/Samples
欢呼
汤姆
这篇关于ADF-检查Azure存储帐户中Blob的存在的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!