Problem description
How can we create an Azure Data Factory pipeline with Cosmos DB (Graph API) as the data sink? The data source is also Cosmos DB, using the Document DB API.
Recommended answer
One option is to simply keep using the Document API for the graph-enabled Cosmos DB sink. If you transform your documents into GraphSON format and write them to the destination as regular documents, they will automatically be usable as vertices and edges in later graph traversals.
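As a rough sketch of that transformation step, the helpers below reshape plain records into the kind of GraphSON-style documents a graph-enabled collection appears to store. The field names (`_value`, `_isEdge`, `_vertexId`, `_vertexLabel`, `_sink`, `_sinkLabel`) are an assumption based on what Gremlin-created documents commonly look like when read back through the Document API, not a documented contract, so inspect a vertex and an edge created via Gremlin in your own collection before relying on them. The function names `to_vertex_doc` and `to_edge_doc` are hypothetical, and partition key handling is left out.

```python
import json
import uuid


def to_vertex_doc(vertex_id, label, properties):
    """Build a vertex document (assumed layout, verify against your collection)."""
    doc = {"id": vertex_id, "label": label}
    for key, value in properties.items():
        # Assumption: each vertex property is stored as a list of
        # {"id": ..., "_value": ...} entries.
        doc[key] = [{"id": str(uuid.uuid4()), "_value": value}]
    return doc


def to_edge_doc(edge_id, label, source, sink, properties=None):
    """Build an edge document linking two vertex documents (assumed layout)."""
    doc = {
        "id": edge_id,
        "label": label,
        "_isEdge": True,                  # assumed marker for edge documents
        "_vertexId": source["id"],        # outgoing (source) vertex
        "_vertexLabel": source["label"],
        "_sink": sink["id"],              # incoming (target) vertex
        "_sinkLabel": sink["label"],
    }
    # Assumption: edge properties are stored as plain key/value pairs.
    doc.update(properties or {})
    return doc


if __name__ == "__main__":
    alice = to_vertex_doc("alice", "person", {"age": 30})
    bob = to_vertex_doc("bob", "person", {"age": 41})
    knows = to_edge_doc("alice-knows-bob", "knows", alice, bob, {"since": 2017})
    # These dictionaries would then be written as ordinary JSON documents
    # through whichever Document API writer the pipeline uses.
    print(json.dumps(knows, indent=2))
```

The output of these helpers is ordinary JSON, so the same documents can be produced by any transformation step that runs ahead of the sink in the pipeline.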
The ability to use both the Document SQL and Gremlin APIs against the same collection is, IMO, one of the most exciting and powerful features of Cosmos DB (and the team plans to support more APIs interacting with the same dataset in the future).
Not only is this possible, but I've personally observed significantly higher throughput when importing large datasets into a graph-enabled Cosmos DB collection through the Document API instead of Gremlin. I plan to publish a blog post describing the process in more detail in the near future.