
Problem description

I use UIMA in a process that analyzes text and extracts information from it. The pipeline fails when 6 processes run simultaneously.

I think I need a scale-out tool such as UIMA-DUCC or UIMA-AS, but it is not clear to me which one.

When should each one be used? What are the differences between them?

Recommended answer

UIMA-AS provides mechanisms for deploying a UIMA pipeline. Essentially, UIMA-AS lets users put a queue in front of a UIMA component so that the component can run in a different thread or in a different process. UIMA-AS handles the threading and the interprocess transport of CASes. Other than some simple bash scripts, UIMA-AS does not provide life-cycle management for user processes.
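The "queue in front of a component" idea can be illustrated with a minimal sketch that uses only the Python standard library. It is not the UIMA-AS API: `annotate`, `serve`, and `run_service` are hypothetical names, the component is a stand-in that only counts tokens, and in-memory queues take the place of the real transport that moves CASes between processes.

```python
import queue
import threading


def annotate(text):
    # Hypothetical stand-in for a UIMA component: it only counts tokens.
    return {"text": text, "token_count": len(text.split())}


def serve(requests, replies):
    # Worker loop: take a document off the input queue, process it, reply.
    while True:
        item = requests.get()
        if item is None:  # sentinel value: shut this worker down
            break
        doc_id, text = item
        replies.put((doc_id, annotate(text)))


def run_service(docs, workers=2):
    # The caller only talks to the queues; it never calls the component directly.
    requests, replies = queue.Queue(), queue.Queue()
    threads = [
        threading.Thread(target=serve, args=(requests, replies))
        for _ in range(workers)
    ]
    for t in threads:
        t.start()
    for doc_id, text in enumerate(docs):
        requests.put((doc_id, text))
    for _ in threads:
        requests.put(None)  # one sentinel per worker
    for t in threads:
        t.join()
    results = {}
    while not replies.empty():  # safe here: all workers have finished
        doc_id, result = replies.get()
        results[doc_id] = result
    return results


results = run_service(
    ["UIMA analyzes text", "queues decouple callers from components"]
)
```

Because the caller and the component share nothing but the queues, the component could be moved to another thread or, with a network transport in place of `queue.Queue`, to another process without changing the caller.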

DUCC is a cluster controller that, among other things, provides life-cycle management for UIMA-AS services. DUCC also provides a mechanism for scaling out a UIMA pipeline across multiple threads and multiple processes and for feeding work to the pipeline instances; this is called a DUCC job. DUCC jobs are built from core UIMA components, so no knowledge of UIMA-AS is required.
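The scale-out model described for a DUCC job (many pipeline instances pulling work from one source) can be sketched as below. This is a conceptual illustration under loud assumptions, not DUCC itself: `make_pipeline` and `run_job` are hypothetical names, the "processes" are simulated with threads so that the example stays self-contained, and the pipeline merely upper-cases its input.

```python
import queue
import threading


def make_pipeline(instance_name):
    # Hypothetical pipeline instance; a real one would hold UIMA components.
    def process(work_item):
        return (work_item, instance_name, work_item.upper())
    return process


def run_job(work_items, processes=2, threads_per_process=3):
    # All work is queued up front; pipeline instances pull items until none remain.
    pending = queue.Queue()
    for item in work_items:
        pending.put(item)
    results = []
    lock = threading.Lock()

    def worker(name):
        pipeline = make_pipeline(name)  # each instance owns its own pipeline
        while True:
            try:
                item = pending.get_nowait()
            except queue.Empty:
                return  # no work left for this instance
            output = pipeline(item)
            with lock:
                results.append(output)

    # Simulated layout: processes x threads_per_process pipeline instances.
    workers = [
        threading.Thread(target=worker, args=(f"p{p}-t{t}",))
        for p in range(processes)
        for t in range(threads_per_process)
    ]
    for w in workers:
        w.start()
    for w in workers:
        w.join()
    return results


items = [f"doc-{i}" for i in range(10)]
job_results = run_job(items)
```

Each work item is handled exactly once, by whichever instance is free first, so adding instances increases throughput without changing the pipeline code. In the answer's terms, the user supplies only the pipeline and the work source, and the cluster controller takes care of starting and feeding the instances.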

