Apache UIMA DUCC v2.1.0發布,分布式 UIMA 集群計算服務

jopen 8年前發布 | 9K 次閱讀 Apache UIMA DUCC

UIMA (Unstructured Information Management applications) 是一個軟件系統,用來分析大量的非結構化信息從而發掘中對最終用戶有用的知識點,一個最典型的 UIM 應用就是從文本文件中提取有用信息,例如人員、地址和組織等相關信息。 DUCC 是為分布式 UIMA 集群計算服務的,是集群管理系統,提供工具鏈,管理和調度設施。

更新日志

完整日志:here

此版本中的主要變化是:

  • Ubuntu and RHEL 7 support
  • cgroup enhancements
    • uses standard cgroups organization
    • supports cgroup swappiness setting, restricting any swapping if desired
  • DUCC state and history storage moved from flat files to Cassandra DB, reducing storage size 5x
  • Ships with the latest UIMA-AS v2.8.1
  • Ships with recent ActiveMQ v5.13.2
  • DUCC's UIMA-AS services support failover and ssl connectors
  • Many DUCC webpage improvements
  • Clear user display of DUCC classes and relation to machines
  • Robust handling of dynamic changes to DUCC class and nodepool definitions
  • Full support of nodepools with different quantum
  • DUCC broker access restricted to user ducc
  • Eliminate need for user home directories located on a shared filesystem
  • Built-in Job error handler programmable per job
  • Migration utility for DUCC updates
  • Change to vary-off behavior to facilitate cluster management
  • Horizontal stacking of services instance allocations
  • java-viaducc improvements including separation of stdout from stderr respoonses
  • An alert banner is displayed on ducc-mon pages if daemons are down
  • Promoted DUCC from sandbox to the regular Apache project in the SVN

下載

 

本站原創,轉載時保留以下信息:
本文轉自:深度開源(open-open.com)
原文地址:http://www.baiduhome.net/news/view/11ad7ca7
 

 本文由用戶 jopen 自行上傳分享,僅供網友學習交流。所有權歸原作者,若您的權利被侵害,請聯系管理員。
 轉載本站原創文章,請注明出處,并保留原始鏈接、圖片水印。
 本站是一個以用戶分享為主的開源技術平臺,歡迎各類分享!