Reviews
Now Reading
General Architecture for Text Engineering – GATE
0
Review

General Architecture for Text Engineering – GATE

Overview
Synopsis

GATE (General Architecture for Text Engineering) is a Java suite of tools used for all sorts of natural language processing tasks, including information extraction in many languages.

Category

Text Analytics Software

Website
Company

GATE

Rating
Our Rating
User Rating
Ease of use
7.7
Features & Functionality
7.7
Advanced Features
7.7
Integration
7.7
Customer Support
7.7
Performance
7.7
Training
Implementation
Renew & Recommend
Bottom Line

GATE (General Architecture for Text Engineering) is a Java suite of tools used for all sorts of natural language processing tasks, including information extraction in many languages.

7.7
Our Rating
0.0
User Rating
You have rated this

General Architecture for Text Engineering – GATE : GATE (General Architecture for Text Engineering) is a Java suite of tools used for all sorts of natural language processing tasks, including information extraction in many languages. The Text Analytics software was developed at the University of Sheffield beginning in 1995. GATE has grown over the years to include a desktop client for developers, a workflow-based web application, a Java library, an architecture and a process.GATE includes components for diverse language processing tasks, such as parsers, morphology, tagging, Information Retrieval tools, Information Extraction components for various languages, and many others. GATE Developer and Embedded are supplied with an Information Extraction system (ANNIE) .

General Architecture for Text Engineering – GATE

GATE Components

GATE Components

ANNIE is often used to create RDF or OWL (metadata) for unstructured content. ANNIE (A Nearly-New Information Extraction System) is a set of modules comprising a tokenizer, a gazetteer, a sentence splitter, a part of speech tagger, a named entities transducer and a coreference tagger. ANNIE can be used as-is to provide basic information extraction functionality, or provide a starting point for more specific tasks.

General Architecture for Text Engineering

General Architecture for Text Engineering

The core functions of GATE include modelling and persistence of specialised data structures, measurement, evaluation, benchmarking ,visualisation and editing of annotations, ontologies, parse trees, a finite state transduction language for rapid prototyping and efficient implementation of shallow analysis methods (JAPE), extraction of training instances for machine learning and pluggable machine learning implementations for Weka, YALE, SVM Lite.

General Architecture for Text Engineering

General Architecture for Text Engineering

Languages currently handled in GATE include English, Spanish, Chinese, Arabic, Bulgarian, French, German, Hindi, Italian, Cebuano, Romanian, Russian. Plugins are included for machine learning with Weka, RASP, MAXENT, SVM Light, as well as a LIBSVM integration and an in-house perceptron implementation, for managing ontologies like WordNet, for querying search engines like Google or Yahoo, for part of speech tagging with Brill or TreeTagger, and many more. Many external plugins are also available, for handling e.g. tweets. GATE accepts input in various formats, such as TXT, HTML, XML, Doc, PDF documents, and Java Serial, PostgreSQL, Lucene, Oracle Databases with help of RDBMS storage over JDBC.

General Architecture for Text Engineering

General Architecture for Text Engineering

GATE Family

GATE Developer is an integrated development environment for language processing components bundled with the most widely used Information Extraction system and a comprehensive set of other plugins. GATE Embedded is an object library optimised for inclusion in diverse applications giving access to all the services used by GATE developer. GATE Teamware is a collaborative annotation environment for high volume factory-style semantic annotation projects built around a workflow engine and the GATE cloud backend web services. GATE Mímir (Multi-paradigm Information Management Index and Repository) is a massively scaleable multiparadigm index supporting Ontotext KIM and built on Ontotext’s semantic repository family. GATE Wiki is a Controllable Wiki and CMS with collaborative and asynchronous off-line editing, hosting controlled languages for round-trip ontology engineering. GATE Cloud is a parallel distributed processing engine that combines GATE embedded with a heavily optimised service infrastructure running on supercomputer hardware.

GATE

 

Filter reviews
User Ratings





User Company size



User role





User industry





Ease of use
Features & Functionality
Advanced Features
Integration
Customer Support
Performance
Training
Implementation
Renew & Recommend

What's your reaction?
Love It
75%
Very Good
0%
INTERESTED
8%
COOL
0%
NOT BAD
8%
WHAT !
8%
HATE IT
0%