Reviews
Now Reading
LPU
0
Review

LPU

Overview
Synopsis

LPU (which stands for Learning from Positive and Unlabeled data) is a text learning or classification system that learns from a set of positive documents and a set of unlabeled documents (without labeled negative documents).

Category

Text Analytics Software Free

Features

•Cost sensitive classification
• Ramp loss function
• Hinge loss function

License

Open Source

Price

Free

Pricing

Subscription

Free Trial

Available

Users Size

Small (<50 employees), Medium (50 to 1000 employees), Enterprise (>1001 employees)

Website
Company

LPU

What is best?

•Cost sensitive classification
• Ramp loss function
• Hinge loss function

What are the benefits?

•Can be used for both retrieval or classification.

PAT Rating™ ( Beta)
Editor Rating
Aggregated User Rating
Ease of use
7.6
Features & Functionality
7.6
Advanced Features
7.6
Integration
7.6
Performance
7.6
Customer Support
7.6
Implementation
Renew & Recommend
Bottom Line

LPU is a text learning or classification system that learns from a set of positive documents and a set of unlabeled documents (without labeled negative documents) and can be used for both retrieval or classification.

7.6
Editor Rating
Aggregated User Rating
You have rated this

LPU (which stands for Learning from Positive and Unlabeled data) is a text learning or classification system that learns from a set of positive documents and a set of unlabeled documents (without labeled negative documents). This type of learning is different from classic text learning/classification, in which both positive and negative training documents are required. Given a set of positive documents and a set of unlabeled documents, the LPU algorithm learns a classifier in two steps:

• Step 1 : Identifying a set of reliable negative documents from the unlabeled set. For this step, LPU has three techniques, i.e., spy, roc (rocchio), nb (naive bayes) and DNF. In all these techniques, the unlabeled set is treated as negative data. In Spy technique you sample a certain % of positive examples and put them into unlabeled set to act as “spies”.Then run a classification algorithm assuming all unlabeled examples are negative, we will know the behavior of those actual positive examples in the unlabeled set through the “spies”.We can then extract reliable negative examples from the unlabeled set more accurance.
• Step 2 : Building and selecting a classifier, which consists of two sub-steps:
1. Building a set of classifiers by iteratively applying a classification algorithm. For this step, LPU has two techniques, SVM(Support vector machines ) and EM (Expectation Maximization).
2. Selecting a good classifier from the set of classifiers constructed above. We call this sub-step "catching a good classifier".
LPU system can be used for retrieval or classification. For retrieval, the document collection is the unlabeled set, which is also the test set. For classification, you can provide a separate test set that is different from the unlabeled set used in training.

LPU
Text Analytics Software Free: Top Twenty
PAT Index™
 
1
General Architecture for Text Engineering – GATE
 
2
RapidMiner Text Mining Extension
 
3
KH Coder
 
4
Coding Analysis Toolkit
 
5
QDA Miner Lite
 
6
VisualText
 
7
TAMS
 
8
Datumbox
 
9
Carrot2
 
10
Natural Language Toolkit
 
11
Pattern
 
12
Apache Mahout
 
13
Textable
 
14
Apache OpenNLP
 
15
Twinword
 
16
Apache UIMA
 
17
Aika
 
18
tm – Text Mining Package
 
19
KNIME Text Processing
 
20
LingPipe
Heat Index
 
 
 
 
 
The Latest
 
Read More
399.5
Editor's Picks
 
 
 
 
Go To Text Analytics Software Free
Filter reviews
User Ratings





User Company size



User role





User industry





Ease of use
Features & Functionality
Advanced Features
Integration
Performance
Customer Support
Implementation
Renew & Recommend

What's your reaction?
Love It
0%
Very Good
0%
INTERESTED
0%
COOL
0%
NOT BAD
0%
WHAT !
0%
HATE IT
0%