Pattern is a web mining module for the Python programming language.
Web Scraping Tools
Data mining tools
Natural language processing
Small (<50 employees), Medium (50 to 1000 employees), Enterprise (>1001 employees)
Pattern is a web mining module for the Python programming language. It has tools for data mining (Google, Twitter and Wikipedia API, a web crawler, a HTML DOM parser), natural language processing (part-of-speech taggers, n-gram search, sentiment analysis, WordNet), machine learning (vector space model, clustering, SVM), network analysis and visualization. The pattern.web module is a web toolkit that contains API's (Google, Gmail, Bing, Twitter, Facebook, Wikipedia, Wiktionary, DBPedia, Flickr, ...), a robust HTML DOM parser and a web crawler. The pattern.en module is a natural language processing (NLP) toolkit for English. Because language is ambiguous (e.g., I can ↔ a can) it uses statistical approaches + regular expressions. This means that it is fast, quite accurate and occasionally incorrect. It has a part-of-speech tagger that identifies word types (e.g., noun, verb, adjective), word inflection (conjugation, singularization) and a WordNet API. The pattern.search module contains a search algorithm to retrieve sequences of words (called n-grams) from tagged text. The search pattern NP be RB?+ important than NP means any noun phrase (NP) followed by the verb to be, followed by zero or more adverbs (RB, e.g., much, more), followed by the words important than, followed by any noun phrase. It will also match "The mobile web will be much less important than mobile apps" and other grammatical variations. The pattern.vector module is a toolkit for machine learning, based on a vector space model of bag-of-words documents with weighted features (e.g., tf-idf) and distance metrics (e.g., cosine similarity, infogain). Models can be used for clustering (k-means, hierarchical), classification (Naive Bayes, Perceptron, k-NN, SVM) and latent semantic analysis (LSA).
You may also like to read, Top Digital Asset Management Software, Top Mobile Commerce Platforms, Top Personalization Software, Top Subscription Management Software, Best Channel Integration Platform, Best Web Content Management Systems, Best Web Payment Gateways and Processors, Top Mobile Payment Providers, Top Product Review Platforms, Top Customer Engagement Platforms, Top Web Hosting Services, Top Online Community Management Software, Top Open Source Digital Commerce Platform and Top Content Delivery Network Providers.
Top Python Web Framework Software , Top PHP Web Framework Software , Top Java Web Framework Software , Top Web Framework Software , Top Web Scraping Software , Top Domain Registration Providers , Top Event Management Software