Introduction

Overview

Contextor (Content Extractor) is a framework and library designed mainly for data mining and web mining. It was developed because suitable tools and libraries for this purpose are lacking. In brief, Contextor provides services in the following areas:

  • An abstract, robust, and extensible model for describing data in the context of data mining and manipulation
  • Loading and reading from various types of resources, such as the file system, databases, and the web
  • Filtering and manipulating raw data extracted from those resources
  • Persisting processed data in various forms, such as the file system or a database
  • Infrastructure and basic modeling for extracting analytical and statistical information
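
The load, filter, and persist areas listed above can be pictured as a small pipeline. The following is a minimal sketch only: every name in it (PipelineSketch, Item, Loader, Filter, Sink, run) is hypothetical and chosen for illustration, and none of it is taken from Contextor's actual API. It assumes an in-memory loader and sink in place of real file system, database, or web resources.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch; all type and method names here are hypothetical,
// not part of Contextor.
public class PipelineSketch {

    /** Abstract description of one piece of mined data. */
    static final class Item {
        final String source;
        final String content;

        Item(String source, String content) {
            this.source = source;
            this.content = content;
        }
    }

    /** Reads raw items from some resource (file system, database, web). */
    interface Loader {
        List<Item> load();
    }

    /** Transforms an item, or returns null to drop it. */
    interface Filter {
        Item apply(Item item);
    }

    /** Stores processed items (file system, database). */
    interface Sink {
        void persist(List<Item> items);
    }

    /** Loads items, passes each through the filter chain, then persists the survivors. */
    static List<Item> run(Loader loader, List<Filter> filters, Sink sink) {
        List<Item> processed = new ArrayList<>();
        for (Item item : loader.load()) {
            Item current = item;
            for (Filter filter : filters) {
                current = filter.apply(current);
                if (current == null) {
                    break; // a filter rejected this item
                }
            }
            if (current != null) {
                processed.add(current);
            }
        }
        sink.persist(processed);
        return processed;
    }

    public static void main(String[] args) {
        // In-memory stand-ins for a real resource and a real store.
        Loader loader = () -> List.of(
                new Item("memory", "  Hello World  "),
                new Item("memory", "   "));
        Filter trim = i -> new Item(i.source, i.content.trim());
        Filter dropEmpty = i -> i.content.isEmpty() ? null : i;

        List<Item> stored = new ArrayList<>();
        List<Item> result = run(loader, List.of(trim, dropEmpty), stored::addAll);

        System.out.println(result.size() + " item(s): " + result.get(0).content);
    }
}
```

Keeping each stage behind its own small interface is one way to get the pluggability described in this section: a different resource or store can be swapped in without touching the filter chain.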

Contextor builds on the following parts of the Spring Framework:

  • Spring Core
  • Spring Beans
  • Spring DAO

Considerable effort has gone into giving Contextor a flexible, extensible, and highly pluggable architecture. We believe it can be applied to a wide range of mining tasks.