Home » Uncategorized

Free data mining programs for everyday use

If you want quickly to get started with data analysis, here is my advise on free software programs that I use every day for data analysis, statistics and data mining.

R-package – a software for statistical computing written in C. Script oriented.

  • Pros: widely used, simple, extensive documentation.
  • Cons: less options for graphics compared to competitors, no multi-threading, scripting features are limited compared to full-featured programming languages (such as Python, C++ or Java)  .

DMelt – a mathematical software written in Java and based on GNU libraries supported by the DMelt team.

  • Pros: support for many languages, Java, Python/Jython, Groovy, Ruby, Octave. Multi-threading. Extensive documentation, 2D/3D graphics and hundreds of code examples.
  • Cons: Hard to bind with CPython. Many advanced topics of DMelt documentation are proprietary.

Weka – A Java environment for data mining.

  • Pros: advanced GUI, good documentation.
  • Cons: support for data visualization is less advanced compared to alternative programs. Scripting support is limited.

Orange – visualization and analysis for novice and experts.

  • Pros: Advanced GUI and graphics. Good documentation. Support for CPython.
  • Cons: Less choice for scripting compared to alternative statistical packages