Subscribe to DSC Newsletter

Jwork.ORG's Blog (25)

Encyclosearch.org is launched

A new project called Encyclosearch (http://encyclosearch.org/) has been launched. Encyclosearch allows you quickly and easily search many high-quality encyclopedias at once, instead of relying…

Continue

Added by jwork.ORG on January 6, 2021 at 11:00am — No Comments

Recent Java enhancements for numeric calculations

In the past, slow evaluation of mathematical functions and large memory footprint were the most significant drawbacks of Java compared to C++/C for numeric computations and scientific data analysis. However, recent enhancements in the Java Virtual Machine (JVM) enabled faster and better numerical computing due to several enhancements in evaluating trigonometric functions.

In this article we will use the DataMelt (https://datamelt.org) for our…

Continue

Added by jwork.ORG on November 7, 2020 at 10:21am — No Comments

Restored Wikipedia articles on computing

About 15,200 scholarly articles permanently removed from Wikipedia in 2018 and 2019 have been restored by the Handwiki team. In 2018 and 2019, such articles  did not pass the Wikipedia's notability. As the result of this, a lot of useful content  was lost. Articles on software programs and computing were the most affected by this Wikipedia…

Continue

Added by jwork.ORG on August 20, 2020 at 10:03am — No Comments

Stunning 3D visualization with JavaView

JavaView(http://www.javaview.de/) is a 3D geometry viewer and a mathematical visualization software known since 90x. The program is written in Java, and enables a smooth integration into commercial software like Mathematica and Maple. JavaView can be used for 3D scientific visualization, geometric modeling, variational optimization,…

Continue

Added by jwork.ORG on June 21, 2020 at 5:30pm — No Comments

Can a human brain hold your life experience?

A human brain is an amazing instrument. It combines huge data storage with massive real-time processing. According to Scientific American [1], the memory capacity of the human brain was reported to have the equivalent of 2.5 petabytes (2500 TB) of memory capacity. This number was obtained by estimating how much information can be stored by 125…

Continue

Added by jwork.ORG on May 21, 2020 at 3:30pm — No Comments

HandWiki encyclopedia of datascience

In 2020, HandWiki has become the largest online wiki encyclopedia for major science topics (physics, math etc.) and computing. It has more than 105,000 scholarly articles, incorporating the current Wikipedia articles, scholarly articles submitted to the Wikipedia foundation (but later…

Continue

Added by jwork.ORG on January 16, 2020 at 6:30pm — No Comments

How to make you own Wiki from Wikipedia using Python

Here is a short blog I was asked to make about making a personal Wiki from Wikipedia. It shows the basic steps in text processing so I hope it will be useful for data scientists. It also requires some knowledge of MediaWiki setup on a web server, and some (not very advanced) knowledge of the Python programming language. It takes only several days to create this Wiki with Wikipedia articles if you know…

Continue

Added by jwork.ORG on October 24, 2019 at 1:21am — No Comments

Wikis for publishing scholarly articles on data science and software

By now you may already know that to add scholarly articles to the English version of Wikipedia is difficult due to the "notability" concept and tight control from anonymous editors (see this article). In recent years, entire Wikipedia topics and articles dedicated to software and data…

Continue

Added by jwork.ORG on September 28, 2019 at 4:30am — No Comments

Calculating exclusion limits for a new theory in hardcore science

In many fields of science, it is important to understand the relevance of new theories or hypotheses in a description of experimental data, assuming that such data are already well represented by predictions of some well-accepted theory. A popular statistical method for setting upper limits (also called exclusion limits) on model parameters of a new theory is based on the CLs method.…

Continue

Added by jwork.ORG on April 11, 2019 at 3:00pm — No Comments

Best dynamically-typed programming languages for data analysis

One can seriously argue about what programming language is the best for data analysis, but there is one universal metric that can define your choice: speed of calculations. Therefore, the word "best" in the title means the languages that lead to most performant applications. If most performant program can also be written in an easy-to-use, easy-to-learn, dynamically-typed…

Continue

Added by jwork.ORG on January 26, 2019 at 2:54pm — No Comments

Evaluation and comparison of open source software suites for data mining and knowledge discovery

An article by A.H.Abdulrahman, J. M. Luna, 2 M. A. Vallejo 3 and S. Ventura with the title "Evaluation and comparison of open source software suites for data mining and knowledge discovery" (published by Wiley "Data Mining and Knowledge Discovery, Vol 7 Issue 3 2017 see this link) provides the research community with an extensive study on different features included in any data mining tool. The final score for…

Continue

Added by jwork.ORG on October 30, 2018 at 3:18pm — No Comments

Statistical analysis on the Android platform

Last week a new release of AWork (version 2.0) was submitted to Google Play  (see the AWork link). Finally it supports Android 8+ devices with high resolution screens. AWork is a complete programming environment for Android devices…

Continue

Added by jwork.ORG on October 1, 2018 at 3:30pm — No Comments

Popularity of software programs for data science using recent reviews

In this article we discuss popularity of various software programs used for data analysis which are mentioned in various reviews published online in the period between 2017 and 2018. We used 14 reviews listed in the article Popularity of software programs for data…

Continue

Added by jwork.ORG on September 6, 2018 at 6:00pm — No Comments

Using Multi-Layer Recurrent Neural Network for language models

Here is another example of how to use Multi-Layer Recurrent Neural Network (RNN package) designed for character-level language models. This neural network was trained using 165,000+ real titles of acts submitted to the Congress from CONGRESS.GOV. The training was performed using GPU. Then the trained RNN was used to create "fake" titles. Use this link to find…

Continue

Added by jwork.ORG on August 17, 2018 at 4:29pm — No Comments

Everipedia as a desk reference for data mining topics

One interesting metric to check the  usefulness of Everipedia as a desk reference for data mining is to compare the number of relevant articles. Go to Everipedia (https://everipedia.org/) and search for "data mining". You will get 7 articles.Then go to Wikipedia and search "data mining" You will see 4 articles (overlapped with similar Everipedia  articles).

Another example. Try the word "smoothing" which is a popular topic in data analysis.…

Continue

Added by jwork.ORG on August 2, 2018 at 1:34pm — No Comments

DataMelt published Java API documentation

DataMelt computational platform for data analysis organized its Java documentation:

Continue

Added by jwork.ORG on June 23, 2018 at 5:24pm — No Comments

Image identification using a convolutional neural network

This blog  explores a typical image identification task using a convolutional ("Deep Learning") neural network. For this purpose we will use a simple JavaCNN packageby D.Persson, and make our example small and concise using the Python scripting language. This example can also be rewritten in Java, Groovy, JRuby or any scripting language supported by the Java virtual machine.



This example will use images in the grayscale format (PGM). The name "PGM" is an acronym derived from…

Continue

Added by jwork.ORG on May 31, 2018 at 1:30pm — No Comments

Neural network classification of data using Smile

Data classification is the central data-mining technique used for sorting data, understanding of data and for performing outcome predictions. In this small blog we will use a library Smilecthat includes many methods for supervising and non-supervising data classification…

Continue

Added by jwork.ORG on March 13, 2018 at 4:00pm — No Comments

Recasting Java neural networks in Python

Many neural network applications implemented in Java, such as Neuroph, Encog and Joone, may look rather different when switching from the Java language to Python with the help of the DMelt computing environment. First of all, they look simpler. You can use your favorite Python tricks to load and display data. The Python coding is simpler for viewing and fast modifications. It does not require recompiling after each change. At the same time, the platform…

Continue

Added by jwork.ORG on July 29, 2017 at 1:00pm — No Comments

Coding graphs for data mining in Python using Java platform

Graphs belong to the field of mathematics, graph theory. For data analysis that requires searches of particular patterns, graph-based data mining becomes an important technique. Indeed, in real life, most of the data we have to deal with can be represented as graphs. A typical graph consists of vertices (nodes, cells), and of edges that…

Continue

Added by jwork.ORG on June 19, 2017 at 5:30pm — No Comments

© 2021   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service