In this post, we learn how to build a basic search engine, or document retrieval system, using the vector space model. This approach is widely used in information retrieval systems. Given a set of documents and a search query, we need to retrieve the documents that are most similar to the query.
The problem statement above is represented as below… Continue
Added by dataperspective on November 15, 2017 at 1:30am —
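As a rough illustration of the vector space model described in the post above, here is a minimal sketch in plain Python. It is not the post author's implementation: the TF-IDF weighting with a smoothed IDF, the whitespace tokenizer, and the names `tokenize`, `build_idf`, `tfidf_vector`, `cosine` and `search` are all assumptions made for this example.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase and split on whitespace (assumption: a real system would
    # also strip punctuation, remove stop words and apply stemming).
    return text.lower().split()


def build_idf(docs_tokens):
    # Document frequency: the number of documents each term appears in.
    n_docs = len(docs_tokens)
    df = Counter()
    for tokens in docs_tokens:
        df.update(set(tokens))
    # Smoothed inverse document frequency (one common variant, chosen here
    # so that terms found in every document keep a positive weight).
    return {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}


def tfidf_vector(tokens, idf):
    # Sparse vector: term -> term frequency * inverse document frequency.
    # Terms that never occur in the collection are ignored.
    tf = Counter(tokens)
    return {t: tf[t] * idf[t] for t in tf if t in idf}


def cosine(u, v):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query, docs):
    # Return (score, document index) pairs, most similar document first.
    docs_tokens = [tokenize(d) for d in docs]
    idf = build_idf(docs_tokens)
    doc_vectors = [tfidf_vector(tokens, idf) for tokens in docs_tokens]
    query_vector = tfidf_vector(tokenize(query), idf)
    scored = [(cosine(query_vector, dv), i) for i, dv in enumerate(doc_vectors)]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Hypothetical sample collection, used only for illustration.
SAMPLE_DOCS = [
    "the cat sat on the mat",
    "dogs chase cats in the park",
    "stock markets fell sharply today",
]

if __name__ == "__main__":
    for score, index in search("cat mat", SAMPLE_DOCS):
        print(round(score, 3), SAMPLE_DOCS[index])
```

Documents and the query are mapped into the same term space, so ranking reduces to comparing vector directions. Documents that share no terms with the query score zero.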
Michael Li is founder and CEO at The Data Incubator. The company offers curriculum based on feedback from corporate and government partners about the technologies they are using and learning, for masters and PhDs.
Below is a ranking of 23 open-source deep learning libraries that are useful for Data Science, based on GitHub and Stack Overflow activity, as well as Google search results. The… Continue
Added by Michael Li on October 17, 2017 at 12:00pm —
Technology has remarkably changed the way we live today; there is no denying it. Compared with our ancestors, we rely on far more technology in our day-to-day work.
So many technologies developed in the past couple of years have revolutionized our lives that it’s impossible to list them all. Though technology changes fast, we can observe the trends in how it changes. Last year, 2016, brought so many fresh… Continue
Added by Venkatesan M on October 6, 2017 at 9:00pm —
Text Analytics with Python -- A Practical Real-World Approach to Gaining Actionable Insights from your Data
Text analytics can be overwhelming and frustrating at times, given the unstructured and noisy nature of textual data and the vast amount of information available. "Text Analytics with Python", published by Apress/Springer, is a book packed with 385 pages of useful information based on techniques, algorithms,… Continue
Added by Dipanjan Sarkar on July 14, 2017 at 4:00am —
Graphs belong to graph theory, a field of mathematics. For data analysis that requires searching for particular patterns, graph-based data mining becomes an important technique. Indeed, in real life, most of the data we have to deal with can be represented as graphs. A typical graph consists of vertices (nodes, cells), and of edges that…
Added by jwork.ORG on June 19, 2017 at 5:30pm —
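To make the vertex-and-edge description in the post above concrete, here is a minimal sketch of one common way to store a graph, an adjacency list, together with a breadth-first search. The undirected interpretation, the sample edges and the names `build_graph` and `reachable` are assumptions for illustration, not taken from the post.

```python
from collections import defaultdict, deque


def build_graph(edges):
    # Adjacency list: each vertex maps to the set of its neighbours.
    # Assumption: the graph is undirected, so each edge is stored both ways.
    adjacency = defaultdict(set)
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)
    return adjacency


def reachable(adjacency, start):
    # Breadth-first search: collect every vertex connected to `start`.
    seen = {start}
    queue = deque([start])
    while queue:
        vertex = queue.popleft()
        for neighbour in adjacency.get(vertex, ()):
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append(neighbour)
    return seen


# Hypothetical sample edges, used only for illustration.
SAMPLE_EDGES = [("a", "b"), ("b", "c"), ("d", "e")]

if __name__ == "__main__":
    graph = build_graph(SAMPLE_EDGES)
    print(sorted(reachable(graph, "a")))
```

Pattern searches over graph data typically build on traversals of this kind.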
For Python programmers, scikit-learn is one of the best libraries to build Machine Learning applications with. It is ideal for beginners because it has a really simple interface and is well documented, with many examples and tutorials.
Besides supervised machine learning (classification and regression), it can also be used for clustering, dimensionality reduction, feature extraction and engineering, and pre-processing the data. The interface is consistent over all of these methods, so… Continue
Added by Ahmet Taspinar on May 26, 2017 at 4:30am —
Last year I started developing a Face Recognition model. I started with static pictures, using Wolfram Mathematica. This year I found out we can do the same job using OpenCV in Python, or by creating specific filters in R and applying Weierstrass and Gaussian transformations.
There are many difficulties in recognizing faces of the same person, such as position, rotation of the face, age, feeling, brightness, gamma, contrast, saturation, and obstacles like hands, hair and so… Continue
Added by Rubens Zimbres on October 15, 2016 at 4:00am —
Contributed by Rob Castellano. He is currently in the NYC Data Science Academy 12-week full-time Data Science Bootcamp program taking place from April 11th to… Continue
Added by NYC Data Science Academy on September 4, 2016 at 10:30am —
Here are three useful resources for learning about Data Science:
Added by Ujjwal Karn on May 18, 2016 at 8:59am —
Data is growing, and so is the need for perfect decision making by companies. In this era where analytics is the new buzzword, companies are looking for able and determined Data Scientists.
But as always, landing the sexiest and highest-paying job is no easy task. To acquire such a demanding position, one must be highly skilled and experienced.
Don’t freak out! Nowadays, a lot of resources and tips are available to help you be perfect in what you are… Continue
Added by Sukanya Mohapatra on April 15, 2016 at 4:30am —
The Riemann Hypothesis is arguably the most important unsolved problem in mathematics. It falls into an area called Analytic Number Theory, which is essentially number theory with complex numbers thrown into the mix. The hypothesis states that all non-trivial zeros of the Riemann Zeta function fall on the critical line. What!?? Ok, sorry. That is not very helpful. Let's just say that there is a critical relationship between this function and our…
Added by Eduardo Siman on February 11, 2016 at 3:30am —
It's a brand new year and that means you'll be seeing scads of trend pieces this week. As you can probably assume from the title, we're not necessarily immune to that sort of thing… Continue
Added by Leena Kamath on February 8, 2016 at 9:35am —
Guest blog by Greg Roberts at Packt Publishing
(to see this graph in its fully interactive form see http://gregroberts.github.io/)
I love Python, and to celebrate… Continue
Added by William Vorhies on October 14, 2015 at 8:18am —
This tutorial was authored by Jose A Dianes, PhD.
An important step in data analysis is data exploration and representation. In this tutorial we will see how by combining a technique called Principal Component Analysis (PCA) together with Cluster… Continue
Added by Codementor on September 1, 2015 at 4:32pm —
Which accounts should I follow on Twitter to stay up to date with Big Data?
This is a common question for those who want to learn about Big Data or know more about the field.
Well, it just so happened that I wanted to practise my skills with Python and the Twitter API, so I created a way to find and rank the most successful accounts talking about Big Data on Twitter.
Here is the… Continue
Added by Luca Naso on August 3, 2015 at 9:04am —
Image processing is a mathematically intensive operation and one of the biggest areas of research in the big data field. Image processing analytics has applications ranging from processing an X-ray to identifying stationary objects in a self… Continue
Added by Pansop on July 20, 2015 at 12:11am —
Added by Pansop on June 22, 2015 at 12:30am —
Added by Pansop on June 16, 2015 at 9:00pm —
Here is a list of top Python machine learning projects on GitHub. A continuously updated list of open source learning projects is available on Pansop.
scikit-learn is a Python… Continue
Added by Pansop on May 21, 2015 at 8:00pm —
Python & data analytics go hand in hand. Here is a list of 9 Python data analytics libraries. This list is going to be… Continue
Added by Pansop on May 21, 2015 at 4:30am —