Subscribe to DSC Newsletter

November 2013 Blog Posts (40)

Big Data: An Understanding

We establish understanding of things in terms of Data or it will be better to say in terms of Big Data, the utilization of things, matters, issues, inventions, surroundings, maps and much more throughout our everyday life cycle, all of which has a certain data type to get input, process and output for us. Sometime we understand these in almost no time as a human, where data is being originated, what are we targeting for and more, and there are times, when some thing might take longer…

Continue

Added by Atif Farid Mohammad on November 29, 2013 at 12:50am — No Comments

Don't reinvent the wheel!

You could take a look at some traditional stat anslyses, like Cluster Analysis, and for a visual representation,  try Multidimensional Scaling. These would save you a lot of time. There is no reason to reinvent the wheel, but Big Data seems intent upon ignoring the vast historical knowkedge regarding statistics,  psychometrics, and Measurement Theory. Believe me- if you have a question on how to proceed with an analysis, multivariate statistics is likely to have several accaeptable answers.…

Continue

Added by Mark Biernbaum on November 25, 2013 at 9:20am — No Comments

Java Coding Samples for Online Data-mining

In this post, I discuss the basic characteristics of code that I have personally used to extract online data - in a process these days often called data-mining.  I intend to cover some general features.  Those that wish to do so can also compile the coding samples.

Over the years, I have programmed in a number of computer programming languages including Visual Basic, Perl, Python, and LISP (AutoLISP).  The coding samples on this blog are written in Java, my language of…

Continue

Added by Don Philip Faithful on November 24, 2013 at 7:00am — 3 Comments

My Data Science Book - Table of Contents

This book is also part of our apprenticeship. Part of the content as well as new content is in a separate document called Addendum. Click here to download the…

Continue

Added by Vincent Granville on November 23, 2013 at 12:30pm — 55 Comments

Hazards of Institutional Data

A prominent discrimination case in Canada involves a firefighter named Tawney Meiorin.  Meiorin had successfully performed her duties as a firefighter for many years.  She lost her job after the introduction of mandatory testing to determine her fitness for the position.  The testing measured aerobic capacity, and it was developed in a manner that many would regard as scientific; that is to say, it used a highly quantitative and analytic approach.  However,…

Continue

Added by Don Philip Faithful on November 23, 2013 at 4:43am — 1 Comment

Big Data getting bigger in UAE

With immense growth in technology, financial institutes, skyscrapers and malls, one can find that the number of consumers have increased exponentially in the Middle East over the past few years. Reports suggests that 91% of the population comprises of expats in UAE enjoying high salaries and tax free benefits. 

Moreover, the options to spend your money have gotten bigger and better with UAE working towards the initiative to build Smart Cities setting strong infrastructure fundamentals…

Continue

Added by IPSITA on November 22, 2013 at 12:02pm — No Comments

SOA, Cloud Computing and Big Data Security

Let us start with some jargons and distill these into some meaningful things to be understood by us the only intelligent autonomous system in other words humans.

 

SaaS – Applications focused on end-users, used internet as a medium, e-mail, salesforce etc.

PaaS – Set of tools focused on developers, such as Ruby on Rails, Python, Eclipse, REST, SOAP, Oracle.

IaaS – Complete software and hardware solutions, VMware, Amazon EC2, Rackspace Cloud, Google Compute…

Continue

Added by Atif Farid Mohammad on November 22, 2013 at 9:22am — No Comments

QlikView usage for full search

Hello, friends!

In the following entry of my own blog I describe how to take advantage of one of the features of QlikView, a business intelligence tool, to perform quick searches across a data set.

QlikView usage for full search

I hope it to be interesting and practical.

QlikView is an option to take into account by a data scientist, given…

Continue

Added by Jose Luis Dengra on November 22, 2013 at 8:30am — No Comments

Need solution To Crawl 200 million Web Page in 10-1 month Using Perl and hadoop is this possible?

Hi all Greetings from Prajan,

 I am Prajan(Pandiyarajan), Working As a Perl Developer in Sciera Solution.I have 2+ year in perl and Big Data .

 I got a difficult task from my manager ,that was i need to Crawl 200 million URL in same Domain with in 20-1 month time ,i have tried in Perl i got Maximum 250 hits only in 60 seconds.I heard will make this using Hadoop But i don't know hadoop ,Any one can You give a solution for my task with detail then how can i reach…

Continue

Added by Pandiyarajan on November 21, 2013 at 10:10pm — 1 Comment

Healthcare Informatics and Big Data

In the era of Big Data, it is getting obvious for the healthcare market to expand its horizons to absorb technologies to achieve greater efficiencies by leveraging the Service Oriented Cloud tools and technologies, which will lower costs for hospitals and deliver higher quality patient care on simultaneous basis, There is a huge influx impulse of healthcare market drivers is generating Big Data every minute.

 

Once a patient is in some healthcare facility in any condition, we…

Continue

Added by Atif Farid Mohammad on November 21, 2013 at 7:30am — No Comments

Taxonomy of Data Scientists

This is a first attempt at classifying data scientists. I invite you to produce a more comprehensive, better solution.…

Continue

Added by Vincent Granville on November 20, 2013 at 8:00pm — 8 Comments

Introduction to the BigObject® and In-place Computing Model

The BigObject® - A  Computing Engine Designed for Big Data

BigObject® presents an in-place* computing approach, designed to solve the complexity of big data and compute on a real-time basis. The mission of the BigObject® is to deliver affordable computing power, enabling enterprises of all scales to interpret big data. With the advances in what a commodity machine can perform, it…

Continue

Added by Yuanjen Chen on November 20, 2013 at 5:29pm — No Comments

Should Big Data processing matter ?

Well, when it comes about Big Data processing, there are a few platforms like Hadoop, mongoDB and Oracle in-memory DB etc. However, I wonder why the platforms must be complicated ? 

Someone think in a same way, please look at following. 

http://en.usp-lab.com

Cheers,

Added by Buyanjargal Shirnen on November 19, 2013 at 9:00pm — 1 Comment

The Data Science Equation

I present here the results of a data science study about data science. Based on LinkedIn data (top people listed when you do a people search for data science, from a LinkedIn account with 8,000+ data science connections), we identified the fields most frequently associated with data science, as well as top data scientists on LinkedIn.…

Continue

Added by Vincent Granville on November 19, 2013 at 8:00pm — 9 Comments

Three Announcements

Data Visualization Contest

This December 4-5, DataBeat will host the 3rd annual Data Science Summit taking place December 4-5 in Redwood, CA. The event brings together academics, organizations, media companies, and brands to explore the…

Continue

Added by Vincent Granville on November 19, 2013 at 10:00am — 2 Comments

Plotly Beta: Gallery of Beautiful Graphs

Plotly has a new gallery of graphs, which showcases some of the best graphs made to date with the product.

You can make graphs like these with a GUI, or with code, which can be found and copied on our API pages (with support for Python, R, MATLAB, Arduino, REST, Julia, and Perl).…

Continue

Added by Matthew Sundquist on November 18, 2013 at 4:27pm — No Comments

ETL, ELT and Data Hub: Where Hadoop is the right fit ?

Few days back i have attended a good webinar conducted by Metascale on topic “Are You Still Moving Data? Is ETL Still Relevant in the Era of Hadoop?” This post is targeting this webinar.

In summary, this webinar had nicely explained about how enterprise can use Hadoop as a data hub along with the existing Datawarehouse set up. “Hadoop as a Data Hub” this line itself raised lot of questions in my…

Continue

Added by Manish Bhoge on November 17, 2013 at 8:16pm — 5 Comments

Models vs. Experiments

Continue

Added by Michael Walker on November 17, 2013 at 8:30am — No Comments

Monthly Archives

2017

2016

2015

2014

2013

2012

2011

1999

Follow Us

Videos

  • Add Videos
  • View All

Resources

© 2017   Data Science Central   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service