R is a free statistical computing environment, so there are often several ways and packages to produce a given statistical or quantitative output. Here I discuss a concise list of R packages for modeling financial risk and/or optimizing portfolios efficiently and effectively. The intended audience for this article is financial market analysts interested in using R, as well as quantitatively inclined folks…
Added by Ranjit Mishra on April 28, 2018 at 2:30am — No Comments
Cluster.OBeu v1.2.1 release on CRAN
We are very pleased to announce Cluster.OBeu v1.2.1 on CRAN!
Added by Kleanthis Koupidis on March 12, 2018 at 4:00am — No Comments
We will use an R package called rvest, which was created by Hadley Wickham. This package simplifies the process of scraping web pages.…
Added by Deepanshu Bhalla on February 26, 2018 at 9:15am — No Comments
Today I've just faced one challenge...
I am working on a project where we need to decide which of this company's assets will be visited first.
Like every project, we have a limited budget, so we need to find some variables that indicate a priority scale.
After some discussion, someone said: let's find the assets close to schools and hospitals. Those must be inspected before the others.
Everybody agreed with the suggestion, but how would it be possible to map every school and…
Now that Christmas and the New Year are behind us, the days are becoming a little longer with each passing day. Nevertheless, there are still loads of cold winter nights left to endure (unless you're in the Southern Hemisphere, in which case – throw me a shrimp on the barbie!).
It's time to dust off your New Year resolutions from last year (remember those?), get ready for a new start and a new you, and learn some new data skills.
I’ve thrown together a collection of five…
An R Shiny app is an interactive web interface. A Shiny app has two components: a user interface object (ui.R) and a server function (server.R). The two components are passed as arguments to the shinyApp function, which creates a Shiny app object. For more info on how to build Shiny…
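The two-component structure described above can be sketched as a minimal single-file app. This is only an illustration, not the app from the original post: it assumes the shiny package is installed, and the input and output names (`n`, `hist`) and the histogram itself are made up for the example.

```r
library(shiny)

# User interface object: one slider and one plot placeholder.
# The ids "n" and "hist" are hypothetical names chosen for this sketch.
ui <- fluidPage(
  sliderInput("n", "Sample size", min = 10, max = 500, value = 100),
  plotOutput("hist")
)

# Server function: redraws the histogram whenever the slider changes.
server <- function(input, output) {
  output$hist <- renderPlot({
    hist(rnorm(input$n), main = "Random normal sample", xlab = "Value")
  })
}

# Both components are passed to shinyApp(), which returns a Shiny app object.
app <- shinyApp(ui = ui, server = server)

# Launch the app only when running interactively.
if (interactive()) {
  runApp(app)
}
```

In a two-file layout, the `ui` definition would live in ui.R and the `server` function in server.R instead of being combined in one script.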
Today, data scientists are generally divided between two languages: some prefer R, some prefer Python. I will not try to explain in this article which one is…
Added by Marija Zoldin on September 26, 2017 at 10:00am — No Comments
Guess whose Twitter handle gives this word cloud? There are enough hints in there. You are right, that is Andrew Ng tweeting about his new Deep Learning course on Coursera! It’s always fun to see data in action, isn’t it? Let’s try to create a similar word cloud for three world leaders, viz. American President Donald Trump, Indian Prime Minister Narendra Modi, and Russian President Vladimir Putin.…
Added by Kritika Jalan on September 23, 2017 at 11:30pm — No Comments
Social media platforms such as Twitter and Facebook enable everyone to voice their opinions about topics, companies, and products online.
These comments are a great source for companies to analyze their customers’ opinions about their brand or product. However, with billions of Tweets and posts daily, this can take a lot of time.
Unless, of course, you use R :-) With just a few lines of R code and the help of machine learning, we’re able to build mood monitoring tools quickly,…
Added by Daniel Schmeh on September 7, 2017 at 9:30am — No Comments
Principal Component Analysis (PCA) is a technique used to find the core components that underlie different variables. It is very useful whenever doubts arise about the true origin of three or more variables. There are two main methods for performing a PCA: naive and less naive. In the naive method, you first check some conditions in your data which will determine the essentials of the analysis. In the less-naive method, you set those yourself,…
Added by Pablo Bernabeu on September 6, 2017 at 1:30pm — No Comments
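As a rough illustration of the PCA idea in the teaser above, here is a minimal sketch using base R's `prcomp`. It does not reproduce the naive or less-naive procedures the post refers to; the built-in `USArrests` data set is an assumption chosen only so that the example is self-contained.

```r
# Sketch only: PCA on the built-in USArrests data (an illustrative choice,
# not the data used in the original post).
data(USArrests)

# Center and scale the variables so that those with large variances
# do not dominate the components.
pca <- prcomp(USArrests, center = TRUE, scale. = TRUE)

# Proportion of total variance explained by each component.
var_explained <- pca$sdev^2 / sum(pca$sdev^2)
print(round(var_explained, 3))

# Loadings: how each original variable contributes to each component.
print(pca$rotation)

# Scores of the first few observations on the first two components.
print(head(pca$x[, 1:2]))
```

Looking at `var_explained` is one common way to judge how many components are worth keeping; the cutoff itself is a judgment call.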
DVC is an open source tool that could help with achieving code simplicity, readability and faster model development. The idea is to track file and data dependencies during model development in order to facilitate reproducibility and to version data files. DVC is a language-agnostic tool and can be used with any programming language. Here we will describe how we can…
Added by Marija Zoldin on August 4, 2017 at 12:30am — No Comments
Added by Sandipan Dey on July 31, 2017 at 4:00am — No Comments
First of all, we will see what clustering in R is. Then we will look at the applications of clustering, clustering by similarity aggregation, the use of the R amap package, the implementation of hierarchical clustering in R, and examples of R clustering in various fields.
2. Introduction to Clustering in…
Added by Sheetal Sharma on July 19, 2017 at 9:00pm — No Comments
Visualization apps may be privately consulted as well as published online. There are two main platforms: R Shiny and Tableau. Shiny has a free starter license…
Added by Pablo Bernabeu on July 15, 2017 at 4:00am — No Comments
R is the world's most widely used programming language for statistical analysis, predictive modeling and data science. Its popularity is reported in many recent surveys and studies. R is getting more powerful day by day as the number of supported packages grows. Some big IT companies, such as Microsoft and IBM, have also started developing R packages and offering enterprise versions of R.
Added by Deepanshu Bhalla on June 12, 2017 at 12:30am — No Comments
This article explains how to select important variables using the Boruta package in R. Variable selection, also called 'feature selection', is an important step in a predictive modeling project. Every private and public agency has started tracking data and collecting information on various attributes. This results in access to too many predictors for a predictive model. But not every variable is important for predicting a particular outcome. Hence it is essential to…
About two months ago, a new SaaS product arrived: the Keyword Hero. It’s the only solution to “decrypt” the organic keywords in Google Analytics that users searched for in order to get to one’s website. We do so by buying lots of data from sources such as plugins and matching the data with our customers’ sessions in Google Analytics (side note: the entire algorithm was coded in R before we refactored it in Python to allow scalability and operability with AWS).
Added by Daniel Schmeh on May 29, 2017 at 10:00am — No Comments
Summary: Someone had to say it. In my opinion, R is neither the best way to learn data science nor the best way to practice it. More and more large employers agree.
This is a tutorial showing how to implement dashboards in R using the new "flexdashboard" package.
This new library leverages these libraries and allows us to create some stunning dashboards, using interactive graphs and text. What I loved the most was the “storyboard” feature, which allows me to present content in Tableau-style frames. Please note that for this you need to create RMarkdown (.Rmd) files and insert the code using the…