R coming to Visual Studio

R Tools for Visual Studio (RTVS) follows the model of Python Tools for Visual Studio: it’s an open-source plug-in to Visual Studio that makes it a complete IDE for R, with syntax-aware editing, a command-line REPL, and interactive debugging. (Like PTVS, there will be a GitHub repo for RTVS when it’s ready for release.)


Visual Studio R

Of course, RStudio already has excellent capabilities for developing R code, and RTVS isn’t available — yet. But if you already develop in Visual Studio, or want to develop R code alongside C++, JavaScript, Python or any language supported by Visual Studio, send an email to RTVS-Invite@Microsoft.com to sign up for early access to RTVS.

For more go to: Revolutionary Analytics

Facebook data collection and photo network visualization with Gephi and R

In the following tutorial Katherine Ognyanova explains how to collect and visualize data  from Facebook with R and Gephi,  using people’s profile photos as node images (three ways: with Gephi, igraph, or qgraph): Get the full R script here.

1.  Explaining Gephi

2. Collecting Facebook Data

3. Visualizing  the network with Gephi

4. Visualizing  the network with igraph

FB igraph

Static and dynamic network visualization with R

Katherine Ognyanova proposes  a detailed tutorial for network visualization.

1. Basics for Network visualization: goals, types, etc

2.Data format, size and preparation

3.Data set 1 and Data set 2

4. Network visualization with igraph

5. Plotting

Also the code and data can be dowloaded from:

Download this tutorial as a PDF file.

Download the example datasets and R code this tutorial uses.

The code and data are also available on GitHub.

network visualization

Read more at: http://kateto.net/network-visualization


Useful free datasets (Part 2)

Other sites also offer great variaty of datasets free

Yahoo! Labs – Collection of datasets related to language, social, marketing and more. They’re well organized and most of them are hundreds of megabytes in size.

Awesome Publid Datasets – This is a Github repository that’s a list of publicly available datasets organized by category.

Gapminder – Hundreds of datasets on world health, economics, population, etc. All of it is viewable online within Google Docs, and downloadable as spreadsheets.

The Info – Mostly large datasets. The site is losing momentum, but the data available here is still gold.

The Data Hub – Hosted by CKAN. Most of these datasets come from the government.

Datamob – List of public datasets.

Numbrary – Lists of datasets.

Kaggle – Kaggle is a site that hosts data mining competitions. Each competition provides a data set that’s free for download.

SNAP – Stanford’s Large Network Dataset Collection. This list has several datasets related to social networking. Lots of fun in here!

More available datasets at: https://r-dir.com/reference/datasets.html

Useful free datasets (Part 1)

Here I share a list of datasets free for download


American Economic Ass. (AEA): http://www.aeaweb.org/RFE/toc.php?show=complete
Gapminder: http://www.gapminder.org/data/
UMD:: http://inforumweb.umd.edu/econdata/econdata.html
World bank: http://data.worldbank.org/indicator

Data Science Practice

This section contains data sets used in the book “Doing Data Science” by Rachel Schutt and Cathy O’Neil (O’Reilly 2014)
Datasets on the book site: https://github.com/oreillymedia/doing_data_science
Enron Email Dataset: http://www.cs.cmu.edu/~enron/
GetGlue (time stamped events: users rating TV shows): http://bit.ly/1aL8XS0
Titanic Survival Data Set: http://bit.ly/1kJ4pkF
Half a million Hubway rides: http://hubwaydatachallenge.org/trip-history-data/


CBOE Futures Exchange: http://cfe.cboe.com/Data/
Google Finance: https://www.google.com/finance (R)
Google Trends: http://www.google.com/trends?q=google&ctab=0&geo=all&date=all&sort=0
St Louis Fed: http://research.stlouisfed.org/fred2/ (R)
NASDAQ: https://data.nasdaq.com/
OANDA: http://www.oanda.com/ (R)
Quandl: http://www.quandl.com/
Yahoo Finance: http://finance.yahoo.com/ (R)

To view more go to: http://www.inside-r.org/howto/finding-data-internet



Like Alyzer-Free Social Media Tool for Facebook

LikeAlyzer is an online tool for companies that want to be successful on Facebook. It  helps you to measure and analyze the potential and effectiveness of your Facebook Pages


  • It provides daily updated Facebook statistics for your company or other Pages of interest.
  • It enables you to monitor and compare your efforts with those of the world’s popular brands or relevant companies, such as competitors.


Graph Visualization with Gephi

Gephi is an interactive visualization and exploration solution that supports dynamic and hierarchical graphs. It runs on Windows, Linux and Mac OS X. Gephi is open-source and free.


The goal is to help data analysts to make hypothesis, intuitively discover patterns, isolate structure singularities or faults during data sourcing. It is a complementary tool to traditional statistics, as visual thinking with interactive interfaces is now recognized to facilitate reasoning.

  • Real time visualization
  • Layout algorithms (force- based and multi-level)
  • Metric (Betweeness, Closeness, Diameter, Clustering Coefficient, Average shortest path, PageRank, HITS, Community Detection,  Random Generators)
  • Dynamic Network Analysis
  • Create Cartograpy
  • Clustering and hierarchical graphs
  • Dynamic Filtering



To learn more about it, go to http://gephi.org/