1. Thou shalt decide on a business problem before committing any time, money, or animal sacrifice to the project. 2. Thou shalt start small and build, committing not to cataloging the Internet on thy first project. 3. Thou shalt not spend unnecessarily. Open Source...
Month: June 2014
Recent Articles
Introduction to R for Data Mining: 3 YouTube videos to learn R
Are you new to R? Are you planning to learn R by yourself? Watch these videos...
Learning Data Mining: 12 books on R
R is a free and widely used programming language for data analysis and statistics. It is a dynamically typed interpreted language that possesses an extensive catalog of statistical and graphical methods. In this post, we list 12 books to help you learn R and your data...
Big Data can predict risk of metabolic syndrome
Research published today in the American Journal of Managed Care demonstrates that analysis of patient records using state-of-the-art data analytics can predict future risk of metabolic syndrome. More than a third of the U.S. population has metabolic syndrome, a...
Robots can learn faster when fed a lot of data
A trio of research projects out of Cornell, MIT and the University of Washington highlight the promise of building robots that can learn to do the things we want them to, but also suggest that patience on behalf of programmers will be a real virtue. Like any...
Airlines get the most from Big Data to improve passenger experience & increase revenues
Big Data is something we hear a lot about in the travel sector. As airlines face increasing pressure from travellers to improve the standards of air travel and create a more personal experience, while at the same time being tasked with finding new ways of generating...
Top 33 most noted Data Scientists on Twitter
Do you follow these 33 most notable Data Scientists on Twitter? 1. Hilary Mason @hmason Data Scientist in Residence at @accel. I ♥ data and cheeseburgers. 2. John Myles White @johnmyleswhite Scientist at Facebook and Julia developer. Author of Machine Learning for...
What are the skill sets data scientists need to succeed?
Thousands of new data jobs will be created in the next couple of years, making “data scientist” one of the hottest emerging job titles. New data jobs will center on “analytics, data architects, data scientists [and] data modelers,” according to Inhi Cho Suh, Vice...
Data in crime fighting: Beyond Minority Report
When we discuss Big Data in crime fighting, the analogy of Minority Report, the 2002 Tom Cruise film, always comes up. This is the idea that it would be possible to predict who is going to commit a crime and when meaning that law enforcement can stop these crimes...
Top 25 most popular Data Science blogs
Looking for most popular, frequently updated and insightful Data Science blogs? Here is our list. The blogs are arranged in no particular order. 1. FiveThirtyEight 2. The Numbers Guy 3. Freakonomics 4. OKCupid 5. DataTau 6. Data Ranker 7. Beyond The Purchase 8. Data...
Learn Hadoop with 10 SlideShare presentations
Want to learn Hadoop? Watch these presentations on SlideShare to understand Hadoop HDFS, the MapReduce algorithm, the Pig Latin language, and the Hive SQL language. 1. Introduction to MapReduce, an Abstraction for Large-Scale Computation by Ilan Horn, Google...
Best LinkedIn groups all Hadoop experts should join
There are hundreds of Hadoop groups on LinkedIn, but these are the best ones you should definitely consider joining. Join them to learn about the latest happenings in the world of Hadoop, and engage in discussions with other professionals online. 1. Hadoop Users Group...
Download 30 papers on Machine Learning
Are you looking for some serious research papers on Machine Learning? Here you go...! 1. Optimized projections for compressed sensing 2. Multiscale sparse image representation with learned dictionaries 3. Robust face recognition via sparse representation 4. Feature...
Why is climate change big data’s biggest challenge?
Global sea levels are about eight inches higher today than they were in 1880, and they are expected to rise another two to seven feet during this century. At the same time, some 5 million people in the U.S. live in 2.6 million coastal homes situated less than 4 feet...
Big Data and its threat to The Fourth Amendment
John Villasenor, Professor of Public Policy at UCLA, recently projected that it will be possible for governments to record and analyze everything an individual says and does in the near future. In the wake of the NSA revelations you could argue this already exists, so...
Big Data and its threat to The Fourth Amendment
John Villasenor, Professor of Public Policy at UCLA, recently projected that it will be possible for governments to record and analyze everything an individual says and does in the near future. In the wake of the NSA revelations you could argue this already exists, so...
Big Data and its threat to The Fourth Amendment
John Villasenor, Professor of Public Policy at UCLA, recently projected that it will be possible for governments to record and analyze everything an individual says and does in the near future. In the wake of the NSA revelations you could argue this already exists, so...
Top 17 white papers on Data Science
Check out our list of 17 White Papers on Data Science! You can download them by clicking on the links. If we have missed any, please feel free to add. 1. Building Data Science Teams by DJ Patil (2011) 2. Next-Gen Data Scientists by Dr. Rachel Schutt (2013) 3. Keep...
Can Big Data save newspapers from extinction?
For four years I had the job growing up that a lot of young men had—I was a “paperboy” for the local newspaper, in this case The Pittsburgh Press. This was back in the Stone Age, as my kids would call it, before the Internet when the news actually came to your front...
Five ways to become an effective database administrator
Big data, machine data, small data, personal data, corporate data; data is everywhere and it's the centerpiece of so many businesses. The question is, who is looking after it? The explosion of data hasn't seen a corresponding growth in the size of IT teams, so it's...
Top 30 simple Data Visualization tools
Data visualization is the presentation of quantitative information in a graphical form. A data visualization tool turns large and small datasets into visuals that are easier for the human brain to understand and process. Now, are you looking for a simple tool to...
Top 30 simple Data Visualization tools
Data visualization is the presentation of quantitative information in a graphical form. A data visualization tool turns large and small datasets into visuals that are easier for the human brain to understand and process. Now, are you looking for a simple tool to...
Top 30 simple Data Visualization tools
Data visualization is the presentation of quantitative information in a graphical form. A data visualization tool turns large and small datasets into visuals that are easier for the human brain to understand and process. Now, are you looking for a simple tool to...
How to install a Virtual Apache Hadoop Cluster with Vagrant and Cloudera Manager
It’s been a while since we provided a how-to for this purpose. Thanks, Daan Debie (@DaanDebie), for allowing us to re-publish the instructions below (for CDH 5)! I recently started as a Big Data Engineer at The New Motion. While researching our best options for...
How to install a Virtual Apache Hadoop Cluster with Vagrant and Cloudera Manager
It’s been a while since we provided a how-to for this purpose. Thanks, Daan Debie (@DaanDebie), for allowing us to re-publish the instructions below (for CDH 5)! I recently started as a Big Data Engineer at The New Motion. While researching our best options for...
Top 35 invaluable books on Data Visualization
Are you a hard-core enthusiast of data visualization? Or a beginner, who wants to learn and be able to create more effective visualisations? Check out our list of 35 invaluable books you must read for better visualization. (Some books in the list might not be directly...
Top 10 most popular myths about Hadoop
Hadoop and Big Data are practically synonymous these days. There is so much info on Hadoop and Big Data out there, but as the Big Data hype machine gears up, there's a lot of confusion about where Hadoop actually fits into the overall Big Data landscape. Let’s have a...
Deploying Big Data To Recruit And Retain Talent
Big Data is the buzzword of the year. Every leader — whether they’re managing a small team or are at the helm of a multinational corporation with thousands of employees — is wondering how they can use Big Data to better get to know their people, to create a setting...
Deploying Big Data To Recruit And Retain Talent
Big Data is the buzzword of the year. Every leader — whether they’re managing a small team or are at the helm of a multinational corporation with thousands of employees — is wondering how they can use Big Data to better get to know their people, to create a setting...
AlgoMost Contest: Can you predict future company acquisitions?
Looking for a challenging Big Data contest? Well, here is one! International data mining platform AlgoMost has launched a new competition focusing on M&A (Mergers and Acquisitions). The goal of the competition is to predict what companies are more likely to be...
How to make better pricing decisions using Big Data
It’s hard to overstate the importance of getting pricing right. On average, a 1 percent price increase translates into an 8.7 percent increase in operating profits (assuming no loss of volume, of course). Yet we estimate that up to 30 percent of the thousands of...
Watson Mobile Developer Challenge Q&A: CrayonData
Did you miss the online streaming of Crayon founder Suresh Shankar’s presentation of Maya – Crayon’s Personal Choice Assistant, before IBM’s Watson Mobile Developer Challenge judging panel last month? Here is the Youtube video.
How brain science will change computing
Treo creator Jeff Hawkins urges us to take a new look at the brain — to see it not as a fast processor, but as a memory system that stores and plays back experiences to help us predict, intelligently, what will happen next.
How brain science will change computing
Treo creator Jeff Hawkins urges us to take a new look at the brain — to see it not as a fast processor, but as a memory system that stores and plays back experiences to help us predict, intelligently, what will happen next.
Top 10 LinkedIn groups every Data Scientist should join
Are you a busy data scientist, building mathematical models and machine learning algorithms to run experiments on massive datasets? And are you struggling to find a meaningful way to stay up-to-date? Well, here is a list of Top 10 LinkedIn groups for you to stay...
For the airline industry, big data is cleared for take-off
When a customer checks into a flight with United Airlines UAL 1.34% , there is typically an array of potential add-on offers to navigate through: flight upgrades, access to the airline’s United Club, and more. Under United’s old “collect and analyze” approach to data,...
Building a basic recommendation engine using R
In our day to day life, we come across a large number of Recommendation engines like Facebook Recommendation Engine for Friends’ suggestions, and suggestions of similar Like Pages, Youtube recommendation engine suggesting videos similar to our previous...
How does a Choice Engine overcome choice problems?
Consider a typical decision – you have just arrived in a new city after a 6 hour flight, it’s 6.30 PM and you want to get a decent vegetarian meal. What do you do?You fire up Google, search for good restaurants in that city, and filter them by type, price, location...
The Metaphysics of Big Data: The Problem of Induction
How do we, human beings, acquire knowledge? Think for a second! And don’t tell me that it is from text books, TV, internet or newspapers etc!!! Scholars say that there are mainly (debatably) five valid ways (Perception, Inference, Comparison, Verbal Testimony and...
How Data Visualization Helped Me Run Faster
Could you use a data visualization lift in your life and work? I never thought I'd be that guy who tracks his fitness data. But a few months ago I tapped on the Nike+ app on my iPod Nano and after about a mile I was a running data junkie. The lure of easy information...
37 colleges to fulfill your dream of becoming a Data Scientist
Are you looking into a career as a data scientist and to become one of the most sought-after people in Big Data landscape? Here is a complete list of colleges/universities that offer Data Science degrees. Click on the links to find more details about the courses. 1....
The Era of Cloud Computing
SynapDx searches hundreds of thousands of genetic markers, looking for clues about autism in 880 children across 20 states. A few years ago, this would be the task of a major company or research institution. Thanks to cloud computing, the start-up in Lexington, Mass.,...
Understanding the power of Hadoop as a Service
Across a wide range of industries from health care and financial services to manufacturing and retail, companies are realizing the value of analyzing data with Hadoop. With access to a Hadoop cluster, organizations are able to collect, analyze, and act on data at a...
70+ websites to get large data repositories for free
Do you require GBs of data to check the performance of your app? The easiest way is to download samples of data from free data repositories available on the Web. But the main disadvantage of this approach is the data will have very less unique content and it may not...
Top 18 free and widely used, open source NoSQL databases
NoSQL is a new breed of database management systems that fundamentally differ from relational database systems. These NoSQL databases do not require tables with a fixed set of columns, avoid JOINs and typically support horizontal scaling. They are also referred to as...
Free video tutorials on Data Mining
Looking for free video tutorials on Data Mining...? Here is a list of TOP 10 SOURCES that provide free Data Mining video tutorials on the Web. They explain how to perform data mining tasks (classification, clustering, association rule and sequential pattern discovery)...
Artificial Intelligence raises new hope for cancer patients
On Monday mornings, Bob Michaels walks into the infusion center at Weill Cornell Medical College in New York City and takes a seat in a comfortable barcalounger. An oncology nurse connects the port implanted in the retired university professor’s chest to a portable IV...
Artificial Intelligence raises new hope for cancer patients
On Monday mornings, Bob Michaels walks into the infusion center at Weill Cornell Medical College in New York City and takes a seat in a comfortable barcalounger. An oncology nurse connects the port implanted in the retired university professor’s chest to a portable IV...
Artificial Intelligence raises new hope for cancer patients
On Monday mornings, Bob Michaels walks into the infusion center at Weill Cornell Medical College in New York City and takes a seat in a comfortable barcalounger. An oncology nurse connects the port implanted in the retired university professor’s chest to a portable IV...
Medicine’s Big Problem with Big Data: Information Hoarding
Researchers at IBM, Berg Pharma, Memorial Sloan Kettering, UC Berkeley and other institutions are exploring how artificial intelligence and big data can be used to develop better treatments for diseases. But one of the biggest challenges for making full use of these...