A distributed system for counting the number of occurrences of words in a given text file
Updated Nov 30, 2016 - Java
Big data project on the Expedia data set provided for the Kaggle Hotel Recommendations competition: https://www.kaggle.com/c/expedia-hotel-recommendations
Crawl Clean and Learn is a basic data mining project in which data is crawled, cleaned, and analyzed to gain insight into it.
[Study] Daily plan for practicing Hadoop.
Distributed parallel algorithm that predicts the presence or absence of the Red-winged Blackbird in each birding session with 83% accuracy
Hadoop MapReduce jobs that derive statistics from the Yelp Dataset
A resource manager log analyzer using Hadoop MapReduce.
PageRank on Hadoop
Visual Studio solution with .NET implementations of classic parallel algorithms and a tool for their estimation
A Hadoop project that handles very large data sets and constructs a red-black tree. A script is available to automate iterative MapReduce jobs.