-
free-programming-books Public
Forked from EbookFoundation/free-programming-books
📚 Freely available programming books
Other Updated Mar 25, 2017 -
Machine-Learning-with-Python Public
Forked from PabloGalan/Machine-Learning-with-Python
Jupyter Notebook Updated Mar 20, 2017 -
hadoopecosystemtable.github.io Public
Forked from hadoopecosystemtable/hadoopecosystemtable.github.io
This page is a summary to keep track of Hadoop-related projects and relevant projects around the Big Data scene, focused on the open source, free software environment.
HTML Apache License 2.0 Updated Nov 2, 2016 -
JustEnoughScalaForSpark Public
Forked from deanwampler/JustEnoughScalaForSpark
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Apache License 2.0 Updated Oct 27, 2016 -
high-performance-spark-examples Public
Forked from high-performance-spark/high-performance-spark-examples
Examples for High Performance Spark
Scala Other Updated May 25, 2016 -
DataSciencePython Public
Forked from ujjwalkarn/DataSciencePython
Common data analysis and machine learning tasks using Python
Python MIT License Updated May 23, 2016 -
utad-spark-ml Public
Forked from chicochica10/utad-spark-ml
Machine learning course with Spark for UTAD.
Jupyter Notebook Updated Mar 12, 2016 -
Hive-JSON-Serde Public
Forked from rcongiu/Hive-JSON-Serde
Read - Write JSON SerDe for Apache Hive.
Java Other Updated Feb 21, 2016 -
kerberos_and_hadoop Public
Forked from steveloughran/kerberos_and_hadoop
Kerberos and Hadoop: The Madness beyond the Gate
Apache License 2.0 Updated Nov 17, 2015 -
project-interoperability.github.io Public
Forked from Project-Interoperability/project-interoperability.github.io
Project Interoperability: A Start-Up Guide to Info Sharing
CSS Creative Commons Zero v1.0 Universal Updated Aug 11, 2015 -
theforeman.org Public
Forked from theforeman/theforeman.org
The new and improved Foreman website.
HTML Other Updated Jul 16, 2015 -
spark-exercises Public
Forked from ceteri/spark-exercises
Coding exercises for Apache Spark
Python Other Updated Jun 4, 2015 -
clean-hadoop-tmp Public
Forked from nmilford/clean-hadoop-tmp
Cleans up data older than N seconds in /tmp on HDFS.
Ruby Updated Apr 21, 2015 -
bigtop Public
Forked from apache/bigtop
Mirror of Apache Bigtop
Shell Apache License 2.0 Updated Apr 17, 2015 -
tutorial-dplyr-es Public
Forked from fdelaunay/tutorial-dplyr-es
Tutorial in Spanish for learning to use dplyr (R), as well as GitHub and how to organize data (tidy)
R GNU General Public License v2.0 Updated Apr 17, 2015 -
sequenceiq-samples Public
Forked from sequenceiq/sequenceiq-samples
SequenceIQ Hadoop examples
Java Apache License 2.0 Updated Apr 16, 2015 -
simplesparkavroapp Public
Forked from sryza/simplesparkavroapp
Simple Spark app that reads and writes Avro data
Scala Updated Apr 13, 2015 -
tarjetasblack Public
A data-only R package containing the transactions of the infamous Spanish "black" credit cards
R GNU General Public License v2.0 Updated Mar 15, 2015 -
wirbelsturm Public
Forked from miguno/wirbelsturm
Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data related infrastructure.
Shell Other Updated Mar 10, 2015 -
kafka-storm-starter Public
Forked from miguno/kafka-storm-starter
Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Scala Other Updated Jan 20, 2015 -
simplesparkapp Public
Forked from sryza/simplesparkapp
Simple Spark Application
Java Apache License 2.0 Updated Aug 12, 2014 -
benchmark Public
Forked from amplab/benchmark
Large scale query engine benchmark
Python Updated Jul 17, 2014 -
hive-testbench Public
Forked from cartershanklin/hive-testbench
Testbench for experimenting with Apache Hive at any data scale.
Java Updated Jul 16, 2014 -
HiBench Public
Forked from Intel-bigdata/HiBench
HiBench is a Hadoop benchmark suite.
Java Other Updated Jun 11, 2014 -
bdr-action Public
Forked from jholoman/bdr-action
Java Util for automated BDR schedules
Java Updated Jun 8, 2014 -
Beetest Public
Forked from kawaa/Beetest
A super simple utility for testing Apache Hive scripts locally for non-Java developers.
Java Updated Jun 6, 2014