WilliamAntonRohm/hdinsight-docs

 
 


HDInsight Developer's Guide

This guide provides a curated set of documentation for any developer, data scientist, or big data engineer who is getting started with Azure HDInsight or growing their experience with it.

The delivery goal of this guide is to package this online content as a digital book.

The table of contents follows. Links to new content open in the same window and stay within GitHub, while links to existing content that will soon be merged into this repo open the Azure Docs.

Overview

Azure HDInsight and Hadoop Architecture

Configuring the Cluster

Configuring Identity and Access Controls

Monitoring and managing the HDInsight cluster

Developing Hive applications

Hive samples

Developing Spark applications

Use Spark with notebooks

Use Spark with IntelliJ

Spark samples

Developing Spark ML applications

Deep Learning with Spark

Developing R scripts on HDInsight

Developing Spark Streaming applications

Optimizing Spark Performance

Use HBase

Use Phoenix with HBase on HDInsight

Apache Open Source Ecosystem

Advanced Scenarios and Deep Dives

Troubleshooting
