Jason Edelman's Blog
  • Home
  • About
  • Contact

A Quickstart for Building Networks for Hadoop Clusters

12/21/2012

1 Comment

 
I was trying to prepare for a meeting a few weeks back on the topic of Hadoop; its impact on the network was supposed to be a large part of the conversation.  Needless to say, it wasn’t, but during the preparation, I was looking for material specific to building and designing networks specific for Hadoop environments.

Since there aren’t that many articles out there on the subject, I figured I’d share what I was able to find.
Understanding Hadoop Clusters and the Network – by Brad Hedlund

Architecting a Network for Hadoop – by Stuart Miniman

Considering 10GE Hadoop clusters and the network – by Brad Hedlund

Big Data in the Enterprise: Network Design Considerations – by Cisco

Dell Force10 Hadoop Network – The Scale out Solution – by Dell (looks like more of Brad’s work)

Network Design Considerations for Hadoop ‘Big Data Clusters’ and the Hadoop File System – by Arista

Arista – Hadoop Cluster Applications – by Arista (couldn’t find this one again because I first found this several months ago, so it is attached right below).
aristahadoopapplication_tn.pdf
File Size: 910 kb
File Type: pdf
Download File

Cloudera Hadoop Infrastructure with Cisco UCS Servers and Cisco Nexus Switches Powered by Intel Xeon Processors

For those other SDN aficionados out there, you may like this next one (combines optical switching, SDN, and big data):  
          Programming Your Network at Run-time for Big Data Applications  by IBM T.J Watson Research Center and Rice University.  Sounds like Plexxi would be a good fit here. 

This is not meant to be an exhaustive list of all resources out there, but just a starting point.  I highly recommend reading Brad’s first and then even reading some general Hadoop whitepapers that aren’t focused on the network to really understand the application itself.  As you can see from the documents above, the vendors still recommend their same switches for Hadoop environments, but it is important to be knowledgeable as a network person when interfacing with the application folks.  You should always try and be relevant and do your best to understand the applications that ride over the network.  That’s why we are building the networks anyway.

For those stepping into a meeting in the next hour and need a few high level bullet points, here you go:

  • Understand amount of servers, amount per rack, and how many NICs per server are to be used
  • Understand overall growth
  • It’s common to deploy TOR switches – either 1 or 2 per rack based on server density and importance of the cluster to the business (just like in other parts of the DC)
  • Hadoop has no dependencies on Layer 2 connectivity  – use layer 3 whenever possible
  • Small deployments (just a few racks) can take advantage of L3 in the aggregation layer
  • Larger deployments will have L3 down to the TOR switch
  • 1GE is most common in today’s Hadoop environments, but 10GE is gaining traction
  • Hadoop environments are largely all bare metal – no virtualization 
  • Build out to be non-blocking whenever possible; large amounts of data will be shuffled across the network should there be a “rack” failure, e.g. single TOR switch fails
  • Take into consideration the traffic is bursty – use switches with optimized buffers 

Regards,
Jason

Follow me on Twitter: @jedelman8


1 Comment
PC Repairs Alabama link
10/23/2022 04:24:46 am

Hi greeat reading your post

Reply



Leave a Reply.

    Author

    Jason Edelman, Founder & CTO of Network to Code. 


    Enter your email address:

    Delivered by FeedBurner


    RSS Feed


    Categories

    All
    1cloudroad
    2011
    2960
    40gbe
    7000
    Arista
    Aruba
    Big Switch
    Brocade
    Capwap
    Christmas
    Cisco
    Controller
    Data Center
    Dell Force10
    Embrane
    Extreme
    Fex
    Hadoop
    Hp
    Ibm
    Isr G2
    Juniper
    Limited Lifetime Warranty
    Meraki
    Multicast
    N7k
    Nexus
    Nicira
    Ons
    Opendaylight
    Openflow
    Openstack
    Presidio
    Qsfp
    Quick Facts
    Routeflow
    Sdn
    Sdn Ecosystem
    Security
    Ucs


    Archives

    May 2015
    April 2015
    February 2015
    January 2015
    December 2014
    November 2014
    October 2014
    September 2014
    August 2014
    June 2014
    May 2014
    March 2014
    February 2014
    January 2014
    December 2013
    November 2013
    October 2013
    September 2013
    August 2013
    July 2013
    June 2013
    May 2013
    April 2013
    March 2013
    February 2013
    January 2013
    December 2012
    November 2012
    October 2012
    June 2012
    May 2012
    April 2012
    March 2012
    February 2012
    January 2012
    December 2011
    November 2011


    View my profile on LinkedIn
Photo used under Creative Commons from NASA Goddard Photo and Video