• Home
  • Consulting
  • Presentations
  • About
  • Contact Me!

Venu Anuganti Blog

All About Data, Databases, Data Architecture, Data Analytics, SQL, NoSQL and Big Data

  • Follow Venu Anuganti In TwitterFollow Venu Anuganti In Twitter
  • Connect With Venu Anuganti On LinkedInConnect With Venu Anuganti On LinkedIn
  • Subscribe To RSS FeedsSubscribe To RSS Feeds
  • Conact Venu AnugantiConact Venu Anuganti
  • Follow Venu Anuganti In SlideshareFollow Venu Anuganti In Slideshare

Category: Mapreduce

bigdata-scalein-architecture
  • November 30, 2012

Typical “Big” Data Architecture

119

Here is the typical “Big” data architecture, that covers most components involved in the data pipeline. More or less, we have the same architecture in production in number of places[…]

BigData · Cloudera · Data Analytics · Data Science · Data Warehouse · Database · ETL · Hadoop · Mapreduce · MySQL · NoSQL · PostgreSQL · Reporting · Scalability

elephant_rgb_sq
  • October 27, 2012

Hadoop NameNode – How A Space Can Ruin Your Life

2

I was setting up a new test cluster other day using the latest development branch (1.0.4 tag) of hadoop, to test the new patch which extends the balancer code to add[…]

BigData · Cloudera · Hadoop · Mapreduce

  • July 19, 2010

MapReduce – DBInputFormat – Serialization on readers

8

Last week I was working on EC2 MySQL server where one of the slave is taking lot of time to catch-up; and only job that is running on that server[…]

BigData · Cloudera · Data Analytics · Data Science · Database · Hadoop · Mapreduce · MySQL · NoSQL · Scalability

  • ScaleIN Consulting

    ScaleIN
  • Recent Posts

    • Why solution matters than technology
    • How to migrate to new (sharded) MongoDB cluster with zero downtime
    • Must Have Key Business Analytics & Insights – Product Analytics
    • Must Have Key Business Analytics and Insights – Part 1
    • MongoDB Map-reduce How To Avoid Global Locks
  • Categories

    • BigData (14)
    • Cloudera (4)
    • Data Analytics (11)
    • Data Architecture (3)
    • Data Science (5)
    • Data Warehouse (10)
    • Database (256)
    • ETL (5)
    • Hadoop (8)
      • Hive (1)
    • Hardware (4)
    • Hortonworks (1)
    • Insights (1)
    • Log processing (1)
    • Mapreduce (3)
    • Microsoft (1)
    • MongoDB (3)
    • MySQL (77)
    • Node.js (1)
      • JavaScript (1)
    • NoSQL (13)
      • Redis (1)
    • Performance (10)
    • PostgreSQL (3)
    • Predictive Analytics (2)
    • Reporting (4)
    • SAP HANA (1)
    • Scalability (14)
    • ScaleIN (1)
    • Socket.IO (1)
    • Splunk (1)
  • Recent Comments

    • @wiwer77 on Realtime Web Stats Using Node.js, Socket.IO and Redis
    • @BigDataBatman on Typical “Big” Data Architecture
    • @jackverr54 on Typical “Big” Data Architecture
    • @AdnaneMahmoudi on Realtime Web Stats Using Node.js, Socket.IO and Redis
    • @BigDataTweetBot on Realtime Web Stats Using Node.js, Socket.IO and Redis
  • Archives

  • Database Consulting

    • SQL Consulting
    • NoSQL Consulting
    • Database Services
  • Big Data, Data Analytics

    • Big Data Consulting
    • Data Analytics, Warehouse
    • Data Architecture
  • About

    • About
      • Presentations
    • Contact Me!
  • Follow Venu Anuganti In TwitterFollow Venu Anuganti In Twitter
  • Connect With Venu Anuganti On LinkedInConnect With Venu Anuganti On LinkedIn
  • Subscribe To RSS FeedsSubscribe To RSS Feeds
  • Conact Venu AnugantiConact Venu Anuganti
  • Follow Venu Anuganti In SlideshareFollow Venu Anuganti In Slideshare

© Copyright 2005-2013 http://venublog.com/