
Venu Anuganti

All About Data, Analytics and Databases

Category: BigData

  • November 10, 2014

Why solution matters than technology

5

Unless you completely understand your visitors/users/business by analyzing and forecasting what is most important and applicable, you can't really launch an effective campaign to target the right set of audience or[…]

BigData · Data Analytics · Data Architecture · Data Science · Data Warehouse · Insights

  • May 27, 2014

How to migrate to new (sharded) MongoDB cluster with zero downtime

8

When business data starts growing, a single MongoDB cluster (replica set) can't yield the expected throughput, and the only option left is to start sharding the data. Recently we migrated two different[…]

BigData · Data Analytics · Database · MongoDB · NoSQL

  • April 22, 2014

Must Have Key Business Analytics & Insights – Product Analytics

3

This is the second post in a series of blog posts related to ‘Must Have Key Business Analytics and Insights’, and this post covers product analytics in depth.[…]

BigData · Data Analytics · Data Architecture · Data Warehouse · Database · ETL · Hadoop

  • April 19, 2014

Must Have Key Business Analytics and Insights – Part 1

2

This is the first post in a series of blog posts related to ‘Must Have Key Business Analytics and Insights’. Road to building good big data architecture: In this post I[…]

BigData · Data Analytics · Data Architecture · Data Science · Data Warehouse · ETL · Hadoop · Predictive Analytics

  • April 5, 2014

MongoDB Map-reduce How To Avoid Global Locks

8

Two important features that drive analytics in MongoDB are Aggregation and Map-Reduce. In general, most of the aggregation framework does not require any global write lock, but Map-reduce needs global write[…]

BigData · Data Analytics · Database · MongoDB · NoSQL

  • July 16, 2013

Hadoop Summit 2013 – Hive Authorization

0

This is a series of articles in which I will present various takeaways from Hadoop Summit 2013. The first in this series, Hive authorization models by Thejas Nair of[…]

BigData · Cloudera · Data Warehouse · Hadoop · Hive · Hortonworks

  • June 26, 2013

Realtime Web Stats Using Node.js, Socket.IO and Redis

33

In today’s big data analytics world, it is very important and easy to get real-time stats exposed either to end users to have a better user experience or to internal dashboards to[…]

BigData · Data Analytics · JavaScript · Node.js · Redis · Reporting · Socket.IO

  • May 2, 2013

Exploring SAP HANA – Powering Next Generation Analytics

27

SAP HANA, having entered the data 2.0/3.0 space at the right time, has been getting traction lately, and there will be a lot of users like me who want to[…]

BigData · Data Analytics · Data Warehouse · Database · ETL · Predictive Analytics · Reporting · SAP HANA

  • April 5, 2013

How To Extract Events From Splunk – For Analytics & Reporting

5

Splunk is a leading discovery platform used by the majority of small-to-medium companies as an operational and/or application discovery service. Last week, I was trying to get login stats exposed to BI[…]

BigData · Data Analytics · Data Warehouse · Database · ETL · Log processing · Reporting · Scalability · Splunk

  • December 10, 2012

Data Science vs. Data Analytics

65

As this topic came up a few times this week for discussion at various places, I thought of composing a post on Data Scientist vs. Data Analytics Engineer; even though[…]

BigData · Data Analytics · Data Science · Data Warehouse · Database · Hadoop · MySQL

  • December 2, 2012

Distributed Clustering Services

5

Apart from my consulting as part of ScaleIn, I also invest to bootstrap companies with really disruptive ideas, and in the process met a few database-specific companies who are already[…]

BigData · Database · Hadoop · MySQL · NoSQL · PostgreSQL

  • November 30, 2012

Typical “Big” Data Architecture

119

Here is the typical “Big” data architecture that covers most components involved in the data pipeline. More or less, we have the same architecture in production in a number of places[…]

BigData · Cloudera · Data Analytics · Data Science · Data Warehouse · Database · ETL · Hadoop · Mapreduce · MySQL · NoSQL · PostgreSQL · Reporting · Scalability

  • Categories

    • BigData (14)
    • Cloudera (4)
    • Data Analytics (11)
    • Data Architecture (3)
    • Data Science (5)
    • Data Warehouse (10)
    • Database (256)
    • ETL (5)
    • Hadoop (8)
      • Hive (1)
    • Hardware (4)
    • Hortonworks (1)
    • Insights (1)
    • Log processing (1)
    • Mapreduce (3)
    • Microsoft (1)
    • MongoDB (3)
    • MySQL (77)
    • Node.js (1)
      • JavaScript (1)
    • NoSQL (13)
      • Redis (1)
    • Performance (10)
    • PostgreSQL (3)
    • Predictive Analytics (2)
    • Reporting (4)
    • SAP HANA (1)
    • Scalability (14)
    • ScaleIN (1)
    • Socket.IO (1)
    • Splunk (1)

(C) 2022 venublog.com