Strategy and Build up

Consulting, building algorithms, defining data processes

 
  • Understanding business case
  • Data assessment
  • Capacity planning to meet the end-goals
  • Strategies to improve the time to market
  • Understanding data processes work and define data storage practices
  • Data integration with the current solutions
Integration Management

Infrastructure configuration, Monitoring and Maintenance

 
  • Configuring Hadoop clusters
  • Big Data Application Integration Services
  • Integration with existing Enterprise Data Warehouse and Data Sources
  • Building Real time data pipeline
  • Migration from Relational DB to NoSQL
Data Processing

Batch Data processing / Real Time data processing

 
  • Collect and process gigantic heaps of data that often prove too complex to handle
  • Trend Analysis, Pattern Identification, Payroll and Billing Systems, Weather Forecasting
  • Real-time data processing by deploying the collected business data to drive insights
  • Stock Market Analytics, Real-time taxi booking
  • Serverless ETL Process Definition
  • Make scaling easier by processing high-velocity and high-volume transactions and events more efficiently and in a faster time frame
  • Orchestration using Airflow
AI/Machine Learning

Analyse Hybrid Data, Foresee and tackle setbacks.

 
  • Machine Learning/Data Mining
  • Data Modelling
  • Predictive and Prescriptive Analytics
  • Analytics Optimization
Data Analysis

Data Presentation, Provide Actionable Insights

 
  • Analytics, Dash-boarding & Alerting
  • Informed decision making by leveraging BI
  • Intelligent and personalized insights about gaps and opportunities for improvement in processes

Technology Stack

Data Engineering

  • Spark
  • HDFS
  • HIVE
  • HBASE
  • Presto
  • Scribe
  • Pig
  • Mapreduce
  • Sqoop
  • Flume
  • Talend
  • Informatica
  • Pentaho
  • Airflow
  • Apache Kafka

Data Warehouses

  • Impala
  • Hp Vertica
  • Aws Redshift
  • Azure SQL DW

Data Modelling Tools

  • Erwin,
  • Erstudio
  • Omnigraffle
  • Idef1x
  • Ddl
  • Dml
  • Uml

Data Analytics

  • Tableau
  • Power BI
  • Spotfire
  • SSRS
  • Microstrategy
  • Looker
  • Filebeat
  • Grafana
  • AWS Quicksight
  • Azure Stream Analytics
  • Data Lake Analytics
  • Azure Analysis Services

Monitoring Tools

  • Icinga
  • Nagios
  • Ganglia
  • Graphite
  • Prometheus

NoSQL DB

  • Mongodb
  • Cassandra

Development Languages

  • Python
  • Scala
  • Java
  • R

Log Management

  • Splunk
  • Logstash