Skip to content

Big Data

Analytics And More
  • Home
  • Spark
  • Design Patterns
  • streaming
  • Map Reduce
  • Hive
  • Hdfs & Yarn
  • Pig
  • Oozie
  • Hbase

Category: Data Analytics

pig tutorial 12 – pig example to implement custom load function

July, 2017 adarsh

The Pig load/store API is aligned with Hadoop’s InputFormat and OutputFormat classes. This enables you to create new LoadFunc and…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

pig tutorial 11 – pig example to implement custom filter functions

adarsh

Filter functions are eval functions that return a boolean value. Filter functions can be used anywhere a Boolean expression is…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

pig tutorial 10 – pig aggregate function with algebraic interface and accumulator interface

adarsh

An aggregate function is an eval function that takes a bag and returns a scalar value. One interesting and useful…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

pig tutorial 9 – pig example to implement custom eval function for foreach

adarsh

Eval is the most common type of function. It can be used in FOREACH statements.Lets take an example Input Data…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

pig tutorial 8 – performance tuning and optimizing a pig script

adarsh

Below are the pointers which can be considered to optimize a pig script 1. If types are not specified in…

Continue Reading →

Posted in: Data Analytics, performance tuning, Pig, pig latin Filed under: pig latin, pig performance tuning, pig script

pig tutorial 7 – pig load and store functions with compression and shell, file and utility commands

adarsh

Load/Store functions determine how data goes into Pig and comes out of Pig. Pig provides a set of built-in load/store…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

pig tutorial 6 – Eval Functions AVG, CONCAT, COUNT, COUNT_STAR, DIFF, IsEmpty, MAX, MIN, SIZE, SUM and TOKENIZE

adarsh 1 Comment

AVG Use the AVG function to compute the average of the numeric values in a single-column bag. AVG requires a…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

Post navigation

Page 19 of 26
← Previous 1 … 18 19 20 … 26 Next →

Recent Posts

  • Optimization for Using AWS Lambda to Send Messages to Amazon MSK
  • Rebalancing a Kafka Cluster in AWS MSK using CLI Commands
  • Using StsAssumeRoleCredentialsProvider with Glue Schema Registry Integration in Kafka Producer
  • Home
  • Contact Me
  • About Me
Copyright © 2017 Time Pass Techies