Skip to content

Big Data

Analytics And More
  • Home
  • Spark
  • Design Patterns
  • streaming
  • Map Reduce
  • Hive
  • Hdfs & Yarn
  • Pig
  • Oozie
  • Hbase

Category: Pig

oozie workflow example for pig action with end to end configuration

August, 2017 adarsh

Oozie’s Pig action runs a Pig job in Hadoop.Pig action requires you to bundle the Pig script with all the…

Continue Reading →

Posted in: Data Analytics, Oozie, Pig Filed under: oozie workflow, pig latin

pig tutorial 13 – pig example to implement custom store function

July, 2017 adarsh

The Pig load/store API is aligned with Hadoop’s InputFormat and OutputFormat classes. This enables you to create new LoadFunc and…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

pig tutorial 12 – pig example to implement custom load function

adarsh

The Pig load/store API is aligned with Hadoop’s InputFormat and OutputFormat classes. This enables you to create new LoadFunc and…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

pig tutorial 11 – pig example to implement custom filter functions

adarsh

Filter functions are eval functions that return a boolean value. Filter functions can be used anywhere a Boolean expression is…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

pig tutorial 10 – pig aggregate function with algebraic interface and accumulator interface

adarsh

An aggregate function is an eval function that takes a bag and returns a scalar value. One interesting and useful…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

pig tutorial 9 – pig example to implement custom eval function for foreach

adarsh

Eval is the most common type of function. It can be used in FOREACH statements.Lets take an example Input Data…

Continue Reading →

Posted in: Data Analytics, Pig, pig latin Filed under: pig latin, pig script

pig tutorial 8 – performance tuning and optimizing a pig script

adarsh

Below are the pointers which can be considered to optimize a pig script 1. If types are not specified in…

Continue Reading →

Posted in: Data Analytics, performance tuning, Pig, pig latin Filed under: pig latin, pig performance tuning, pig script

Post navigation

Page 1 of 2
1 2 Next →

Recent Posts

  • Optimization for Using AWS Lambda to Send Messages to Amazon MSK
  • Rebalancing a Kafka Cluster in AWS MSK using CLI Commands
  • Using StsAssumeRoleCredentialsProvider with Glue Schema Registry Integration in Kafka Producer
  • Home
  • Contact Me
  • About Me
Copyright © 2017 Time Pass Techies