monotonically_increasing_id

monotonically_increasing_id() – Returns monotonically increasing 64-bit integers. The generated ID is guaranteed to be monotonically increasing and unique, but not consecutive. The current implementation puts the partition ID in the upper 31 bits, and the lower 33 bits represent the record number within each partition. The assumption is that the data frame has less than 1 billion partitions, and each partition has less than 8 billion records. The function is non-deterministic because its result depends on partition IDs.

Platforms: WhereOS, Spark, Hive

Class: org.apache.spark.sql.catalyst.expressions.MonotonicallyIncreasingID

More functions can be added to WhereOS via Python or R bindings or as Java & Scala UDF (user-defined function), UDAF (user-defined aggregation function) and UDTF (user-defined table generating function) extensions. Custom libraries can be added on via Settings-page or installed from WhereOS Store.

View the complete guide of WhereOS functions.

No Comments

TAGS : hive spark sql

monotonically_increasing_id

Related Post

tile

vector_dot

rand

Leave a Comment Cancel reply

Articles by Category

Recent Posts

How WhereOS and JCDecaux

Data fusion: how integra

percentile_approx

Experimentation In Produ

Open Data Utilization

WhereOS

Navigation

Contact Us