Get invited to our slack community and get access to opportunities and data science insights


sessionize(long timeInSec, long thresholdInSec [, String subject])- Returns a UUID string of a session.

SELECT
sessionize(time, 3600, ip_addr) as session_id,
time, ip_addr
FROM (
SELECT time, ipaddr
FROM weblog
DISTRIBUTE BY ip_addr, time SORT BY ip_addr, time DESC
) t1

Platforms: WhereOS, Spark, Hive
Class: hivemall.tools.datetime.SessionizeUDF

More functions can be added to WhereOS via Python or R bindings or as Java & Scala UDF (user-defined function), UDAF (user-defined aggregation function) and UDTF (user-defined table generating function) extensions. Custom libraries can be added on via Settings-page or installed from WhereOS Store.

Related Post

Leave a Comment