I have a very large log (50 TB per day).
My log events look like this:
url,visitid,requesttime
http://www.aaa.com?a=b&c=d&e=f, 1, 1463387380
http://www.aaa.com?a=b&c=d&e=fa, 1, 1463387280
http://www.aaa.com?a=b&c=d&e=fa, 2, 1463387280
http://www.aaa.com?a=b&c=d&e=fab, 2, 1463387280
http://www.aaa.com?a=b&c=d&e=f, 1, 1463387380
When a user enters part of a URL, I need to return the
UV (unique visitors) and PV (page views) for the matching events.
For example:
input: e=f*
output: uv=2, pv=5
input: e=fa
output: uv=2, pv=3
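To make the expected result concrete, here is a rough brute-force sketch over the sample events. It assumes the input is a single key=value pair that is matched as a prefix of that query parameter's value, with the trailing * being optional; that reading is only my interpretation of the two examples, and the name uv_pv is made up for illustration. It is not how I would query 50 TB, only a statement of what the answer should be.

```python
from urllib.parse import urlsplit, parse_qsl

# Sample events copied from the log above: (url, visitid, requesttime).
EVENTS = [
    ("http://www.aaa.com?a=b&c=d&e=f", 1, 1463387380),
    ("http://www.aaa.com?a=b&c=d&e=fa", 1, 1463387280),
    ("http://www.aaa.com?a=b&c=d&e=fa", 2, 1463387280),
    ("http://www.aaa.com?a=b&c=d&e=fab", 2, 1463387280),
    ("http://www.aaa.com?a=b&c=d&e=f", 1, 1463387380),
]


def uv_pv(events, pattern):
    """Return (uv, pv) for events whose query parameter matches the pattern.

    Assumption: pattern is "key=value" or "key=value*", and the value is
    treated as a prefix of the parameter's value in both cases.
    """
    key, _, value = pattern.partition("=")
    prefix = value.rstrip("*")
    visitors = set()  # distinct visitids seen among matching events
    pv = 0            # number of matching events
    for url, visitid, _requesttime in events:
        params = dict(parse_qsl(urlsplit(url).query))
        if key in params and params[key].startswith(prefix):
            visitors.add(visitid)
            pv += 1
    return len(visitors), pv


if __name__ == "__main__":
    for query in ("e=f*", "e=fa"):
        uv, pv = uv_pv(EVENTS, query)
        print(f"input: {query} -> uv={uv}, pv={pv}")
```

Whatever rowkey layout is suggested should give the same numbers as this scan, without reading every event.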
How should I design the rowkey to support this kind of query?
Thanks.