16 May 2017

Analyse radio station broadcasts

Parquet columnar storage: only the data that is actually needed gets read

Databricks: create an account => free server with 6 GB of RAM

https://community.cloud.databricks.com/

Time series

warp10.io

github.com/cityzendata

Versatile Data Model

Combines value + time + geographic position in one data point. Timestamps are never regular => BUCKETIZE: transform into regularly spaced ticks
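A minimal plain-Python sketch of the bucketizing idea, not the WarpScript BUCKETIZE function itself. The name `bucketize`, the choice of aligning buckets on multiples of `span`, and the mean as reducer are assumptions made for illustration.

```python
from statistics import mean


def bucketize(points, span, reducer=mean):
    """Group (timestamp, value) pairs into fixed-width buckets.

    Bucket k covers [k*span, (k+1)*span) and is reported at tick k*span.
    Buckets with no data get None so the output stays regularly spaced.
    """
    if span <= 0:
        raise ValueError("span must be positive")
    if not points:
        return []
    buckets = {}
    for ts, value in points:
        buckets.setdefault(ts // span, []).append(value)
    first, last = min(buckets), max(buckets)
    return [
        (k * span, reducer(buckets[k]) if k in buckets else None)
        for k in range(first, last + 1)
    ]


# Irregular timestamps in, one tick every 10 time units out.
irregular = [(3, 10.0), (7, 14.0), (12, 20.0), (31, 40.0)]
regular = bucketize(irregular, span=10)
print(regular)
```

The empty bucket at tick 20 is kept as `None`, which is the kind of gap that then has to be filled.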

Google market-based model: models made available

Quantum IDE. Pig via UDF. WarpScript. Streaming: Apache Storm / Flink

Open source

HBase => built for massive time series, unlike Cassandra

Missing values => fill the gaps with the last value / interpolation
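Both gap-filling strategies can be sketched in a few lines of plain Python. This assumes the series is already on regularly spaced ticks and that missing values are marked with `None`; the function names `fill_last` and `interpolate` are hypothetical, not part of any library mentioned in these notes.

```python
def fill_last(values):
    """Replace each None with the most recent known value (forward fill)."""
    filled, last = [], None
    for v in values:
        if v is not None:
            last = v
        filled.append(last)
    return filled


def interpolate(values):
    """Linearly interpolate runs of None that sit between two known values.

    Leading and trailing None values are left untouched, since there is
    nothing to interpolate from on one side.
    """
    out = list(values)
    known = [i for i, v in enumerate(out) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (out[right] - out[left]) / (right - left)
        for i in range(left + 1, right):
            out[i] = out[left] + step * (i - left)
    return out


series = [1.0, None, None, 4.0, None, 6.0]
print(fill_last(series))
print(interpolate(series))
```

Forward fill suits values that hold until they change (a state, a setting); interpolation suits values that vary continuously between readings.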

Split the series into batches
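One possible way to cut a series into batches, sketched in plain Python. The notes do not say how the batches were defined, so fixed-size batches and the helper name `batches` are assumptions.

```python
from itertools import islice


def batches(series, size):
    """Yield consecutive lists of at most `size` items from `series`."""
    if size <= 0:
        raise ValueError("size must be positive")
    it = iter(series)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


chunks = list(batches(range(7), 3))
print(chunks)
```

Because it works on any iterable, the same helper can be fed a stream of points without loading the whole series in memory.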

OVH (a CityzenData customer)

HBase is the mainstream choice


