Apple's Siri uses Apache HBase to complete full ring replication around the world in 10 seconds
Siri is a virtual assistant that is part of Apple's iOS, iPadOS, watchOS, macOS, and tvOS operating systems. The assistant uses voice queries and a natural-language user interface to answer questions, make recommendations, and perform actions by delegating requests to a set of Internet services.
Siri Analytics Scale
Provide an accurate and reliable references
Data consumers should be able to iterate fast
Easy to share the new data
Raw data is cleaned, joined, and transformed into one standardized data model for data consumers to query on
Large amounts of request, Data Centers all over the world
Hadoop / YARN Cluster with thousands of nodes
HDFS has hundred of PB
100's TB of raw event data per day
Processing with Spark, Pig, and MapReduce
Apple's Siri uses Apache HBase to complete full ring replication around the world in 10 seconds