Many companies dump data into S3 thinking, "we'll figure out how to use it later." They often never do. It's too complicated to query S3 data directly (e.g. with Spark), and it's too expensive to pipe it into a data warehouse (e.g. Snowflake).
In the vast, sprawling library that is your S3 data lake, volumes of information sit gathering dust on the shelves. Each book holds valuable insights, but without a card catalog or a knowledgeable librarian, finding the right nugget of data is like searching for a needle in a haystack. You wander the aisles aimlessly, pulling out books at random, hoping to stumble upon the answers you seek.
You need an AI librarian. One that will organize your library and guide you to insights. Enter Definite: the missing card catalog for your S3 data library.
SELECT passenger_count, trip_distance, total_amount
FROM read_parquet('s3://your-bucket/taxi_trips/2023-06.parquet')
WHERE company = 'Uber'
ORDER BY trip_distance DESC
LIMIT 10;
cubes:
  taxi_trips:
    measures:
      trip_count:
        type: count
      average_distance:
        type: average
        sql: trip_distance
      total_revenue:
        type: sum
        sql: total_amount
    dimensions:
      company:
        type: string
      day_of_week:
        type: string
      passenger_count:
        type: number
This code defines a taxi_trips cube with measures like trip count, average distance, and total revenue, along with dimensions like company, day of the week, and passenger count. This cube becomes a versatile tool for analyzing various aspects of your taxi trip data.
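To make the idea concrete, here is a minimal sketch of how a cube like this could be turned into SQL: each measure becomes an aggregate and each dimension becomes a GROUP BY column. This is an illustration under stated assumptions, not Definite's actual implementation. The names CUBE, AGGREGATES, compile_query, and demo are hypothetical, the sample rows are made up, and SQLite stands in for the query engine only so the example runs without extra dependencies. Against real data, the table would be something like the read_parquet(...) source from the query above.

```python
import sqlite3

# Hypothetical in-memory mirror of the taxi_trips cube defined above.
CUBE = {
    "measures": {
        "trip_count": {"type": "count"},
        "average_distance": {"type": "average", "sql": "trip_distance"},
        "total_revenue": {"type": "sum", "sql": "total_amount"},
    },
    "dimensions": {
        "company": {"type": "string"},
        "day_of_week": {"type": "string"},
        "passenger_count": {"type": "number"},
    },
}

# Assumed mapping from a measure type to a SQL aggregate function.
AGGREGATES = {"count": "COUNT", "average": "AVG", "sum": "SUM"}


def compile_query(cube, table, measures, dimensions):
    """Build a GROUP BY query for the requested measures and dimensions."""
    for dim in dimensions:
        if dim not in cube["dimensions"]:
            raise KeyError(f"unknown dimension: {dim}")
    select = list(dimensions)
    for name in measures:
        spec = cube["measures"][name]
        func = AGGREGATES[spec["type"]]
        column = spec.get("sql", "*")  # a count measure has no column, so count rows
        select.append(f"{func}({column}) AS {name}")
    sql = f"SELECT {', '.join(select)} FROM {table}"
    if dimensions:
        cols = ", ".join(dimensions)
        sql += f" GROUP BY {cols} ORDER BY {cols}"
    return sql


def demo():
    """Run a compiled query against a tiny in-memory table of made-up trips."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE taxi_trips (company TEXT, day_of_week TEXT, "
        "passenger_count INTEGER, trip_distance REAL, total_amount REAL)"
    )
    conn.executemany(
        "INSERT INTO taxi_trips VALUES (?, ?, ?, ?, ?)",
        [
            ("Uber", "Mon", 1, 2.0, 10.0),
            ("Uber", "Tue", 2, 4.0, 20.0),
            ("Lyft", "Mon", 1, 3.0, 15.0),
        ],
    )
    sql = compile_query(CUBE, "taxi_trips", ["trip_count", "total_revenue"], ["company"])
    rows = conn.execute(sql).fetchall()
    conn.close()
    return sql, rows
```

Calling demo() returns the generated SQL together with one row per company. The point of the design is that the cube is written once, and any combination of its measures and dimensions can then be requested without hand-writing a new query.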
Fi, Definite's AI assistant, interprets your questions, queries the cubes you have defined, and presents the answers as visualizations and tables.
By eliminating technical barriers and providing an intuitive, AI-powered platform, Definite empowers everyone to become data-driven. From data analysts to business users, anyone can now explore, analyze, and extract valuable insights from their S3 data lakes.
So if you're ready to transform your S3 data lake from a cluttered, disorganized mess into a well-indexed, easily navigable library, Definite is here for you. With Definite, the answers you seek are always just a question away.
Get the new standard in analytics. Sign up below or get in touch and we’ll set you up in under 30 minutes.