2024-10-26 –, CLASS #1 - 4A
Language: English
This session will showcase how to enable BI and ML on a modern two-tier data architecture for a business continuity plan, improve real-time analysis for a financial service application, create a centralized BI dashboard for organizational performance forecasting, implement an automated ETL process for cross-functional collaboration, and share experiences in creating a data-intelligent service layer for rapid development.
Audience Learning Outcomes:
- Enable BI and ML on a modern two-tier data architecture to support a business continuity plan, including a live demonstration. (with Demo)
- Improve real-time analysis for a financial service application, including a live demonstration. (with Demo)
- Create a centralized BI dashboard for organizational performance forecasting, including a live demonstration. (with Demo)
- Create an automated ETL workflow for cross-functional collaboration, including a live demonstration. (with Demo)
- Create a data-intelligent service layer for rapid development through experience sharing. (experience sharing)
Stories:
- A financial service company with 10+ applications for customers, including bonds, ETFs/mutual funds, annuities, cryptocurrencies, collectibles (NFT), precious metals, and alternative assets.
- Each customer (or individual investor) has at least 5+ wallets on different applications, and customer data is stored on different applications.
Challenges:
- Adopting new database technology and data architecture to create data intelligence to fulfill the business continuity plan.
- Transforming financial processes and ensuring cross-functional collaboration between different applications.
- Establishing a single source of truth for financial reporting that is GAAP (Generally Accepted Accounting Principles) compliant and includes business logic and accounting logic checks.
Problems:
- Lack of centralized data storage for low-cost, easy maintenance.
- Absence of a unified data format for rapid development.
- Inability to handle streaming data (velocity) for real-time dashboards.
- Lack of transactional financial reports for customers.
- Insufficient operational analysis and forecasting for business intelligence.
Solutions:
Build a brand-new data-intelligent service layer with a modern data architecture to start the business continuity plan.
- Create a modern data architecture and central data storage strategy using dbt Core and Databricks.
- Implement a data transformation workflow from different applications to deliver high-quality, trusted, and reliable data using dbt Core.
- Develop real-time analytics, ML, and applications using Databricks Data Streaming.
- Ensure ACID transactions for customer reports using Delta Lake.
- Leverage Business Intelligence (BI) and Machine Learning (ML) for operational analysis using the Databricks Lakehouse.
Use Cases:
- Pricing recommendations: Calculating optimal ticket prices.
- Conversion optimization: Uncovering opportunities in the customer journey.
- Marketing: Identifying channels leading to purchases.
- Predictive analytics: Forecasting.
- Fraud detection: Reducing fraud.
- Customer experience: Enhancing fan experience.
Architectures:
- dbt Core (data orchestration, data catalog, SQL Python-based transformations, reusable code with macros).
- Databricks Delta Lake (ACID transactions on Spark, scalable metadata, streaming and batch processing).
- Databricks Lakehouse (scale of data lakes).
Results:
- Simplified data architecture reduces reliance on data engineers.
- Improved data velocity for real-time analysis.
- Confident deployment through an analytics engineering workflow.
- Consistent metrics among different applications for data consistency.
- Automated ETL process for data ingestion and transformation.
- Easy debugging and observability through automated alerts and pipeline visibility.
AWS Community Builder
MongoDB Community Creator
Financial service (FSI)