Whether it’s recent news or just new to you, The Data Planet serves up fascinating insights and resources ABOUT the Data analytics and BI WORLD EVERY MONTH.
Our snack-size summaries skip straight to the point.
This edition of the Data Planet includes:
- Data-driven Software: Towards the Future of Programming in Data Science
- Get the Scoop on New Snowflake Features Released this Summer
- Welcome to Snowpark: New Data Programmability for the Data Cloud
- Software Spotlight: Elastic stack (ELK)
Data-driven Software: Towards the Future of Programming in Data Science
Organizations increasingly lean on data science to inform business decisions, but changes in the pipeline often trigger problems. Data-driven software (DDS) is an attempt to apply source control concepts over data. This article, though long and esoteric, shows you how to treat data as if it were code using DDS.
“We believe it is time to break down the barrier between code and data, making any piece of data instantly accessible as if it was a normal function call,” the author says. This is good food for thought regarding data architecture.
Learn all about data-driven software and see examples of code
Get the Scoop on New Snowflake Features Released this Summer
Learn the details of the many announcements made at the Snowflake Summit this year, including the release of enhancements around data programmability, data sharing, and global governance.
Snowflake also announced the ability to connect to the database via RestAPIs. This is a major convenience for developers so that they don’t have to worry about drivers or custom libraries. Snowflake is also cleaning up their admin interface and marking some of their programmability extensions as public preview. This is a good overview if you want a summary of the changes.
Read about the newest Snowflake features, like Snowpark and Java UDFs
Welcome to Snowpark: New Data Programmability for the Data Cloud
This article takes a closer look at the Snowflake announcements pertaining to the new data programmability features that make Snowflake more extensible and friendly to traditional application developers. Get an overview of Snowpark and Java functions, and find links to their documentation within the article, as well.
Discover what Snowpark is about and access related documentation
See the step-by-step Getting Started Snowpark lab guide
Software Worth Sharing: Elastic Stack (ELK)
The Elastic Stack is a search engine. It lets you aggregate data from any source to search, analyze, and visualize it (close to) real time. It consists of three parts that form the letters of the ELK acronym:
- Elasticsearch: Distributed search and analytics engine
- Logstash: Data collection and ingestion
- Kibana: Graphs and visualization
Elasticsearch is the dominant open-source application for search applications. AWS has had a hosted version for ages, and Azure recently announced their hosted version. Traditionally the ELK stack has been used for IT monitoring. Logstash pulls server log files, Elasticsearch indexes them, and then Kibana can be used to show real-time stats on performance or outages.
Log analytics are not the only use case. Many websites use Elasticsearch to power search or filtering functionality. Elastic customers include big names like eBay, Verizon, Netflix, and Salesforce. These articles give you a comprehensive look at ELK.
Get an overview of Elastic stack