<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Schema - Tag - 300.Watts</title><link>https://300watts.me/tags/schema/</link><description>Schema - Tag - 300.Watts</description><generator>Hugo -- gohugo.io</generator><language>en</language><managingEditor>morristai01@gmail.com (Morris)</managingEditor><webMaster>morristai01@gmail.com (Morris)</webMaster><copyright>This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.</copyright><lastBuildDate>Wed, 26 Jan 2022 10:04:55 +0800</lastBuildDate><atom:link href="https://300watts.me/tags/schema/" rel="self" type="application/rss+xml"/><item><title>Make Kafka Schema Easier</title><link>https://300watts.me/posts/make-kafka-schema-easier/</link><pubDate>Wed, 26 Jan 2022 10:04:55 +0800</pubDate><author><name>Morris</name></author><guid>https://300watts.me/posts/make-kafka-schema-easier/</guid><description><![CDATA[<p>TL;DR: You can find the script on my GitHub repository: <a href="https://github.com/morristai/kschema-table" target="_blank" rel="noopener noreferrer">kschema-table</a></p>
<h2 id="coordinating-schema-with-data-scientist" class="headerLink">
    <a href="#coordinating-schema-with-data-scientist" class="header-mark"></a>Coordinating schema with Data Scientists</h2><p>As a data engineer, you&rsquo;ll need to collaborate closely with data scientists and domain experts to design data schemas. The right schema depends heavily on the business domain and on how the product is used. In a cybersecurity context, for instance, threat experts and data scientists are typically the ones who design the schema. They may work with infrastructure and data teams to define it in a common format such as Avro IDL (Avro Interface Description Language).</p>]]></description></item></channel></rss>