<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Diving Into Data]]></title><description><![CDATA[A monthly data career coach in your inbox diving into hands-on, actionable tips for data professionals; built on experience in tech companies, consulting and finance. ]]></description><link>https://www.divingintodata.com</link><image><url>https://substackcdn.com/image/fetch/$s_!3vK2!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc056d8c6-46bb-43da-abda-172f7767ccf4_1280x1280.png</url><title>Diving Into Data</title><link>https://www.divingintodata.com</link></image><generator>Substack</generator><lastBuildDate>Mon, 18 May 2026 21:14:27 GMT</lastBuildDate><atom:link href="https://www.divingintodata.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Tessa Xie]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[tessaxie@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[tessaxie@substack.com]]></itunes:email><itunes:name><![CDATA[Tessa Xie]]></itunes:name></itunes:owner><itunes:author><![CDATA[Tessa Xie]]></itunes:author><googleplay:owner><![CDATA[tessaxie@substack.com]]></googleplay:owner><googleplay:email><![CDATA[tessaxie@substack.com]]></googleplay:email><googleplay:author><![CDATA[Tessa Xie]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[How to Prioritize]]></title><description><![CDATA[To work smarter (not harder), you need to know how to size and prioritize]]></description><link>https://www.divingintodata.com/p/how-to-prioritize</link><guid isPermaLink="false">https://www.divingintodata.com/p/how-to-prioritize</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Sun, 24 Aug 2025 12:02:28 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!lMbf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I once sat on a committee to decide on manager promotions. When debating between two candidates (let&#8217;s call them <em>A</em> and <em>B</em>), the managers&#8217; unanimous opinion was that while A and B were both excellent at leading projects to the finish line, A would be a better team leader because she could &#8220;<em>better <strong>judge the ROI</strong> of all work streams and help herself and the team <strong>prioritize</strong></em>".</p><p>Being able to prioritize is undoubtedly one of the most important soft skills for managers and ICs alike. Working in a fast-paced company means you never just have one task on hand; even the most junior member on my team has multiple things they are working on at any given time. At the same time, there&#8217;s always pressure to deliver quickly since leadership and stakeholders want everything yesterday.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.divingintodata.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Diving Into Data! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>So knowing how to stack rank all the things on your plate in terms of priority is extremely important at all levels of the team. And if you want to grow into a more senior role or become a manager, it becomes absolutely essential.</p><p>However, based on my experience leading data science teams, this skill is not part of any formal training and it takes most people a long time to learn it through trial and error. I regularly see people working hard and delivering high-quality outputs and still get the feedback from cross-functional stakeholders that they are not doing things &#8220;fast enough&#8221; or &#8220;driving enough impact&#8221;.</p><p>When I sit down with them to look through their backlog of things, it&#8217;s clear that they have been prioritizing the wrong things.</p><p>You might think "<em>Prioritization is easy, shouldn&#8217;t I just work on the things my manager/stakeholders asked me to work on?</em>&#8221;. While that should definitely be part of your consideration, you will soon realize that most stakeholders have a pretty low bar for &#8220;asking for analysis&#8221; and your manager will not be able to stay on top of everything on everyone&#8217;s plate on the team. So it&#8217;s eventually part of your job to decide whether something should be prioritized.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lMbf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lMbf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png 424w, https://substackcdn.com/image/fetch/$s_!lMbf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png 848w, https://substackcdn.com/image/fetch/$s_!lMbf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png 1272w, https://substackcdn.com/image/fetch/$s_!lMbf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lMbf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png" width="1217" height="1220" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/17c03015-4467-485d-95f8-c1be34606477_1217x1220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1220,&quot;width&quot;:1217,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:332928,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lMbf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png 424w, https://substackcdn.com/image/fetch/$s_!lMbf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png 848w, https://substackcdn.com/image/fetch/$s_!lMbf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png 1272w, https://substackcdn.com/image/fetch/$s_!lMbf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17c03015-4467-485d-95f8-c1be34606477_1217x1220.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Prioritization is a daily mental exercise :) </figcaption></figure></div><p>The key to this is to understand which initiative drives the most impact. And &#8220;impact&#8221; is different from &#8220;output&#8221; or &#8220;input&#8221;:</p><ul><li><p>I noticed that the most junior DS on the team talk about &#8220;inputs&#8221; &#8211; &#8220;<em>I spent X hours working on these analyses</em>&#8221;</p></li><li><p>More senior DS learn to uplevel from &#8220;inputs&#8221; to &#8220;outputs&#8221;; they&#8217;ll say <em>&#8220;I helped with 5 experiments and built two dashboards</em>&#8221;</p></li><li><p>But only a handful are able to clearly articulate the business<strong> impact </strong>they drove; they&#8217;ll focus on the money they helped the company save, or the growth in important metrics they helped drive</p></li></ul><p>Even though it&#8217;s rarely an explicitly stated skill requirement, knowing how to prioritize is nevertheless a &#8220;hidden&#8221; requirement for anyone&#8217;s career progress because it&#8217;s a pre-requisite for driving impact. Working hard and delivering high-quality work is important, but it only matters if it's on the right projects.</p><p>In this article, I will convince you why being able to judge priorities is crucial across all roles (DS, PM, engineering, etc.), break down the mental prioritization framework you can immediately put to use, and walk you through a hands-on example of how to apply it.</p><h2>To achieve any goal, prioritization is key; and to prioritize, you need to know the ROI</h2><p>If somebody gave your team a big goal (say, growing the monthly active member base from 10M to 50M), would you know:</p><ol><li><p><strong>Whether</strong> it&#8217;s a realistic goal you can hit?</p></li><li><p><strong>How</strong> can you hit the goal, i.e. what do you need to do exactly?</p></li></ol><p>If you answered &#8220;<em>No</em>&#8221;, you&#8217;re not alone; nobody can answer these questions off the bat. The reason is that businesses are complex; you&#8217;ll need many different initiatives to hit an ambitious goal like that, and you cannot assume that what worked in the past will get you to the next level as well.</p><p>In other words, you need to break this big, abstract goal into smaller, concrete work streams and then come up with ideas that will help you move the needle on your target metrics. And then, once you have a list of things you <em>could</em> do, you need to figure out which ones you <em>should</em> actually prioritize. You don&#8217;t have unlimited resources to do everything (especially not in a single quarter), so you need to select the initiatives with the biggest potential.</p><p>That&#8217;s where ROI estimation comes in. ROI stands for &#8220;return on investment&#8221;:</p><ul><li><p><strong>Investment:</strong> How much of your limited resources you put in (usually time or money)</p></li><li><p><strong>Return: </strong>How much business impact you (expect to) get as a result</p></li></ul><p>In other words, you&#8217;re looking for the biggest &#8220;bang for your buck&#8221;.</p><p>Estimations like this are essential during planning cycles to make sure teams are working on the most impactful things. Here is what that would look like in an ideal state:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6Vif!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6Vif!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png 424w, https://substackcdn.com/image/fetch/$s_!6Vif!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png 848w, https://substackcdn.com/image/fetch/$s_!6Vif!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png 1272w, https://substackcdn.com/image/fetch/$s_!6Vif!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6Vif!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png" width="1456" height="1472" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1472,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6Vif!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png 424w, https://substackcdn.com/image/fetch/$s_!6Vif!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png 848w, https://substackcdn.com/image/fetch/$s_!6Vif!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png 1272w, https://substackcdn.com/image/fetch/$s_!6Vif!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf37d03b-427d-45d4-a390-24444e00bcb4_1557x1574.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The planning process of my dreams&#8230; </figcaption></figure></div><ol><li><p>Leadership decides on top-down goals for the company (e.g. grow revenue by $XM)</p></li><li><p>The goal is distributed across the different organizations by assigning team-level goals (e.g. product team 1 needs to contribute 20% of the goal, product team 2 30% etc.)</p></li><li><p>Teams brainstorm initiatives that can help them hit their goal</p></li><li><p>Teams estimate the impact of each initiative they came up with and stack-rank them; the most promising ones become the roadmap for the quarter, and the rest goes into the backlog</p></li><li><p>[Optional] Based on steps 1 - 4 above, teams should have an idea of whether they&#8217;ll be able to hit their goal if they successfully executive their roadmap. If they won&#8217;t, they either need to come up with more (or better) initiatives, or push back on the goal they&#8217;ve been given</p></li></ol><p>Besides helping teams prioritize, the impact estimates also give you goals for each initiative that you can track against throughout the quarter. Then, after the quarter is over, you can do a retrospective and compare actuals against the plan. If any initiatives missed their goal due to overly optimistic or pessimistic estimates rather than execution issues, you can use those insights to refine your estimates during future planning exercises.</p><p>Unfortunately, this process doesn&#8217;t always get the attention it deserves. Some companies and teams run very structured planning processes and loop in the data science team (or business operations) early to help with these calculations, but other teams rely on experience to come up with these numbers, or simply &#8220;make an educated guess&#8221; when they run out of time.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NfLM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NfLM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png 424w, https://substackcdn.com/image/fetch/$s_!NfLM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png 848w, https://substackcdn.com/image/fetch/$s_!NfLM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png 1272w, https://substackcdn.com/image/fetch/$s_!NfLM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NfLM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png" width="1456" height="1418" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1418,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NfLM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png 424w, https://substackcdn.com/image/fetch/$s_!NfLM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png 848w, https://substackcdn.com/image/fetch/$s_!NfLM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png 1272w, https://substackcdn.com/image/fetch/$s_!NfLM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb40fcd6e-dae3-4c8e-8ed2-b68554383fea_1600x1558.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Guesses are okey, but only &#8220;educated&#8220; ones please ...</figcaption></figure></div><p>If you notice a gap like this, it&#8217;s a great opportunity for you to step up and offer to help your stakeholders to level up the process. And don&#8217;t worry if you don&#8217;t have experience in this regard: This guide will give you the toolkit you need to do that.</p><p>In addition to planning, impact estimation is also helpful day-to-day when you are trying to decide 1) whether you should take on a new project or 2) which project on your plate you should focus on. This is important not just for managers that are trying to figure this out for their teams; even as an IC you should constantly make sure you&#8217;re working on high-impact things.</p><p>Like it or not, when it comes to promotions, the most important thing is whether you helped drive business impact. It doesn&#8217;t matter how hard you work, if the project goes nowhere or doesn&#8217;t move the needle, it&#8217;s not going to be helpful for your promo packet.</p><p>That&#8217;s not to say that you need to do an elaborate estimation for every single project you work on; once you&#8217;re familiar with the technique and have practiced it a bit, you&#8217;ll start doing it subconsciously and will quickly develop a gut feeling about a project&#8217;s potential. However, learning the &#8220;proper&#8221; way of calculating ROI can help you build the foundation for this mental model and refine it.</p><h2>While the &#8220;I&#8221; can be estimated from experience, the &#8220;R&#8221; needs to be deduced based on first principles</h2><p>So how to estimate ROI? There are two components in ROI, <strong>R</strong>eturn and <strong>I</strong>nvestment.</p><p>I won&#8217;t spend too much time on the investment piece in this article because it&#8217;s less generalizable. If a project is similar to work you&#8217;ve done in the past, you will likely have a pretty good idea of how much effort it&#8217;ll be. And if it&#8217;s something new, your estimate will likely be very off anyways, so don&#8217;t spend too much time on it. According to <a href="https://en.wikipedia.org/wiki/Hofstadter%27s_law">Hofstadter&#8217;s Law</a>, things always take longer than expected, so whatever your initial estimated effort is, double it and call it a day.</p><p>The piece that is much more important to get right &#8212; but that a lot of people struggle to estimate &#8212; is the &#8220;return&#8221; component. Luckily, this is the part that you can actually estimate somewhat accurately by taking a first-principles approach, and the framework is generalizable for different domains.</p><h3>Step 1. Define the &#8220;return&#8221;</h3><p>When estimating the impact, you first need to decide which metric you want to anchor the impact on. For example, for a planned feature that is supposed to drive user engagement, should you estimate the % increase of total time spent on site or the amount of additional revenue?</p><p>This is important because it facilitates a &#8220;common currency&#8221; and enables an apples-to-apples comparison. Without anchoring to key metrics, you might have one initiative sized to drive &#8220;<em>15% incremental time spent on site</em>&#8221; and another sized to &#8220;<em>increase revenue by 5%</em>&#8221;. How do you know which one is more impactful?</p><p>Besides making sure you&#8217;re using <em>the same impact metric</em> for all estimates, it&#8217;s also important that you&#8217;re choosing one that teams think and care about (e.g. because they&#8217;ve set goals against it); otherwise, it&#8217;s hard to convey the impact because you won&#8217;t get their attention.</p><p>Which metric you ultimately choose depends highly on what your company cares about. In most companies, there are a handful of metrics that everyone rallies around, be it a north star metric or a set of topline KPIs around growth and revenue or profitability. In addition, each org may have a couple of key metrics it&#8217;s responsible for driving, and staying close to the planning process will help you understand what those are.</p><p>To have a<strong> tangible example</strong> for the rest of this post, let&#8217;s say your company ultimately wants to know the impact of any initiative in terms of <strong>revenue.</strong></p><h3>Step 2. Have clarity about which lever you are moving</h3><p>Once you decide what impact metric you are doing the sizing on, the problem becomes much clearer. You&#8217;re basically asking:</p><p>&#8220;<em>How much can I lift X (your &#8220;return&#8221; or impact metric) by improving Y (the lever that can be improved by the initiative you have in mind) by Z%?</em>&#8221;</p><p>You already know your <em>X </em>(the ultimate business impact you&#8217;re sizing; in our example, that&#8217;s <em><strong>revenue</strong></em>) from the previous section, so the next step is to figure out <em>Y </em>(the immediate improvement your initiative can drive)<em>.</em> This means you have to translate the <em>qualitative </em>thing you plan to do to the immediate <em>quantitative impact</em> you expect.</p><p>To continue our example, let&#8217;s say you are a DS working with the feed product team at a social media company. And one of the initiatives the team has in mind is to improve the feed ranking algorithm so that users see more relevant content and ultimately scroll more and consume more content units.</p><p>In this case, our Y (the quantitative impact from our initiative) would be &#8220;<em><strong># content consumed / visitor&#8221; </strong></em>.</p><p>Then it comes to the difficult part: Translating this improvement into the ultimate business impact (i.e. revenue increase) you care about. This requires establishing a lineage between the metrics, and the best way to do that is through building <a href="https://www.divingintodata.com/p/how-to-answer-business-questions">a driver tree</a>.</p><p>Which brings us to the next essential step.</p><h3>Step 3. Understand other levers that affect your target metric</h3><p>Building out the driver tree has two key advantages:</p><ol><li><p>Establishing a lineage between your initiative impact Y and your target metric X (as mentioned above)</p></li><li><p>Providing a comprehensive list of levers for X</p></li></ol><p>At this point you might ask, why is the second piece important if I already know which lever I want to move?</p><p>The reason is that there are typically a lot of ways (levers) to move your target metric, and any given initiative will only affect a subset of them. It&#8217;s essential to have visibility into all the other moving pieces, because by asking &#8220;<em>How much can this initiative improve X</em>?&#8221;, we are really asking &#8220;<em>How much can this improve X, <strong>holding everything else equal</strong>?</em>&#8221;.</p><p>Much like when running a regression, you need to control for confounding factors to establish the relationship between X and Y.</p><p>I have covered driver trees extensively in my <a href="https://www.divingintodata.com/p/how-to-answer-business-questions">previous articles</a>, so I will not repeat everything here, but the general idea is to break down the target metric into drivers that are MECE (Mutually Exclusive and Collectively Exhaustive). What this means is best explained by applying this principle to our example.</p><p>As a recap, your ultimate goal is to increase revenue. The company likely has more than one revenue stream; but as a social media company, any improvement on feed will likely mostly impact the revenue stream coming from ads. This means we can focus on ads revenue as the top of the driver tree.</p><p>Let&#8217;s also assume ads revenue is <em>click-based</em> (i.e. advertisers are charged, and you make money, per click). We can then decompose this mathematically:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-5D5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-5D5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png 424w, https://substackcdn.com/image/fetch/$s_!-5D5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png 848w, https://substackcdn.com/image/fetch/$s_!-5D5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png 1272w, https://substackcdn.com/image/fetch/$s_!-5D5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-5D5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png" width="1456" height="558" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:558,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-5D5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png 424w, https://substackcdn.com/image/fetch/$s_!-5D5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png 848w, https://substackcdn.com/image/fetch/$s_!-5D5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png 1272w, https://substackcdn.com/image/fetch/$s_!-5D5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8531cac0-4796-499d-9a7f-0f2b777769ba_1484x569.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This breakdown is <strong>MECE</strong> because 1) Any change in ad revenue is explained by a change in one of these three drivers (i.e. &#8220;collectively exhaustive&#8221;), and 2) every driver represents a distinct component with no overlap (i.e. &#8220;mutually exclusive&#8221;): Volume, conversion and price.</p><p>Then, &#8220;<em># ad impressions</em>&#8221; can be further broken down:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tdG8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tdG8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png 424w, https://substackcdn.com/image/fetch/$s_!tdG8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png 848w, https://substackcdn.com/image/fetch/$s_!tdG8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png 1272w, https://substackcdn.com/image/fetch/$s_!tdG8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tdG8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png" width="1456" height="1127" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1127,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tdG8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png 424w, https://substackcdn.com/image/fetch/$s_!tdG8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png 848w, https://substackcdn.com/image/fetch/$s_!tdG8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png 1272w, https://substackcdn.com/image/fetch/$s_!tdG8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a7a6472-f680-4ff3-b2f6-4993353ec51c_1600x1238.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>You need to keep on building out the tree (by breaking down the components into more granular subcomponents like shown above) until the metric you plan to move through your initiative (Y) shows up as a branch in the tree; then you can figure out the quantitative relationship by tracing the path all the way up to your target metric X.</p><p>And all the other branches on the tree are the additional levers you need to control for in your sizing.</p><h3>Step 4. Use the lineage to build the sizing and make implicit assumptions explicit</h3><p>The previous steps were about <em>understanding</em> the relationships between what you can control and the impact metric you are ultimately trying to size. Now, it&#8217;s time to start creating the actual structure of your sizing calculation.</p><p>This step should be pretty easy once you&#8217;ve done all of the prep work in the previous steps; you have all the pieces and you just need to put them together. The only tricky part here is thinking through how to set this up in a spreadsheet. Ideally you want to make this sizing not only a hacky one-off effort that lands in the trash after, but rather something that can be leveraged in the future for similar initiatives that aim to drive the same impact metric.</p><p>One tip for enabling this scalability is to separate the <em><strong>structure</strong></em> of the calculation from the <em><strong>assumptions</strong>.</em> In other words, list out the relevant variables from your driver tree and link them, and then have a separate section where you list out all of your assumptions like the expected lift from your initiative (instead of hardcoding them into the formulas).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wx6L!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wx6L!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png 424w, https://substackcdn.com/image/fetch/$s_!wx6L!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png 848w, https://substackcdn.com/image/fetch/$s_!wx6L!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png 1272w, https://substackcdn.com/image/fetch/$s_!wx6L!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wx6L!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png" width="1456" height="1343" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1343,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wx6L!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png 424w, https://substackcdn.com/image/fetch/$s_!wx6L!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png 848w, https://substackcdn.com/image/fetch/$s_!wx6L!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png 1272w, https://substackcdn.com/image/fetch/$s_!wx6L!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3418ad69-5648-4065-96f0-ee4631b2bf53_1572x1450.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A sizing sheet that conveys high work quality </figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6Kfo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6Kfo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png 424w, https://substackcdn.com/image/fetch/$s_!6Kfo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png 848w, https://substackcdn.com/image/fetch/$s_!6Kfo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png 1272w, https://substackcdn.com/image/fetch/$s_!6Kfo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6Kfo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png" width="1430" height="1396" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1396,&quot;width&quot;:1430,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6Kfo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png 424w, https://substackcdn.com/image/fetch/$s_!6Kfo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png 848w, https://substackcdn.com/image/fetch/$s_!6Kfo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png 1272w, https://substackcdn.com/image/fetch/$s_!6Kfo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f2e1a18-474b-4640-981b-ab2db77d6398_1430x1396.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">And one that says &#8220;I&#8217;m only doing the bare minimum&#8221;</figcaption></figure></div><p>This has a few key advantages besides making the sizing re-usable:</p><ul><li><p><strong>Transparency:</strong> It&#8217;s easy for others to see in one spot what assumptions you&#8217;re making. If anyone questions the end result, you can quickly figure out which exact assumption they disagree with</p></li><li><p><strong>Scenario modeling:</strong> You can easily tweak each assumption to see how the outcome changes. For example, you could figure out what the upper and lower bounds of possible impact are by plugging in the most ambitious or conservative assumptions for each driver</p></li><li><p><strong>Diagnosing underperformance:</strong> If the initiative gets launched and performance is not meeting the projected impact, it&#8217;s easier to root cause <em>why</em> you&#8217;re missing the target, i.e. which specific assumption was off. This then allows you to refine your projections in the future</p></li></ul><h3>Step 5. Plug numbers into your sizing</h3><p>Now that you have all the levers listed out, how do you know what values to plug in?</p><p>To start, you need baseline numbers for all of your metrics; this can be something like the average value over the last three months, and you can usually pull it from a dashboard or directly from a table in the data warehouse.</p><p>Then, you need to apply the <strong>anticipated % improvement</strong> to the lever you plan to move (the &#8220;Z&#8221; from our equation under Step 2). I.e. in our example, we need to plug in the <em>% increase in feed content consumption per user </em>that we expect from our ranking change.</p><p>You can estimate this anticipated lift by benchmarking against similar past initiatives. It&#8217;s likely not the first time the team has made improvements to the ranker to improve relevance, so you can reference those experiments or launches to get a rough idea of what to expect this time.</p><p>You&#8217;ll often find that there are trends you can forecast out; for example, you might find that the team has tackled all of the low-hanging fruit already, and the most recent improvements were mostly smaller fine-tuning changes with diminishing returns. Taking these trends into account will get you a more accurate range for where the impact of your upcoming initiative might land.</p><h3>Step 6. Consider and model out any ripple effect of this change</h3><p>Until this step, we have been working off of one big assumption: <strong>our initiative will move certain levers and the rest of them will be constant</strong>. This is essentially assuming that there&#8217;s no interaction effect between different levers. In reality though, when one lever is moved, there&#8217;s almost always ripple effects on other ones, and it&#8217;s important to anticipate these.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!t-YB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!t-YB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png 424w, https://substackcdn.com/image/fetch/$s_!t-YB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png 848w, https://substackcdn.com/image/fetch/$s_!t-YB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png 1272w, https://substackcdn.com/image/fetch/$s_!t-YB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!t-YB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png" width="1456" height="1173" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1173,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!t-YB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png 424w, https://substackcdn.com/image/fetch/$s_!t-YB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png 848w, https://substackcdn.com/image/fetch/$s_!t-YB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png 1272w, https://substackcdn.com/image/fetch/$s_!t-YB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe587c687-4967-4247-994b-03fd20d90ca8_1600x1289.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">You can never anticipate all the ripple effects</figcaption></figure></div><p>For the sizing to be as accurate as possible, it&#8217;s crucial to model out the major interactions. In our example, when we increase relevance of feed content, members will consume more content when they are on site. And in turn they will consume more ads (assuming the <em><strong>ads share of contents </strong></em>is kept the same).</p><p>It would be naive to assume, however, that members will keep the same level of click-through-rate on ads when they are shown more of them.</p><p>In addition, if the ranking change is expected to affect some user groups more than others (e.g. because it&#8217;s leveraging a signal that is more relevant for certain users), those users will start accounting for a larger share of content and ad impressions, and your average cost per click might change as a result (since not all users are equally valuable to advertisers).</p><p>Again, past experiments will be the best data source to estimate this impact. Once you have enough data, you can start developing simple rules of thumb such as &#8220;<em>A 1% increase in the number of ads shown per session results in a X% decrease in CTR</em>&#8221;. This will make it easier to do these sizing exercises in the future since you won&#8217;t have to look at raw experiment data every time.</p><p>Lastly, these ripple effects should be part of the &#8220;<em>Assumptions</em>&#8221; section of the sizing spreadsheet so it&#8217;s easy to tweak them and model out different scenarios when needed.</p><p><strong>Remember, sizing is an art as much as it&#8217;s a science.</strong></p><p>Of course you can always make the sizing more accurate; but at the end of the day, it&#8217;s important to remember that nobody has a crystal ball that predicts the future. The sizing is just an attempt to estimate the <em>order of magnitude</em> of what you expect to happen, so you need to be comfortable with &#8220;good enoughs&#8221; in this process.</p><p>Remember, while &#8220;good enough&#8221; is not perfect, it&#8217;s better than flying blind and having no estimation at all.</p><p>Making all the assumptions explicit and tweak-able is enabling us to iterate on the sizing when we have learnings in the process (e.g. in the online experiment, we might notice that we have under-estimated some ripple effect) so our modeling can get closer to reality as we iterate on it with new information.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.divingintodata.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Diving Into Data! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[How to Be Taken Seriously]]></title><description><![CDATA[4 common mistakes of junior data scientists and how to fix them]]></description><link>https://www.divingintodata.com/p/how-to-be-taken-seriously</link><guid isPermaLink="false">https://www.divingintodata.com/p/how-to-be-taken-seriously</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Wed, 07 May 2025 09:30:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I once overheard an elevator conversation between two managers. One of them was talking about a direct report:</p><blockquote><p>&#8220;He&#8217;s not ready to be promoted. Every time I ask him a question, I get this long-winded answer that&#8217;s loaded with unnecessary details&#8230; &#8221;</p></blockquote><p>This manager was saying out loud what a lot of managers and executives are thinking (but rarely give as direct feedback). Too much detail doesn&#8217;t make you seem more capable &#8212; on the contrary, it makes you look junior.</p><p>As a manager, I have led teams and coached a lot of bright DS in my career. Part of a manager&#8217;s job is to determine, among a team of bright people, who to pull in for what work.</p><p>Managers can usually quickly form a &#8220;gut feeling&#8221; about who they can bring in to lead cross-functional work streams independently versus who can execute but isn&#8217;t ready for leadership yet &#8212; whether formal (e.g. as a tech lead) or informal (leading a project).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ev6f!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ev6f!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ev6f!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ev6f!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ev6f!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ev6f!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg" width="1456" height="1456" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ev6f!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ev6f!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ev6f!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ev6f!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c503d1-6c1c-4c27-addc-018b390d5b68_1600x1600.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>So what are the key distinguishers between these two groups of individuals? After talking to a lot of managers, I&#8217;ve noticed there is a consensus:</p><p><strong>It&#8217;s usually not the ability to do technical work (surprising, I know), but rather certain behavioral/communication patterns that make people perceive you as a lead, take you seriously and pay attention to what you have to say.</strong></p><p>In this article, I will break down some of the key things that will make you appear junior, convince you why it&#8217;s important to improve on them and teach you how to do exactly that.</p><h1>Four behaviors that make you seem junior &#8212; and what to do instead</h1><h2><strong>Junior trait #1: Providing too much details/over-explaining</strong></h2><p>This is innate to analytics folks since the field requires us to pay attention to details. As a result, a lot of junior DS think providing a lot of details will showcase that they have considered a lot of the edge cases and give the audience confidence that they have a lot of knowledge about the field.</p><p>In reality, though, too much detail will often confuse your audience and make you look like you don&#8217;t know how to synthesize information. As a result, you will lose people in your communication and cause them to miss your key points since they are buried in all the details (think the VP who starts to scroll on his phone when you get into the nitty gritty of how you pulled the data in SQL).</p><p>I have literally had stakeholders telling me &#8220;<em>Don&#8217;t put this person in front of execs because they will confuse everyone</em>&#8221;.</p><p>This might sound harsh, but everyone who has ever been in a large corporate meeting knows how inefficient they are <strong>even when things are going well.</strong><em> </em>That means as a manager you cannot afford to add additional friction and confusion by having a team member lead the discussion who is not ready.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vGX8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vGX8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vGX8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vGX8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vGX8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vGX8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg" width="1368" height="1600" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1600,&quot;width&quot;:1368,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!vGX8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vGX8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vGX8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vGX8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F108d6c4d-5ce5-477d-9ef0-f0b1a68405fd_1368x1600.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Don&#8217;t get me wrong, it is crucial to consider all the edge cases and pay attention to details during your analysis. <strong>But that doesn&#8217;t mean all those details and edge cases are worthy of being communicated to others</strong> when you present your work (unless they specifically ask about them).</p><h3><em><strong>What to do about it</strong></em></h3><p>This should be practiced in both written and verbal form.</p><p><strong>In written form</strong>, try to summarize your work in a <em>TL;DR</em> (a brief blurb highlighting the most important takeaways) that you put at the top of your document or email.</p><p>Ideally write your TL;DR using the <a href="https://www.divingintodata.com/p/analytics-frameworks-every-data-scientist">Pyramid Principle</a>: Start with the conclusion related to the business question you want to answer, and then add key supporting evidence as needed.</p><p><strong>Remember, being able to decide what NOT to communicate is as important as deciding what to mention.</strong></p><p>For everything you want to communicate, ask yourself if it&#8217;s &#8220;important enough&#8221;. If not, put it in the appendix in case someone asks but remove it from the summary.</p><p><strong>For example</strong>, imagine that during your analysis you discover a data issue that applies to 0.1% of the member base. Ask yourself: &#8220;<em>Does this data error fundamentally change the conclusion of my analysis and/or my recommendation?</em>&#8221;</p><p>The answer is most likely &#8220;<em>No</em>&#8221;, so only bring it up when asked.</p><p><strong>In verbal form,</strong> practice talking about your project with peers who are not deep in the weeds of the work with you to see if your summary confuses them. And when crafting this verbal summary in your head, imagine you are doing an &#8220;elevator pitch&#8221; to the CEO &#8211; giving him/her a good enough description of your work in less than 30 seconds that makes them deeply understand the &#8220;why&#8221; of your work and gives just enough context on the &#8220;what&#8221; so that they can form an opinion.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.divingintodata.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Diving Into Data! Subscribe for free to receive regular DS career advice in your inbox.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2><strong>Junior trait #2: Not having an opinion or recommendation </strong></h2><p>When asked &#8220;Based on your analysis, what would be your recommendation?&#8221;, one of my former team members loved using the phrase &#8220;I have no dog in this fight&#8221;. This might not seem like a big deal, but it&#8217;s actually an example of one of the most common career-limiting traits of data scientists.</p><blockquote><p>The more senior you become as a data scientist (and the more AI replaces the &#8220;execution&#8221; part of the job), the more important it becomes to translate your analysis into a recommendation for your stakeholders.</p></blockquote><p>But it&#8217;s not surprising that junior DS struggle with this part. Forming an opinion or recommendation takes more than just pulling the data; it requires digesting and understanding the data and connecting it to the needs of the business. And let&#8217;s face it, when you&#8217;re deep in the weeds of a 500-line query or a complex data pipeline, it&#8217;s easy to lose sight of the bigger picture. So a lot of people do a great job executing the data pull and then leave it to their audience to make sense of it.</p><p>The other reason a lot of data scientists or analysts are hesitant to give a clear recommendation is the perceived risk involved. Hedging is deeply wired into our risk-averse analytical brains.</p><p>This makes sense, since data is nuanced and nothing is ever completely black-and-white. But this can backfire when you&#8217;re working in a fast-moving environment like a startup and becoming a blocker because you&#8217;re drowning your audience &#8212; who are trying to make a decision &#8212; in caveats.</p><p><strong>Every decision</strong> comes with risks, and <strong>every recommendation</strong> has caveats. But putting too much emphasis on them results in analysis paralysis &#8212; gathering more and more data and doing more and more analysis, hoping the decision will make itself clear in the process.</p><p><strong>Ultimately, not giving recommendations shows a lack of ownership of the problem and lack of the ability or willingness to make sense of the data.</strong></p><p>And you&#8217;ll be limited to simple &#8220;execution&#8221; work as a result.</p><h3><em><strong>What to do about it</strong></em></h3><p>First and foremost, adopt an ownership mindset.</p><p>Most of us are able to make decisions and have strong opinions when it comes to things we care about. For example, you were able to decide where to travel last time you planned a trip, right?</p><p>You probably gathered information about weather, flight tickets, reviews etc. to decide between a couple of options. And despite all that complex information, you landed on a clear decision in the end. Why would business decisions be different?</p><p><strong>Imagine that you&#8217;re the decision-maker and have to make this call</strong>:</p><ul><li><p>What data would you need?</p></li><li><p>And once you pull those data points, ask yourself: Would you convinced by the data presented?</p></li><li><p>If not, what else can possibly convince you?</p></li></ul><p>Just don&#8217;t fall into the trap of thinking that gathering more and more data is the solution when you&#8217;re stuck. In a lot of cases, what we actually need is to digest and make sense of the data we already have. 1,000 different data cuts will only confuse people and take them further away from reaching a decision since each cut is telling a slightly different story and it&#8217;s difficult to make sense of them all.</p><p>And if the fear of risk is what&#8217;s stopping you from having a recommendation, remember this: The only way to generate above-average returns in financial markets is by taking risks; risk-free strategies only yield the low risk-free rate of return. The same applies at work; if the decision were crystal clear with no uncertainty or risk involved, nobody would need you to make a recommendation. The value you add comes from giving robust recommendations <em>despite</em> nuance and ambiguity.</p><p>Don&#8217;t get me wrong, you should definitely list out the caveats of your recommendation. But trust me: Most people prefer that you have a recommendation that they don&#8217;t agree with instead of having no recommendation at all. Because if you do have a recommendation and list out how you arrived at it, they can dig in to see which of your assumptions and decisions they agree with and which ones they don&#8217;t.</p><blockquote><p><strong>At the end of the day, data teams are not paid to only pull and present data; we are paid to help drive business decisions with data.</strong></p></blockquote><p>So if you always leave the mental burden of digesting data and making the decision to your audience or others on your team, you limit yourself to being a junior analyst instead of a true thought partner.</p><h2><strong>Junior trait #3: Not being clear about the &#8220;why&#8221; behind the analysis</strong></h2><p>I can&#8217;t count how many times &#8212; after I talked to my team about an analysis &#8212; I felt like we were solving the wrong problem.</p><p>When I ask the individual &#8220;Why are we doing this analysis? What&#8217;s the business decision behind this that we are trying to drive?&#8221;, the answer I usually get from junior DS is &#8220;XYZ stakeholder asked for this&#8221;.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dt5-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dt5-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dt5-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dt5-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dt5-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dt5-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg" width="1456" height="1456" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dt5-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dt5-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dt5-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dt5-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb12d4e-3d15-43eb-80bf-1d9638235c10_1600x1600.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>That should <strong>never</strong> be the only reason you are working on an analysis.</p><p>The interesting thing is, one of the most common complaints I hear from data scientists is being treated as a &#8220;data puller&#8221;; but when involved in an analysis, a lot of junior analysts voluntarily demote themselves to &#8220;data pullers&#8221;.</p><p>They are so laser-focused on finding out what data they need or what data cuts they should produce that they leave the big picture questions like &#8220;<strong>What problem</strong> are we trying to solve?&#8221; and &#8220;<strong>Why is it important</strong> to solve this problem?&#8221; to their managers or tech leads to figure out.</p><p>Whenever this happens, it serves as a signal to me, the manager, that this person is not ready to run projects independently because I need to go back to the stakeholder to inquire about the motivation of the analysis.</p><p>So why is this so problematic, especially if you want to grow into a lead role?</p><p>It goes back to the ownership mindset. Not knowing the &#8220;why&#8221; behind the analysis means you are not taking ownership of the business problem. More importantly, not being clear about the why means not being clear about the problem space, which severely hinders your ability to deliver the most effective solutions.</p><p>People who don&#8217;t have a deep understanding of the problem space often conduct analyses that don't answer the exact question stakeholders have in mind. These stakeholders will then ask for additional data, hoping that it will help them answer the original question. As a result, the analyst gets frustrated because the &#8220;data requests&#8221; constantly change, and a vicious cycle starts.</p><h3><em><strong>What to do about it</strong></em></h3><p>Similar to my suggestion in the last section &#8212; start by owning the problem. If your stakeholder tells you &#8220;I&#8217;m trying to understand how many people used our search function in the past month&#8221;, instead of directly jumping into pulling the number, you should try to find out <em><strong>why</strong></em> they need that data. What decision are they trying to make?</p><p>Once you understand the ultimate problem your business partners have in mind, you can help them brainstorm the best data that can be used to solve that problem (and trust me, in a lot of cases it&#8217;s not the original data they asked you to pull).</p><p>This way, you are truly elevating yourself to be a thought partner and owner of the problem, not just somebody who executes data pulls. If you find yourself struggling with this, The Operator&#8217;s Handbook has an <a href="https://www.operatorshandbook.com/p/how-to-solve-problems">in-depth guide</a> on how to deeply understand a problem space and find the most effective solution from there.</p><h2><strong>Junior trait #4: Not having the basics down to be able to stay &#8220;one step ahead&#8221;</strong></h2><p>Imagine an economist is giving an interview about their opinion on the tariffs, but stumbles when asked &#8220;Roughly what % of American goods are imported?&#8221;. Would you trust anything they say afterwards?</p><p>Probably not. Similarly, the easiest way to lose credibility as a data scientist is when people feel like you don&#8217;t really know the data you&#8217;re working with or the area of the business you&#8217;re covering. For example, if you&#8217;re doing a deep-dive analysis into user behavior on your company&#8217;s platform, you should know at the top of your head roughly how many active users the platform has.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZynZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZynZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ZynZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ZynZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ZynZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZynZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg" width="1456" height="1456" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:446951,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.divingintodata.com/i/161838591?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZynZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ZynZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ZynZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ZynZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F586094af-4ed5-4343-8661-f52c7b286fc5_2048x2048.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>If you want to establish yourself as an expert in an area, you need to stay &#8220;one step ahead&#8221; of your stakeholders. If you are the POC for a topic, you are expected to be the most familiar with the data (even compared to your manager) and have an answer ready for the most common questions people might have.</p><p>Don&#8217;t fret, I&#8217;m not saying you need to be able to anticipate <em><strong>every single</strong></em> &#8220;follow up&#8221; question that people may have. But imagine you&#8217;re presenting an analysis and share the fact that more than 80% of your product&#8217;s users are based in the US.</p><p>Most of your audience will naturally wonder things like:</p><ul><li><p>&#8220;What are the other major countries besides the US?&#8221;</p></li><li><p>&#8220;Does the US also account for 80% of DAUs and revenue, or a larger or smaller share?&#8221;</p></li></ul><p>If you don&#8217;t have an answer for any of the &#8220;natural follow-up questions&#8221;, it will make people wonder whether you really spent enough time to understand and explore the data.</p><h3><em><strong>What to do about it</strong></em></h3><p>Try to be curious about the data you&#8217;re working with. Start with a basic question, and then explore the data from there.</p><p>For example, imagine you work at Facebook and are analyzing marketplace data. The first thing you check is how big the product is and you find that &#8220;10% of all Facebook DAUs use FB Marketplace&#8221;.</p><p>What natural follow-up questions could you explore from here?</p><p>For example, you might want to check how this number has changed over time, and how that 10% compares to other product surfaces. Let your curiosity carry you a little bit and jot down the answers to these questions. The key is to deeply understand the context of the data you plan to present without wandering too far from the key question you&#8217;re trying to answer for the business. It&#8217;s a tricky balance, I know.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EVpu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EVpu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg 424w, https://substackcdn.com/image/fetch/$s_!EVpu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg 848w, https://substackcdn.com/image/fetch/$s_!EVpu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!EVpu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EVpu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg" width="1456" height="1456" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:457526,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.divingintodata.com/i/161838591?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!EVpu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg 424w, https://substackcdn.com/image/fetch/$s_!EVpu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg 848w, https://substackcdn.com/image/fetch/$s_!EVpu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!EVpu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff19fb7ae-6b0c-4a66-b907-07e17d7b8361_2048x2048.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Generally, you can expect follow-up questions from stakeholders to fall into three buckets:</p><ol><li><p><strong>Foundational knowledge:</strong> Different stakeholders will have different levels of pre-existing knowledge on the topic you&#8217;re presenting. Some of them might even have very basic questions (e.g. how the product works, what steps the onboarding flow has, how many users there are etc.)</p></li><li><p><strong>Your analysis: </strong>You should always share only the most important insights from your analysis rather than dumping everything you did on your stakeholders. However, stakeholders might ask for more detail on certain aspects, and you should be prepared to elaborate when needed</p></li><li><p><strong>Next steps: </strong>Stakeholders will often be curious about what your analysis means for them. What are the recommended next steps? What changes should they make based on your findings?</p></li></ol><p>Think through these three areas before you present an analysis, and you&#8217;ll be able to anticipate many of the questions you&#8217;ll get.</p><p>Lastly: Get a second pair of eyes on your work (or use ChatGPT) before you present to a broader audience. Ideally, this person is someone who&#8217;s not as deep in the weeds of the analysis as you are; this will help you find any obvious things you missed because you were too close to the data.</p><h1><strong>Conclusion</strong></h1><p>A lot of the &#8220;symptoms&#8221; mentioned in this article represent a common theme &#8212; lacking an ownership mindset. What establishes you as a lead in an area is not the title you have or how long you have been working in that area; it&#8217;s the mindset you adopt and how you take responsibility for owning and solving a problem.</p><p>So if you want to be perceived as a trusted thought partner instead of a junior IC who can only help pull data, put yourself in the shoes of the decision makers and treat the decisions with the same level of rigor as the ones you care deeply about.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.divingintodata.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Diving Into Data! Subscribe for free to receive regular DS career advice in your inbox</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[How To Answer Business Questions with Data ]]></title><description><![CDATA[Data analysis is the key to driving business decisions and answering questions, but it&#8217;s hard to get right]]></description><link>https://www.divingintodata.com/p/how-to-answer-business-questions</link><guid isPermaLink="false">https://www.divingintodata.com/p/how-to-answer-business-questions</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Thu, 14 Nov 2024 11:10:55 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b3c480e8-b360-465d-a54b-7c0fd4639bcd_2688x1667.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>&#8220;We have observed a drop in our core metric, what is going on?&#8221;</p><p>&#8220;What are the drivers for churn?&#8221;</p><p>As data scientists, we encounter questions like these every day. A stakeholder comes across something they want to understand better, and they look to us to take an open-ended question like this and make sense of it.</p><p>However, even seasoned data scientists are sometimes stunned by just how vague or open-ended these questions are. As a result, they don&#8217;t know where to start or how to carry these analyses forward.</p><p>In fact, a lot of analyses either circle high level at 30,000 feet and only scratch the surface, or get stuck in a rabbit hole that&#8217;s very far from the original question we hoped to answer.&nbsp;</p><p>Like mentioned at the beginning, there are generally two categories of open-ended business questions data scientists encounter:&nbsp;</p><ol><li><p>Questions that are more <strong>investigative</strong> and aim to identify drivers of a past event; most companies/people refer to this as <strong>root cause</strong> analysis (e.g. &#8220;XX metrics decreased week over week, what happened?&#8221;)</p></li><li><p>Questions that are of <strong>exploratory</strong> nature and focus on identifying drivers of improvements (e.g. &#8220;we want to improve XX metric, what are the things we can do?&#8221;)</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!w3kW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!w3kW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png 424w, https://substackcdn.com/image/fetch/$s_!w3kW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png 848w, https://substackcdn.com/image/fetch/$s_!w3kW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png 1272w, https://substackcdn.com/image/fetch/$s_!w3kW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!w3kW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png" width="630" height="488.5329341317365" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1036,&quot;width&quot;:1336,&quot;resizeWidth&quot;:630,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!w3kW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png 424w, https://substackcdn.com/image/fetch/$s_!w3kW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png 848w, https://substackcdn.com/image/fetch/$s_!w3kW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png 1272w, https://substackcdn.com/image/fetch/$s_!w3kW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25da44cc-f225-46ff-999f-fd110f189c38_1336x1036.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>There are a lot of similarities between how you would carry out these types of analyses, but there are also slight differences.</p><p>In this article, I will focus on <strong>investigative root cause analysis </strong>and:</p><ul><li><p><strong>Share general tips</strong> for how to conduct analysis for open-ended questions</p></li><li><p><strong>Introduce the high level steps</strong> for this type of analysis</p></li><li><p><strong>Go through a hands-on guide</strong> for each detailed step</p></li><li><p><strong>Apply the hands-on guide on a realistic example</strong> so you can see how it can be carried out in practice</p></li></ul><p>I will get to <strong>exploratory</strong> analysis in another article.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.divingintodata.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Diving Into Data! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>General advice &amp; pitfalls to avoid when answering open ended business questions</h2><h3>Have a framework</h3><p>From <a href="https://www.divingintodata.com/p/analytics-frameworks-every-data-scientist">my last article</a>, you should already know how much I love frameworks and hopefully learned a little about the common frameworks that DS should apply in their analyses.</p><p>When it comes to <strong>investigative </strong>work, the most relevant framework is the &#8220;hypothesis tree&#8221;, where you try to list out all the probable causes of a problem in a <em><strong>MECE</strong></em> way and rule them out one by one until you find the culprit(s).&nbsp;</p><h3>Don&#8217;t expect to get there in a straight line</h3><p>These types of analysis are hard to do because the process sometimes can be more of an art than strict science. It&#8217;s a creative process and it&#8217;s guided by curiosity and <em>quality </em>questions/hypotheses.</p><p>Because it&#8217;s open-ended, there isn&#8217;t <em><strong>one</strong></em> predetermined &#8220;correct&#8221; way to get to the answer. In fact, the steps that will eventually lead us to a conclusion cannot all be planned in advance.</p><p>The &#8220;hypothesis tree&#8221; can only be built out a couple of levels at a time, because you don&#8217;t know what insights or patterns you will find in the data and what hypotheses you can reject with the data.</p><p><strong>These insights will largely dictate what your next move will be.</strong></p><p>This means you will explore and reject a lot of &#8220;not-it&#8221; hypotheses along the way. <strong>However, these detours are not a waste of time</strong>. Doing data-related investigation is not that different from investigations in police work&#8202;&#8212;&#8202;ruling out &#8220;suspects&#8221; is a key step in having a thorough investigation, so <strong>expect to chase down a lot of dead-end leads before getting to the &#8220;lead suspect&#8221;.</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nERj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nERj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png 424w, https://substackcdn.com/image/fetch/$s_!nERj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png 848w, https://substackcdn.com/image/fetch/$s_!nERj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png 1272w, https://substackcdn.com/image/fetch/$s_!nERj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nERj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png" width="652" height="558.4305343511451" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1122,&quot;width&quot;:1310,&quot;resizeWidth&quot;:652,&quot;bytes&quot;:845461,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nERj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png 424w, https://substackcdn.com/image/fetch/$s_!nERj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png 848w, https://substackcdn.com/image/fetch/$s_!nERj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png 1272w, https://substackcdn.com/image/fetch/$s_!nERj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb399dfb4-cc6b-4e69-a188-ddf9a1e9fcba_1310x1122.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Have your eyes on the goal at all times</h3><p>This is super important. When I was a junior DS, I sometimes found myself elbow deep in my analysis and only after a couple of hours of exploring found myself wondering &#8220;How did I end up here? What am I looking for? How does this help answer my original question?&#8221;&nbsp;</p><p>It&#8217;s important to ask yourself these questions frequently in your process. There will likely be a lot of people with different opinions about what you should look into, and you will have a million data points you could be investigating.</p><p>One way to help decide what is worth digging deeper into is to see whether it has a clear lineage to trace back to the ultimate question you want to answer.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NDGu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NDGu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png 424w, https://substackcdn.com/image/fetch/$s_!NDGu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png 848w, https://substackcdn.com/image/fetch/$s_!NDGu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png 1272w, https://substackcdn.com/image/fetch/$s_!NDGu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NDGu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png" width="1438" height="1088" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1088,&quot;width&quot;:1438,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:458522,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NDGu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png 424w, https://substackcdn.com/image/fetch/$s_!NDGu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png 848w, https://substackcdn.com/image/fetch/$s_!NDGu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png 1272w, https://substackcdn.com/image/fetch/$s_!NDGu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F236aa319-fd36-443a-aef3-92aed1434260_1438x1088.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Act on curiosity, but don&#8217;t let it take you too far down a rabbit hole</h3><p>Going back to my comparison between data investigation and detective work:</p><p>More often than not, when you come across something unexpected, it can lead to a big discovery that will change the course of your investigation. So it&#8217;s important to act on curiosity when it comes to this type of analysis&#8202;&#8212;&#8202;if you see something weird or interesting, dig in and see if there&#8217;s anything valuable you haven&#8217;t considered.&nbsp;</p><p>But remember to utilize the tip above to know when it&#8217;s time to call it quits and stop the descent into a rabbit hole. Think about the hypothesis you&#8217;re investigating and ask yourself:</p><ul><li><p>Have you seen enough evidence to prove or reject it?</p></li><li><p>If not, what else do you need to close the loop?</p></li></ul><div><hr></div><h2>The 5 steps of an analytical investigation</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!srGi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!srGi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!srGi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!srGi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!srGi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!srGi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg" width="654" height="654" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:654,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!srGi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!srGi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!srGi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!srGi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb07e5d9e-3e14-4419-a0ca-b96f746a6486_1600x1600.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Step 1: Scope the analysis&#8202;</h3><p>It&#8217;s crucial to understand the business problem at hand so you can set the scope of your analysis. Let&#8217;s say your stakeholders asked the following question: &#8220;XX metric is down 10%, what caused it?&#8221;</p><p>In this case, it&#8217;s important to understand <em><strong>what the benchmark is</strong></em> (i.e. is it down 10% compared to forecast? Or is it down 10% compared to a previous period?) and <em><strong>what time period</strong></em> is the observation based on.</p><p>This will help you decide which time period of data you should be observing, what else you need to understand (for example, if the comparison is to a forecast, you need to understand how the forecast was created) and what hypothesis you can quickly rule out even without data.&nbsp;</p><h3>Step 2: Generate hypotheses (using a hypothesis tree)</h3><p>You should always rely on two things in hypothesis generation&#8202;&#8212;&#8202;your curiosity and a framework.&nbsp;</p><p>A framework will get you started with a basic set of hypotheses that can represent the most common issues based on logical reasoning. Your curiosity will then lead you to observe &#8220;interesting&#8221; things in the data (e.g. counter intuitive findings) that can inspire new hypotheses.</p><p>In other words, finding the root cause will be an iterative process. Even if it&#8217;s tempting, don&#8217;t try to map out the whole hypothesis tree at the beginning (trust me, I have tried, it will quickly fall apart).</p><p>As a reminder, this is what a hypothesis tree looks like:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8XDO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8XDO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8XDO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8XDO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8XDO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8XDO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg" width="1400" height="1049" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1049,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8XDO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8XDO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8XDO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8XDO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b6198c8-1499-402f-8b3a-88edddf79c24_1400x1049.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Step 3: Decide what data you need to test the hypotheses, retrieve and QA the data</h3><p>Once you have a new set of hypotheses developed, the goal is to quickly find ways to validate or reject them so you can move forward to the next iteration of the cycle.</p><p>The very key to this is securing the right data and making sure it can be trusted.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SurJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SurJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png 424w, https://substackcdn.com/image/fetch/$s_!SurJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png 848w, https://substackcdn.com/image/fetch/$s_!SurJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png 1272w, https://substackcdn.com/image/fetch/$s_!SurJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SurJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png" width="1438" height="1202" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1202,&quot;width&quot;:1438,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:749564,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SurJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png 424w, https://substackcdn.com/image/fetch/$s_!SurJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png 848w, https://substackcdn.com/image/fetch/$s_!SurJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png 1272w, https://substackcdn.com/image/fetch/$s_!SurJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a099183-d296-44dc-9111-fe3bb75f6d8b_1438x1202.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Once you get your hands on the data, the first step should never be to jump into hypotheses testing or insights generation&#8202;&#8212;<strong>&#8202;it should be to QA your data.</strong></p><p>Because simply put, you should never blindly trust the data you work with. The data gathered in real life is never as clean and structured as Kaggle datasets. The work also doesn&#8217;t go to waste; a lot of these QA visualizations and summary statistics can be reused when you pull all the insights and story together in the end.</p><p>Some typical QA questions to look into:&nbsp;</p><h4>Are there any missing/NULL data?</h4><p>If there are missing/NULL values, is it big enough to be worried about? If it&#8217;s a small amount, should you simply ignore those records or delete them if they distort the conclusion you are trying to draw.</p><p>Speed often matters more in investigations than perfection.</p><p>If it&#8217;s a significant amount, did you just discover a data issue? In this case, you should aim to understand <em><strong>why</strong></em> the values are NULL. Is there a legitimate reason why this field wouldn&#8217;t have a value in many cases, or is there a problem with the data pipeline?&nbsp;</p><p>Once you understand the &#8220;why&#8221;, you can decide whether it makes sense to replace the NULL values or not.</p><h4>How is the data distributed, and does it align with your expectation?</h4><p>It&#8217;s a good habit to always pull some useful summary stats of the data you are looking into&#8202;:</p><ul><li><p>What&#8217;s the mean, median, percentiles?</p></li><li><p>Are there any outliers?</p></li><li><p>Does the distribution align with your understanding / expectation?</p></li></ul><p><strong>For example</strong>, let&#8217;s say one of your hypotheses is:</p><p>&#8220;The Growth team told me we recently increased our email send frequency from once a week to 3 times a week. That might have led to more email unsubscribes, and in turn led to fewer people coming to our website.&#8221;</p><p>To validate this hypothesis, you pulled email sent/unsubscribe data for the past weeks. Instead of directly looking at whether unsubscribe quantity increased, you might want to check the distributions of emails sent as well.</p><p>You might find that some people are receiving <em>way more</em> than the &#8220;3 emails per week&#8221; you have been told and assumed. <strong>That might send your investigation in a completely new direction:</strong></p><p>Is this expected behavior? Are the email sequences correctly configured, or could excessive email volume (e.g. because a suppression rule is not working) explain the metric drops you saw?</p><h3>Step 4: Test hypotheses</h3><p>Now that you know you have trust-worthy data, you can use it to test the hypotheses you have in mind.</p><p><strong>The key here is&#8202; to decide ahead of time what you need to accept/reject a hypothesis.</strong> This can prevent you from randomly exploring and getting distracted, or continuing down a branch of your investigation when you should have already moved on.</p><p>There are two components to what&#8217;s &#8220;sufficient&#8221;, one from an academic angle, one from a storytelling angle.</p><ul><li><p><strong>For the academic side</strong>, it&#8217;s usually best to stick to the standard suite of statistical tools for hypothesis testing (t-test, z-test etc.); I will not cover them in detail here, but likely provide a separate deep dive on experimentation in the future.</p></li><li><p>The harder piece for most people to figure out is the <strong>storytelling angle</strong>. What data do you need to create a convincing narrative about what has happened?</p></li></ul><p>Proving something academically through data is one thing; getting all stakeholders on the same page with regards to what&#8217;s going on is a whole other challenge.</p><p>And in order to convince others, <strong>YOU</strong> need to form an opinion first. Like I mentioned in<a href="https://medium.com/towards-data-science/one-mindset-shift-that-will-make-you-a-better-data-scientist-a015f8000ad7"> my article</a> about the key mindset data scientists should possess, a data scientist&#8217;s job is not to simply present data, but to use data to guide decisions.</p><p><em>If you can&#8217;t even convince yourself, how can you convince others?&nbsp;</em></p><h3>Step 5: Generate and present insights with ranked importance</h3><p>Once you identify the &#8220;smoking gun(s)&#8221; and can exit the &#8220;<em>hypothesis generation &#8594; hypothesis testing&#8221; </em>loop, you need to organize your findings and tell a compelling story.</p><p>Storytelling with data is another key craft for data scientists to master; it deserves its own article and I will cover it in detail in the future. In the meantime, here are some high-level tips:</p><p>My high level suggestion is to ONLY present relevant data points to avoid confusion and distraction. Ask yourself: &#8220;What is the simplest, most direct way I can explain what happened?&#8221; and then create the story by either writing the headlines for a slide deck or by drafting the outline of a summary document.</p><p>Only once you have the storyline in place, you should start to fill in the charts and tables supporting that narrative. This ensures that every piece of data you are showing is crucial for the explanation.</p><p>With that being said, all the &#8220;dead-end&#8221; leads you explored are not wasted. They go into the appendix, and you can point to them when people inevitably ask &#8220;have you considered&#8230;&#8221; / &#8220;what about&#8230;&#8221;.</p><p><strong>And all of your data QA?</strong> That also deserves a place in the appendix so when people ask questions about the data and its integrity, you can show your in-depth knowledge and the thoroughness of your analysis.</p><p>This helps build trust in you and your analysis: Because if you can&#8217;t even answer the most basic question about the data you are using, how can I trust the insights you developed?</p><p>In this process, it&#8217;s important to ask yourself: &#8220;What questions would others ask to challenge and poke holes in the narrative?&#8221;. If you want to go deeper on this topic, check out the articles on <a href="https://www.operatorshandbook.com/p/how-to-challenge-your-own-work-so">sanity checking</a> your work and <a href="https://www.operatorshandbook.com/p/never-get-blindsided-again">anticipating questions</a> from stakeholders by <a href="https://www.linkedin.com/in/torsten-walbaum/">Torsten Walbaum</a>.</p><div><hr></div><h2>Case Study: Investigating why total video view time is down on YouTube</h2><p><em><strong>Disclaimer:</strong> This is a simplified, made up example for demonstration purposes; all data is fabricated by the author</em></p><p>It&#8217;s time to put all of this into practice.</p><p>Let&#8217;s say you work at YouTube and someone from leadership asks you to dig into why the total video view time is down.</p><p><strong>Step 1: Understand the concrete question/issue at hand</strong></p><p>First, ask clarifying questions so you are <em><strong>crystal clear</strong></em> about the question you&#8217;re answering. For example:</p><ul><li><p>"What time period are we talking about?&#8221;</p></li><li><p>&#8220;What&#8217;s the benchmark we compare it to when we say the view time is down?&#8221;</p></li></ul><p>For demonstration purposes, let&#8217;s assume we found out that the weekly total video view time is down by 10% compared to last week.&nbsp;</p><p><strong>Step 2: Generate hypotheses</strong></p><p>Start by laying out the debugging structure (AKA the hypothesis tree) to break the problem into smaller pieces you can tackle. As much as possible, you should follow the <a href="https://www.divingintodata.com/p/analytics-frameworks-every-data-scientist">MECE</a> principle.</p><p>In this case, the first layer of the hypothesis tree is very simple:</p><blockquote><p>Total watch time = <strong>number of unique watchers * average watch time</strong></p></blockquote><p>Can we isolate the issue to one of part of that equation?</p><p>For this example, let&#8217;s assume we can: Average watch time is flat, but the number of unique watchers is down by 10%. Congrats, you just narrowed down your scope for the analysis.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5Zh0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5Zh0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png 424w, https://substackcdn.com/image/fetch/$s_!5Zh0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png 848w, https://substackcdn.com/image/fetch/$s_!5Zh0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png 1272w, https://substackcdn.com/image/fetch/$s_!5Zh0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5Zh0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png" width="608" height="487.8257956448911" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:958,&quot;width&quot;:1194,&quot;resizeWidth&quot;:608,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5Zh0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png 424w, https://substackcdn.com/image/fetch/$s_!5Zh0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png 848w, https://substackcdn.com/image/fetch/$s_!5Zh0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png 1272w, https://substackcdn.com/image/fetch/$s_!5Zh0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff45f429d-d55e-4f13-9c40-347b9f17eda6_1194x958.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Then one level down, one (of many) MECE way to break down all the factors is <strong>&#8220;internal factor&#8221;</strong> (e.g. data error, engineering change etc.) vs. <strong>&#8220;external factor&#8221;</strong> (e.g. seasonality, macroeconomic factor etc.). However, you need to be a little more concrete than this to have testable/verifiable hypotheses.&nbsp;</p><p>In reality (not in an interview setting), some of the low hanging fruit hypotheses can even be verified without much data retrieval. So it&#8217;s crucial to quickly identify and test them.</p><p>The hypotheses listed below are some easy-to verify ones that we can quickly rule out or accept:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CEKM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CEKM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png 424w, https://substackcdn.com/image/fetch/$s_!CEKM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png 848w, https://substackcdn.com/image/fetch/$s_!CEKM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png 1272w, https://substackcdn.com/image/fetch/$s_!CEKM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CEKM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png" width="678" height="569.2738275340394" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1110,&quot;width&quot;:1322,&quot;resizeWidth&quot;:678,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CEKM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png 424w, https://substackcdn.com/image/fetch/$s_!CEKM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png 848w, https://substackcdn.com/image/fetch/$s_!CEKM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png 1272w, https://substackcdn.com/image/fetch/$s_!CEKM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F276c1567-f989-4d59-8860-06b24c19ba94_1322x1110.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Step 3&#8202;&#8212;&#8202;Step 4: Decide what data you need to test the hypotheses, QA the data if needed, test the hypothesis and attribute the change</strong></p><p>You should easily be able to verify if any <strong>major initiatives (e.g. new launches)</strong> happened in the relevant time frame that could be responsible for tanking the metric.</p><p>Most companies A/B test big changes, so you should look through the data for recent experiments to see if any can explain the change. Let&#8217;s say we see that one of the changes we recently rolled out affected the SEO ranking of our videos on Google, and it caused a reduction in SEO traffic and fewer unique watchers as a result.&nbsp;</p><p>We can also quickly verify whether there&#8217;s <strong>any seasonality </strong>at play. In this case, we are looking at weekly aggregated data, so we can disregard intra-week seasonality. The best way to check this is to look at weekly data trends over long time periods (e.g. by plotting different years on top of each other) and see if there are any repeating patterns.</p><p>Judging <strong>macroeconomic factors</strong> is more of an art than science; they are typically not recurring, so you won&#8217;t have historical data to use as a benchmark. One way to attribute/estimate the effect of these factors to look at industry or competitor data since macroeconomic developments will affect the market broadly.</p><p>For example, during COVID, many retail stores experienced similar dips in foot traffic and revenue.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!O9Qk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!O9Qk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!O9Qk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!O9Qk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!O9Qk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!O9Qk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg" width="664" height="664" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:664,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!O9Qk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!O9Qk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!O9Qk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!O9Qk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fceb1a8be-f608-46e6-99b7-7449afb0ddfb_1600x1600.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Repeat steps 2&#8211;4 until you can attribute all the changes</strong></p><p>While practice problems might make it seem like that, in reality, it&#8217;s usually not just one or two factors that caused the change you&#8217;re observing. Often, many factors combined explain the metric movement, although some contribute more than others:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yVEC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yVEC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!yVEC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!yVEC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!yVEC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yVEC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg" width="688" height="688" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:688,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yVEC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!yVEC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!yVEC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!yVEC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb49ca148-9546-4d58-8f95-ff7a1cbe8e76_1600x1600.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>So it&#8217;s important to treat &#8220;<em>hypothesis generation -&gt; hypothesis testing</em>&#8221; as a recurring and iterative process until we can explain the whole magnitude of change we observed.&nbsp;</p><h2><strong>Final Thoughts</strong></h2><p>Using data to answer business questions is never an easy process. The questions are usually open-ended, and it&#8217;s up to you to develop hypotheses and eventually identify the relevant driver(s).</p><p>The most important part is to <strong>have a framework</strong> that you can follow when exploring different possibilities. Otherwise, it&#8217;s easy to get lost in an ocean of random guesses and hunches.</p><p>And without a way to systematically rule out hypotheses one by one, working groups often forget what they have already concluded and what they still need to test, causing you to go in circles.</p><p>Hopefully this article provides you a helpful guide for structured investigations.</p><p><strong>Since the final (and arguably most important) step is to package the findings into a convincing story, I will cover data storytelling in my next post. Stay tuned!</strong></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.divingintodata.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Diving Into Data! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Analytics Frameworks Every Data Scientist Should Know]]></title><description><![CDATA[Why I believe my experience at McKinsey made me a better data scientist]]></description><link>https://www.divingintodata.com/p/analytics-frameworks-every-data-scientist</link><guid isPermaLink="false">https://www.divingintodata.com/p/analytics-frameworks-every-data-scientist</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Wed, 28 Aug 2024 11:43:43 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/bba71a8f-080e-4a63-984a-c0c344c363b5_1400x1055.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Unlike a lot of data scientists in tech, my career in data science started in consulting, and I think it&#8217;s the best career move I have made. Say what you will about consulting culture and the hours, I learned so much in the two years I was at McKinsey and I still benefit from it every day.</p><p>As a manager, part of my job is to coach data scientists on the team when it comes to projects and career growth in general. I realized what junior data scientists struggle with the most is usually not the technical/execution part of the job &#8212; that&#8217;s the easy to teach/easy to learn part.</p><p>It&#8217;s usually the more abstract/soft-skill-related part of the job that most people don&#8217;t know how to navigate &#8212; things like how to break down an abstract business problem into smaller, clearly defined analyses that can eventually lead to concrete business impact.</p><p><strong>These are the things I got to practice day in day out as a consultant, and I think the learnings carry over to data science very well.</strong></p><p>To help my fellow data scientists, I want to summarize my learnings from my consulting days so you can benefit from them without going through the grind.</p><p><strong>In this article, I will:</strong></p><ul><li><p>Convince you why consulting-style training can immensely benefit data scientists of any level</p></li><li><p>Walk you in detail through the most valuable frameworks I learned at McKinsey that you can apply in your day-to-day work</p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.divingintodata.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribe for free to receive hands-on data science career guide in your inbox!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1><strong>Why I think consulting experience can benefit every data scientist</strong></h1><h2><strong>Reason 1: It grows your ability to learn about an area and make impact fast.</strong></h2><p>Because consulting projects are always strapped for time (consultants are billed by the hour), consultants don&#8217;t have the luxury to spend months to learn about all the background, context and client-specific subject matter in depth before being pressured to deliver solutions.</p><p>So consultants are trained to learn about an area efficiently and make impact along the way. There are a lot of skills involved in this process:</p><ul><li><p>Asking the right questions to the right people; what is the information you <em><strong>really</strong></em> need to know to solve the problem?</p></li><li><p>Discovering gaps and figuring out short-term solutions to plug the gap</p></li><li><p>Turning short-term solutions into long-term ones and identifying the right stakeholders to help pushing things forward.</p></li></ul><blockquote><p><em><strong>These skills are essential because while &#8220;great things take time&#8221;, people still expect you to deliver incremental &#8220;good things&#8221; on the way of delivering great things</strong></em></p></blockquote><p>Delivering the (albeit not perfect) MVP in a short timeframe that makes a big (enough) impact is the goal. This forces consultants to constantly deploy the 80/20 rule:</p><blockquote><p><em><strong>With 20% of the effort you can get 80% of the impact.</strong></em></p></blockquote><p>There are two reasons for this:</p><ol><li><p><strong>Every activity quickly hits diminishing returns. </strong>For example, building the first version of a dashboard when the team previously had <em><strong>no</strong></em> way of looking at data has a huge impact. As you keep refining the dashboard, you quickly get to the point where a new filter or toggle only adds a small benefit for individual users</p></li><li><p><strong>You don&#8217;t need perfect accuracy to make most decisions. </strong>For example, to make a &#8220;Go&#8221; or &#8220;No-Go&#8221; decision, you often just need to know whether a number will land in a certain ballpark range (e.g. $150M &#8212; $200M), not what the exact number will be.</p></li></ol><p>This will be completely out of the comfort zone of a lot of engineers and data scientists. But remember that at the end of the day we are trying to run a business, not write a white paper; it&#8217;s crucial to build this mental flexibility.</p><h2><strong>Reason 2: It teaches you to be a full-stack data scientist when needed.</strong></h2><p>Most consulting teams are not staffed with data engineers + data scientists + ML engineers + PMs + biz ops.</p><p>So if you need raw data cleaned and built into pipelines to run an analysis, you don&#8217;t have data engineers to count on, you have to pick it up; if you need to estimate the impact of an initiative, put on your biz ops hat and do the back-of-the-envelop calculation yourself.</p><p><strong>Most data scientists in consulting are full-stack data scientists; they can wear different hats when needed.</strong> For one project, we built an end-to-end AB test analytics solution in a month for an e-commerce company; for another project, we built a prediction model for movies&#8217; box office performance in a few weeks so the team can make fast decisions about which screenplays to fund.</p><ul><li><p><strong>Are these solutions as good as a proper SaaS product from a big tech company? </strong>Definitely not.</p></li><li><p><strong>But are they good enough to solve a business problem timely?</strong> Almost certainly.</p></li></ul><p>And most importantly, I learned a ton about how to deliver solutions end-to-end wearing different hats. This makes my collaboration nowadays with partner teams like ML engineers, data engineers and PMs that much smoother because I know a little about their jobs.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!b2df!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!b2df!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png 424w, https://substackcdn.com/image/fetch/$s_!b2df!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png 848w, https://substackcdn.com/image/fetch/$s_!b2df!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png 1272w, https://substackcdn.com/image/fetch/$s_!b2df!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!b2df!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png" width="1400" height="538" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:538,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!b2df!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png 424w, https://substackcdn.com/image/fetch/$s_!b2df!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png 848w, https://substackcdn.com/image/fetch/$s_!b2df!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png 1272w, https://substackcdn.com/image/fetch/$s_!b2df!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0cc8e96-aa45-48ea-ae99-c98d6a45b282_1400x538.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sometimes you have to wear all the hats (image by author)</figcaption></figure></div><h2><strong>Reason 3: It &#8220;forces&#8221; you to see the big picture and communicate in a succinct but clear way (both written and verbal communication).</strong></h2><p>While in bigger tech companies you won&#8217;t get a lot of visibility in front of executives as a junior IC, consultants can usually bypass the hierarchy and get the opportunity to work directly with C-level executives.</p><p>These executives usually have a million things on their plates and they are constantly context switching &#8212; so communication with them needs to be high level and effective.</p><blockquote><p><em><strong>Junior ICs in the analytics realm usually struggle with this type of communication because the analytics work itself requires you to be deep in the weeds and it&#8217;s hard to then to zoom out when communicating your findings.</strong></em></p></blockquote><p>Consultants spends hours honing every client-facing deck and documentation to nail this skill. <strong>Every piece of material goes through peer review, manager review and review from partners so people at different levels of familiarity with the project can help judge whether the material is easy to digest.</strong></p><p>Compared to junior ICs in the big tech companies who may only get a handful of opportunities per year to practice this type of communication, consultants&#8217; frequent communication with execs forces them to practice and master the ability to not only see but also communicate the big picture.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kY9M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kY9M!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg 424w, https://substackcdn.com/image/fetch/$s_!kY9M!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg 848w, https://substackcdn.com/image/fetch/$s_!kY9M!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!kY9M!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kY9M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg" width="1400" height="579" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:579,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!kY9M!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg 424w, https://substackcdn.com/image/fetch/$s_!kY9M!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg 848w, https://substackcdn.com/image/fetch/$s_!kY9M!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!kY9M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5752213-924b-4df8-9621-1a1b5d03d2ee_1400x579.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image by Author</figcaption></figure></div><h1><strong>McKinsey&#8217;s &#8220;Secret Sauce&#8221;: The most useful consulting frameworks and how to apply them to analytics</strong></h1><p>But not everyone will get a chance or is interested in being in consulting during their career.</p><p>So let me introduce you to some of the most important frameworks that can help you get the essence of the consulting way of thinking without going through the grind.</p><h2><strong>MECE framework</strong></h2><p>If there&#8217;s one thing you take away from this article, it should be this framework.</p><p>This is the first framework new consultants are introduced to and it&#8217;s a framework that can help you think through just about any problem in life.</p><p>MECE stands for <em><strong>mutually exclusive</strong></em> and <em><strong>collectively exhaustive</strong></em>.</p><p>It&#8217;s a way to break down something big or abstract into smaller buckets to make the problem more approachable while making sure that there&#8217;s no duplicated effort (<em><strong>&#8220;mutually exclusive&#8221;</strong></em>) or missed area (<em><strong>&#8220;collectively exhaustive&#8221;</strong></em>) in the process.</p><p>Let me give an example: Let&#8217;s say you want to segment the member base of Instagram by age. It&#8217;s not MECE to have groups like &#8220;underage (&lt;18)&#8221;, &#8220;teenagers (13&#8211;19)&#8221; because there&#8217;s duplication.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4mjh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4mjh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png 424w, https://substackcdn.com/image/fetch/$s_!4mjh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png 848w, https://substackcdn.com/image/fetch/$s_!4mjh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png 1272w, https://substackcdn.com/image/fetch/$s_!4mjh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4mjh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png" width="914" height="584" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:584,&quot;width&quot;:914,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!4mjh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png 424w, https://substackcdn.com/image/fetch/$s_!4mjh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png 848w, https://substackcdn.com/image/fetch/$s_!4mjh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png 1272w, https://substackcdn.com/image/fetch/$s_!4mjh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe18598b0-6802-4165-a4d2-9a3f4a36558b_914x584.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image by Author</figcaption></figure></div><p>Similarly, segmenting the data into groups like &#8220;underage (&lt;18)&#8221; and &#8220;age 18&#8211;60&#8221; is not MECE; while there is no overlap, it doesn&#8217;t cover the whole universe of possibilities. What about our friendly senior citizens above 60?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vmb-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vmb-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vmb-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vmb-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vmb-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vmb-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg" width="1400" height="1220" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1220,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!vmb-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vmb-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vmb-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vmb-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44bf59e2-535d-482b-ba2f-18e4841fd724_1400x1220.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image by Author</figcaption></figure></div><p>Not all problem can be neatly broken down in the MECE way; but if you apply concerted effort to solve problems with this framework, your approach will be a lot more comprehensive.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dKk2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dKk2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dKk2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dKk2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dKk2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dKk2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg" width="1400" height="1587" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1587,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!dKk2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dKk2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dKk2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dKk2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1f9b221-f1ba-436c-b39f-97f90bc01118_1400x1587.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image by Author</figcaption></figure></div><p>MECE is the foundation of most other frameworks; in other words, any good framework that involves any segmentation or categorization should be mutually exclusive and collectively exhaustive.</p><h2><strong>Issue Trees</strong></h2><p>An issue tree is the best embodiment of the MECE framework. It is commonly used to decompose a complicated problem and show how different factors contribute to the whole.</p><p><strong>For example</strong>, the question &#8220;How can an e-commerce company increase its profit?&#8221; seems like a daunting one without a framework. A lot of people start to brainstorm without any structure:</p><ul><li><p>&#8220;Let&#8217;s do more marketing on social media&#8221;, or</p></li><li><p>&#8220;Let&#8217;s move our factories to lower-cost countries&#8221;</p></li></ul><p>These might be solid ideas, but where did they come from and how can you make sure you explored all the levers at your disposal? The truth is if you brainstorm like this, you will very likely only explore a couple of options and miss many others.</p><p>But if you can break down the problem more systematically, you can make sure you explore all avenues and it can even help you delegate since each sub-tree is a smaller-scale problem that can be solved individually.</p><p>Let me show you how:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!osuF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!osuF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg 424w, https://substackcdn.com/image/fetch/$s_!osuF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg 848w, https://substackcdn.com/image/fetch/$s_!osuF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!osuF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!osuF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg" width="1400" height="791" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:791,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!osuF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg 424w, https://substackcdn.com/image/fetch/$s_!osuF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg 848w, https://substackcdn.com/image/fetch/$s_!osuF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!osuF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfe33489-80ae-4940-85e2-4c0b8c07c914_1400x791.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image by Author</figcaption></figure></div><p>The beauty of issue tree is it breaks down a huge problem into smaller ones so it&#8217;s easier to digest &#8212; thinking about ways to &#8220;increase product variety&#8221; is a lot more concrete and digestible than thinking about ways to &#8220;increase profit&#8221; directly.</p><h2><strong>Hypothesis Tree</strong></h2><p>A typical DS interview question is &#8220;XX metric is down, how would you go about investigating what&#8217;s causing it?&#8221; A lot of candidates again start grabbing hypothesis out of thin air. What interviewers (or your stakeholders and managers when it comes to your day-to-day work) want to see is that you can generate and test hypotheses in a structured way.</p><p>Hypothesis trees are a variant of issues trees and another great way of utilizing the MECE framework as data scientists is building hypothesis trees for those types of questions. Let&#8217;s say we work at or interview with a food delivery marketplace and want to investigate &#8220;Why did the # of deliveries in NY go down by 10% in the last week?&#8221;</p><p>We can start generating hypotheses in a MECE way like below:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JAca!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JAca!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg 424w, https://substackcdn.com/image/fetch/$s_!JAca!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg 848w, https://substackcdn.com/image/fetch/$s_!JAca!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!JAca!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JAca!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg" width="1400" height="1049" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1049,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JAca!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg 424w, https://substackcdn.com/image/fetch/$s_!JAca!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg 848w, https://substackcdn.com/image/fetch/$s_!JAca!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!JAca!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddd87df5-be65-427e-b135-6cae951e1d56_1400x1049.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image by Author</figcaption></figure></div><p>Once you have this framework, you can quickly verify or reject the hypotheses one by one and not have that nagging feeling &#8220;have I missed anything&#8221;.</p><p>Of course there are shortcuts you can take once you have accumulated some domain knowledge on the job &#8212; for example, if you know that every year during Chinese New Year delivery numbers go down (maybe because the majority of the customers are Chinese and they prefer to have home-cooked meals during Chinese New Year), then you can quickly check that hypothesis first without having to go through an entire framework.</p><p>But if you deal with a new problem, hypothesis trees can save the day.</p><h2><strong>2x2 Matrix</strong></h2><p>The two-by-two matrix might be the most well-known consulting framework of all.</p><p>It&#8217;s sometimes frowned upon because it&#8217;s extremely simple; but in my experience, this simplicity is often helpful because it <strong>forces you to cut through all the (sometimes unnecessary) complexity</strong> and distill the issue down to its core.</p><p>A two-by-two matrix helps you categorize something into four categories across two dimensions. <strong>Let&#8217;s look at a simple example;</strong> let&#8217;s say you are evaluating analytics projects for the next quarter.</p><p>Instead of assessing projects across many different factors, which risks dragging out the planning process, you could just focus on two key factors:</p><ol><li><p>Does the proposed project support any of the key company priorities?</p></li><li><p>What&#8217;s the expected business impact?</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DMSd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DMSd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DMSd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DMSd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DMSd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DMSd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg" width="1400" height="1049" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1049,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!DMSd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DMSd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DMSd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DMSd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ca14b03-1044-4e54-83e3-b1bf5585492f_1400x1049.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image by Author</figcaption></figure></div><p><strong>Each combination requires a different approach.</strong> Laying the problem out in this simple grid can help facilitate discussions and move things forward, which is often more valuable than 100% accuracy.</p><h2><strong>Minto Pyramid</strong></h2><p>This one is mentioned in my articles countless times already so you know how important I think it is. It&#8217;s THE framework anyone should adopt when it comes <a href="https://www.divingintodata.com/p/how-to-better-communicate-as-a-data-scientist-6fc5428d3143">communication</a>.</p><p>Adopting it takes practice because it requires you to communicate in almost the opposite order compared to how we actually do the work. Most analytics work is open-ended and explorative in nature; so it&#8217;s not until the later stages of a project that we have some findings and eventually a conclusion.</p><p>But when it comes to communication, it&#8217;s crucial that we focus on the conclusion FIRST, then the supporting arguments, and lastly the supporting evidence. This requires you to structure your thoughts in advance rather than just walking through a recap of your analysis.</p><p>I have drawn the <a href="https://www.divingintodata.com/p/the-most-undervalued-skill-for-data">pyramid itself</a> in previous articles so I will not repeat it here, but it might be helpful to demonstrate the framework using a more concrete example:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!R3_5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!R3_5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg 424w, https://substackcdn.com/image/fetch/$s_!R3_5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg 848w, https://substackcdn.com/image/fetch/$s_!R3_5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!R3_5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!R3_5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg" width="1400" height="1126" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1126,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!R3_5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg 424w, https://substackcdn.com/image/fetch/$s_!R3_5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg 848w, https://substackcdn.com/image/fetch/$s_!R3_5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!R3_5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fead7ccf4-045c-4ebd-9211-46491628015c_1400x1126.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image and Example by Author</figcaption></figure></div><p>Contrary to how a lot of data scientists&#8217; default storytelling mode &#8220;we did A, then we discovered B &#8230; and here&#8217;s the conclusion&#8221;, the pyramid framework captures the audience&#8217;s attention a lot better; and if they don&#8217;t have time to read the whole email, they won&#8217;t miss the conclusion.</p><h1><strong>Conclusion: To have a framework at all</strong></h1><p>The best framework in my opinion is the one that works for you. Having a framework at all is always better than jumping in with no structure.</p><p>The frameworks above are not supposed to be the &#8220;end all be all&#8221; as there are so many more frameworks out there developed to solve different problems; they are just supposed to get you started in building this muscle.</p><p>The most important thing is to develop a way of thinking &#8212; the structured way of thinking. Without a framework, you are essentially letting your intuition guide you and hoping it will lead to the right solution by chance. But with a structured framework, on the other hand, problem solving becomes a repeatable and scalable skill.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.divingintodata.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribe for free to receive hands-on data science career guide in your inbox!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[The Most Undervalued Skill for Data Scientists]]></title><description><![CDATA[Why writing is crucial for technical roles, and how to be good at it]]></description><link>https://www.divingintodata.com/p/the-most-undervalued-skill-for-data</link><guid isPermaLink="false">https://www.divingintodata.com/p/the-most-undervalued-skill-for-data</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Wed, 03 Jul 2024 04:22:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!9CM3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9CM3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9CM3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png 424w, https://substackcdn.com/image/fetch/$s_!9CM3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png 848w, https://substackcdn.com/image/fetch/$s_!9CM3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png 1272w, https://substackcdn.com/image/fetch/$s_!9CM3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9CM3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png" width="1400" height="933" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:933,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9CM3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png 424w, https://substackcdn.com/image/fetch/$s_!9CM3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png 848w, https://substackcdn.com/image/fetch/$s_!9CM3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png 1272w, https://substackcdn.com/image/fetch/$s_!9CM3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7004319-13f1-4938-a156-b1e646ae5c26_1400x933.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>&#8220;Why is my manager nit picking my write-up? What difference does it make changing the wording from X to Y?&#8221;</p><p>You have probably caught yourself thinking this when you see your managers&#8217; numerous suggestions all over your document; I know I have. In fact, I used to think that writing is the most trivial part of the job of a data scientist; because the analyses and numbers should speak for themselves right? <strong>Wrong!</strong></p><p>Over the last years, I have realized that writing is an essential skill for data scientists, and that the ability to write well is one of the key things that sets high-impact data scientists apart from their peers.</p><p><strong>In this article, I will first convince you that writing is at least as important as your technical skills, and then give you concrete tips to help you improve your writing.</strong></p><h1><strong>Why is writing so important for data scientists?</strong></h1><p><strong>1. It&#8217;s used everywhere in the corporate world</strong>&#8202;&#8212;&#8202;I have highlighted the importance of communication in my previous <a href="https://www.divingintodata.com/p/how-to-better-communicate-as-a-data-scientist-6fc5428d3143">articles</a> and like it or not, majority of communication in the corporate world happens in a written form. From project-scoping documents to weekly updates, analysis and experiment write-ups, feedback and performance reviews, JIRA tickets and wiki pages, everything counts on effective written communication to get the message across.&nbsp;</p><p><strong>2. Writing helps to bring clarity to your thinking process&#8202;&#8212;&#8202;</strong>Paul Graham, cofounder of the famous startup accelerator Y Combinator (who&#8217;s a computer scientist AND writer) famously said in <a href="https://www.paulgraham.com/words.html">one of his memos</a>:</p><blockquote><p>If writing down your ideas always makes them more precise and more complete, then no one who hasn&#8217;t written about a topic has fully formed ideas about it. And someone who never writes has no fully formed ideas about anything nontrivial.&#8202;</p><p>&#8212;&#8202;Paul Graham</p></blockquote><p>Very often, when you start writing things down, you realize how little you know about a subject and the potential gaps in your thinking/analysis.&nbsp;</p><p><strong>3. Writing is the &#8220;last mile&#8221; of your data science work. </strong>None of your stakeholders will read your SQL query or look at your Jupyter Notebook (a lot of engineers and data scientists would like believe the opposite but trust me, they likely won&#8217;t). If you want your work to be understood by others and influence decisions, then you need to do the final step of packaging it in an effective write-up. If you skip this step, it&#8217;s like leaving the package in the warehouse instead of delivering it to the customer.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JKPS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JKPS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png 424w, https://substackcdn.com/image/fetch/$s_!JKPS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png 848w, https://substackcdn.com/image/fetch/$s_!JKPS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png 1272w, https://substackcdn.com/image/fetch/$s_!JKPS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JKPS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png" width="1344" height="742" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:742,&quot;width&quot;:1344,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1825778,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JKPS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png 424w, https://substackcdn.com/image/fetch/$s_!JKPS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png 848w, https://substackcdn.com/image/fetch/$s_!JKPS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png 1272w, https://substackcdn.com/image/fetch/$s_!JKPS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8cab502-8a39-4615-8052-c5032f7b1474_1344x742.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>What does &#8220;good&#8221; writing look like in data&nbsp;science?</h1><p><strong>Be clear about your audience. </strong>If you are writing for everyone, you are writing for no one. Be very specific about who this particular piece of writing is for, and tailor it to that audience and their needs.</p><p><strong>Focus on the &#8220;so what&#8221;; the sausage-making goes in the appendix. </strong>As data scientists, we love to talk about the complex analysis we did or how we designed the experiment. Because we put in all that work it feels so wasteful to NOT talk about it. But the harsh truth is, most of the time, our audience does not care; they just want to understand the takeaways.</p><p>You can describe the technical details of your work in the appendix in case someone wants to go deep, but the main part should focus on the insights and recommendations.&nbsp;</p><p><strong>Have a clear storyline. </strong>Fiction or not, every piece of (long-form) writing should be a story. Because that&#8217;s how humans communicate and that how our brains process information. Usually the storyline for analysis goes like this:</p><p>&#11157; We found out about something interesting and this is why you should care about it / what you should do (<strong>summary to get your readers hooked, including a recommendation if applicable</strong>)</p><p>&#11157; Here&#8217;s how we arrived at these insights (<strong>analysis details for the curious explorers</strong>)</p><p>&#11157; Here are the caveats and alternative paths forward (<strong>optionality in case someone challenges the recommendation</strong>)</p><p>&#11157; Here are additional resources you might find interesting (<strong>appendix for those that really want to go deep on the topic</strong>)</p><p>It might help to build the skeleton first before adding the details. If the story depends on how the analysis goes (which is often the case for DS analysis since the nature is more exploratory), at least figure out the structure of the doc before diving into the details.</p><p>If you are building a deck/presentation, I have a little&nbsp;trick that I learned from my consulting days &#8212; orchestrate your slide titles as the main storyline. Imagine the reader flipping through the deck ONLY reading each slide&#8217;s title, they should get a pretty good idea about the key takeaways. </p><p><strong>Have a clear summary. </strong>If you remember the <a href="https://www.divingintodata.com/p/how-to-better-communicate-as-a-data-scientist-6fc5428d3143">pyramid principle</a> I mentioned in my previous post about communication, it&#8217;s especially important to written communication. Because the summary is your first touch point with your readers, it should be interesting enough to capture the their attention so they want to read on; at the same time, it should capture all the essence so if they decide to stop reading after the summary, they got all the most crucial information they need to know.&nbsp;</p><p><strong>Be succinct.</strong> When it comes to writing, less is more.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6JH_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6JH_!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif 424w, https://substackcdn.com/image/fetch/$s_!6JH_!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif 848w, https://substackcdn.com/image/fetch/$s_!6JH_!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif 1272w, https://substackcdn.com/image/fetch/$s_!6JH_!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6JH_!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif" width="320" height="320" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:245,&quot;width&quot;:245,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;scrantonpaper: Andy: Here, we have a word code, the same... on Make a GIF&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="scrantonpaper: Andy: Here, we have a word code, the same... on Make a GIF" title="scrantonpaper: Andy: Here, we have a word code, the same... on Make a GIF" srcset="https://substackcdn.com/image/fetch/$s_!6JH_!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif 424w, https://substackcdn.com/image/fetch/$s_!6JH_!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif 848w, https://substackcdn.com/image/fetch/$s_!6JH_!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif 1272w, https://substackcdn.com/image/fetch/$s_!6JH_!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2779b4c3-8dda-4d39-a0ca-f0ec6a44f7f2_245x245.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Keep it simple. </strong>We work in a technical field and use technical jargon all the time. Often, data scientists think it makes them seem more competent if they use technical language. If you look closely, though, you will notice that the more senior people become, the simpler their choice of words. VPs and C-Level executives can explain complex topics in language that anyone can understand, regardless of their (technical) background. You can use tools like the <a href="https://hemingwayapp.com/">Hemingway app</a> to check if your writing is too complex.</p><p><strong>Use signposting. </strong>Signposting is a technique that makes it easier for the reader to understand your document. The core idea is to use words and phrases that make it immediately clear what the sentence or section is about, so that readers can quickly skim the text and make sense of it. For example:</p><ul><li><p>Using the phrase &#8220;for example&#8221; before you give an example</p></li><li><p>Writing &#8220;in conclusion&#8221; before you summarize</p></li><li><p>Labeling sequences of arguments with &#8220;Firstly / secondly / finally&#8221;</p></li></ul><p><strong>Add visualizations. </strong>It&#8217;s a cliche for a reason: &#8220;A picture says more than a thousand words.&#8221; When you are trying to communicate dense technical content, a crisp diagram, framework or flowchart can help a lot to get your point across. For example, illustrating what &#8220;pyramid principle&#8221; means like the graph below will hopefully give you a better idea about how to carry it out in your own writing.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FAXJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FAXJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png 424w, https://substackcdn.com/image/fetch/$s_!FAXJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png 848w, https://substackcdn.com/image/fetch/$s_!FAXJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png 1272w, https://substackcdn.com/image/fetch/$s_!FAXJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FAXJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png" width="1456" height="1019" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1019,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FAXJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png 424w, https://substackcdn.com/image/fetch/$s_!FAXJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png 848w, https://substackcdn.com/image/fetch/$s_!FAXJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png 1272w, https://substackcdn.com/image/fetch/$s_!FAXJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d18878c-61b3-4d9a-b00d-4c721f4d688c_1600x1120.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"></figcaption></figure></div><h1>How can you improve your&nbsp;writing?</h1><p><strong>Read a lot. </strong>This includes both guides on how to write well (by reading this post, you&#8217;ve made the first step!) as well as strong technical writing that you can imitate (you can find some examples here).</p><p>If you want to dig deeper into the science of writing well, I recommend you take a look at <a href="https://www.amazon.com/Writing-Well-Classic-Guide-Nonfiction/dp/0060891548">&#8220;On Writing Well&#8221;</a> by William Zinsser.</p><p><strong>Practice, practice, practice. </strong>As with everything else, practice makes perfect. Here are a few concrete things you can do to practice your writing:</p><ol><li><p><strong>Document your work in a personal wiki.</strong> Few data scientists do this in my experience, but it&#8217;s a very useful resource to have and a great way to get more writing practice.</p></li><li><p><strong>Write structured Slack messages.</strong> Most of the Slack messages we send and receive all day feel like a stream of consciousness (or worse, like teenagers&#8217; text messages). People tend to type what comes to their mind and hit &#8220;Send&#8221; without taking the time to structure the message in a way that makes it easy for the reader to understand it. Writing succinct, structured Slack messages using the principles discussed above is a great way to stand out.</p></li><li><p><strong>Write online.</strong> Writing these posts on Medium is ongoing writing practice for me. Try it out; you might even enjoy it and find an audience that enjoy your insights.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nMpA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nMpA!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif 424w, https://substackcdn.com/image/fetch/$s_!nMpA!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif 848w, https://substackcdn.com/image/fetch/$s_!nMpA!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif 1272w, https://substackcdn.com/image/fetch/$s_!nMpA!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nMpA!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif" width="480" height="270" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:270,&quot;width&quot;:480,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Anthony Carrigan Barry GIF by HBO&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Anthony Carrigan Barry GIF by HBO" title="Anthony Carrigan Barry GIF by HBO" srcset="https://substackcdn.com/image/fetch/$s_!nMpA!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif 424w, https://substackcdn.com/image/fetch/$s_!nMpA!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif 848w, https://substackcdn.com/image/fetch/$s_!nMpA!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif 1272w, https://substackcdn.com/image/fetch/$s_!nMpA!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F612235a0-c961-4b69-afd9-d6518e9fb4e1_480x270.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">How it can feel when you start writing; ignore the self-doubt (and haters) and keep going!</figcaption></figure></div></li></ol><p><strong>Challenge yourself. </strong>&#8220;You are your own worst enemy&#8221; might not be a bad thing when it comes to writing. You need to be able to read your own writing like it&#8217;s your first time seeing it so you can be objective about what&#8217;s missing, what&#8217;s confusing and what needs to be shortened.</p><p><strong>Ask others to be your devil&#8217;s advocate. </strong>Being your own devil&#8217;s advocate can be extremely hard sometimes, because true objectivity requires you to abandon your current knowledge about the topic and your ego. It&#8217;s sometimes just easier to find another challenger for your work. Ideally this is someone who truly knows nothing about the subject matter and is willing to be very honest with you about their opinion.&nbsp;</p><h1>What are some good examples of strong technical writing?</h1><p>I described above what good writing looks like in theory but it&#8217;s easier to understand once you see a few examples of it. Here I&#8217;m providing some concrete examples for some of the points I mentioned above so you can have a better idea about how to put those suggestions into practice.&nbsp;</p><p><strong>Clear audience</strong></p><p>The <a href="https://www.newsletter.datadrivenvc.io/">Data-Driven VC</a> newsletter is targeted specifically towards Venture Capitalists and startup founders who want to take a data-driven approach to investing in and growing companies. While this creates a niche blog that might not appeal to everyone, picking this specific target audience makes it easier to provide value for them.</p><p><strong>Strong visualizations</strong></p><p>For a crash-course on how to visualize complex systems and technical subject matter in general, check out <a href="https://blog.bytebytego.com/">ByteByteGo</a>. Their diagrams make it super easy to understand things that would take multiple paragraphs of jargon to describe accurately.</p><p>SeattleDataGuy is also using plenty of visualizations, but typically in a slightly less serious way (e.g. see his post on Apache Iceberg <a href="https://seattledataguy.substack.com/p/apache-iceberg-what-is-it">here</a>).</p><p><strong>Keeping it simple</strong></p><p>Gergely Orosz, who writes The Pragmatic Engineer, does a good job summarizing complex topics in relatively simple terms. E.g. check out his <a href="https://newsletter.pragmaticengineer.com/p/ai-coding-agents">post on how AI Software Engineering Agents work</a>.</p><p><strong>Combining best practices: Simple, succinct language with clear visualizations</strong></p><p><a href="https://blog.dailydoseofds.com/p/confidence-interval-and-prediction-d53">Daily Dose of Data Science</a> is a prime example for how to combine multiple best practices to produce easy-to-understand but still insightful data science content.</p><p>For example, check out their recent post on <a href="https://blog.dailydoseofds.com/p/confidence-interval-and-prediction-d53">Confidence Intervals and Prediction Intervals</a>. Or their super brief, but informative post on <a href="https://blog.dailydoseofds.com/p/5-cross-validation-techniques-explained">Cross-Validation Techniques</a>.</p><h1>In conclusion</h1><p>Being able to write (well) is crucial for your work, even (or, you could argue, especially) for technical folks. Being able to succinctly communicate your thoughts on paper takes practice. Reading a lot, writing a lot and being open to feedback are the keys to getting better at this craft.</p>]]></content:encoded></item><item><title><![CDATA[How to Better Communicate as a Data Scientist]]></title><description><![CDATA[Most valuable lessons from my time at McKinsey]]></description><link>https://www.divingintodata.com/p/how-to-better-communicate-as-a-data-scientist-6fc5428d3143</link><guid isPermaLink="false">https://www.divingintodata.com/p/how-to-better-communicate-as-a-data-scientist-6fc5428d3143</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Wed, 29 May 2024 14:10:56 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/6f0f0785-6af5-4679-b033-ead3a3b468f6_800x800.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In my <a href="https://www.divingintodata.com/p/one-mindset-shift-that-will-make-you-a-better-data-scientist-a015f8000ad7">previous post</a>, I made the point that &#8220;<em>communicating our work is as important as writing code and building the models and products to get the job done.&#8221;</em></p><p>Unfortunately, in reality I have observed a lot of data scientists (especially junior ones) struggling with the story-telling part of the job. They have all the data in hand, but for some reason the message just doesn&#8217;t seem to get through to the audience and the analysis ends up being another pretty report on the bookshelf collecting dust with no impact. So what exactly went wrong?</p><p>Here are some tips that I have learned over the years that will help you improve your communication as a data scientist.</p><h2><strong>Always use the pyramid principle</strong></h2><p>A lot of data scientists communicate in a linear manner (what is demonstrated on the left in the graph below). This is understandable because that&#8217;s how we experience things, and also how we tell stories in our everyday lives. But it&#8217;s not the most effective way to get important information across because audience can easily get lost or bored in this type of communication.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1ndz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314acd16-45ad-4bed-a075-618c84a194c3_800x386.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1ndz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314acd16-45ad-4bed-a075-618c84a194c3_800x386.png 424w, https://substackcdn.com/image/fetch/$s_!1ndz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314acd16-45ad-4bed-a075-618c84a194c3_800x386.png 848w, https://substackcdn.com/image/fetch/$s_!1ndz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314acd16-45ad-4bed-a075-618c84a194c3_800x386.png 1272w, https://substackcdn.com/image/fetch/$s_!1ndz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314acd16-45ad-4bed-a075-618c84a194c3_800x386.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1ndz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314acd16-45ad-4bed-a075-618c84a194c3_800x386.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/314acd16-45ad-4bed-a075-618c84a194c3_800x386.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1ndz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314acd16-45ad-4bed-a075-618c84a194c3_800x386.png 424w, https://substackcdn.com/image/fetch/$s_!1ndz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314acd16-45ad-4bed-a075-618c84a194c3_800x386.png 848w, https://substackcdn.com/image/fetch/$s_!1ndz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314acd16-45ad-4bed-a075-618c84a194c3_800x386.png 1272w, https://substackcdn.com/image/fetch/$s_!1ndz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314acd16-45ad-4bed-a075-618c84a194c3_800x386.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Image by&nbsp;Author</figcaption></figure></div><p>The gold standard when it comes to communication in business is the pyramid principle&#8202;&#8212;&#8202;the one demonstrated in the graph on the right. To explain it simply, always start with the most important message&#8202;&#8212;&#8202;the insights; then drill down to the details.</p><p>Similar to headlines of newspaper articles, the insights will grab the audience attention. They are what you ultimately want to get across, so you need to put them front and center. If the audience is interested in the details, they will read (listen) on; if not, they got the most important message.</p><h2>Clean up the formatting and&nbsp;grammar</h2><p>This may seem like a nit pick, but let me tell you why it might be more important than you think.</p><p>Most of us know Albert Mehrabian&#8217;s &#8220;7&#8211;38&#8211;55&#8221; rule in verbal communication, which highlights that only 7% of the communication is about the actual content, the rest is composed of tonality (38%) and body language, facial expression etc. (55%). When it comes to written communication, it might not be as extreme, but you can bet that the visual presentation of your work has already communicated the quality of the work in silence.</p><p>When people open a doc with numbers and paragraphs that are in different font, not with the right indentation, full of underlines for misspelled words etc., they will likely start doubting the amount of diligence that went into the analysis, and by extension the quality of the conclusions as well.</p><p>It&#8217;s also disrespectful to the audience; messy formatting puts an additional mental burden on the reader that has to make sense of it all.</p><h3><strong>So what can you do about it?</strong></h3><p>The general rule of thumb is make the numbers as digestible as possible and get rid of all the useless details.</p><p>Let me use an example to demonstrate what I mean: Let&#8217;s say you want to convey insights about the composition of user base in terms of country.</p><p>Some data scientists would present something like this:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WfsI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WfsI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png 424w, https://substackcdn.com/image/fetch/$s_!WfsI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png 848w, https://substackcdn.com/image/fetch/$s_!WfsI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png 1272w, https://substackcdn.com/image/fetch/$s_!WfsI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WfsI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!WfsI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png 424w, https://substackcdn.com/image/fetch/$s_!WfsI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png 848w, https://substackcdn.com/image/fetch/$s_!WfsI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png 1272w, https://substackcdn.com/image/fetch/$s_!WfsI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F674c2d2b-41b6-4066-ab89-3a1f300a1a40_444x424.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">image by&nbsp;author</figcaption></figure></div><p>A couple of very low effort things you can do off the bat:</p><ul><li><p>Change the number format to separate groups of thousand (in Excel / Google Sheets, change to &#8220;Number&#8221; format <em><strong>and remove the decimal points</strong></em>)</p></li><li><p>Rank in descending order so that the information is easier to digest</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fOf6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fOf6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png 424w, https://substackcdn.com/image/fetch/$s_!fOf6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png 848w, https://substackcdn.com/image/fetch/$s_!fOf6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png 1272w, https://substackcdn.com/image/fetch/$s_!fOf6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fOf6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fOf6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png 424w, https://substackcdn.com/image/fetch/$s_!fOf6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png 848w, https://substackcdn.com/image/fetch/$s_!fOf6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png 1272w, https://substackcdn.com/image/fetch/$s_!fOf6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0a5dcd4-8b26-4719-a9ff-d8c5741e0357_446x428.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">image by&nbsp;author</figcaption></figure></div><p>Detailed numbers are harder to grok. Depending on the audience, they likely don&#8217;t need that level of detail, so you can even simplify more and express the numbers in thousands instead:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rBC_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rBC_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png 424w, https://substackcdn.com/image/fetch/$s_!rBC_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png 848w, https://substackcdn.com/image/fetch/$s_!rBC_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png 1272w, https://substackcdn.com/image/fetch/$s_!rBC_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rBC_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rBC_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png 424w, https://substackcdn.com/image/fetch/$s_!rBC_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png 848w, https://substackcdn.com/image/fetch/$s_!rBC_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png 1272w, https://substackcdn.com/image/fetch/$s_!rBC_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4b04dbd1-8d87-418b-8364-3b436f8f13e1_410x430.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">image by&nbsp;author</figcaption></figure></div><h2>Choose the right chart to get your point&nbsp;across</h2><p>The type of chart you choose can carry or bury your insight; the chart should act as a visual aid to the audience. Being able to judge what type of chart/table can communicate insights more effectively is a key ability data scientists should possess.</p><p>In the example above, likely your ultimate goal is to convey the member base breakdown by country. A pie chart will provide more direct visuals than a table of raw numbers or even a column chart and makes it easier to tell the relative size of each bucket.</p><p>Looking at the chart below, even without any narrative, I can visually deduce insights such as &#8220;more than half of our member base resides in the USA&#8221; or &#8220;Germany, UK and Italy have similar amount of users&#8221;.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!B5Fh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!B5Fh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png 424w, https://substackcdn.com/image/fetch/$s_!B5Fh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png 848w, https://substackcdn.com/image/fetch/$s_!B5Fh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png 1272w, https://substackcdn.com/image/fetch/$s_!B5Fh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!B5Fh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!B5Fh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png 424w, https://substackcdn.com/image/fetch/$s_!B5Fh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png 848w, https://substackcdn.com/image/fetch/$s_!B5Fh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png 1272w, https://substackcdn.com/image/fetch/$s_!B5Fh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48ace22e-77b2-45a2-b29b-a337680406fb_800x495.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">image by&nbsp;author</figcaption></figure></div><h2>Focus on the important numbers instead of presenting ALL of the&nbsp;numbers</h2><p>Communication is all about prioritization. You don&#8217;t have to include all the numbers just because you have them. You want to help your readers focus instead of distracting them with irrelevant information.</p><p>To continue our example, is there really a point of showing the numbers of the long tail? Most likely not. Note that the auto-generated pie chart above hid some of the countries for us already for that exact reason&#8202;&#8212;&#8202;they are too small to matter, so there&#8217;s no point cram them onto the chart.</p><p>So a better way to present the data is either to group the long tails together (with a footnote describing what is included in the grouping) or getting rid of it completely (if it&#8217;s another type of chart like a distribution and the long tail doesn&#8217;t offer any value).</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oFWs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oFWs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png 424w, https://substackcdn.com/image/fetch/$s_!oFWs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png 848w, https://substackcdn.com/image/fetch/$s_!oFWs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png 1272w, https://substackcdn.com/image/fetch/$s_!oFWs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oFWs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oFWs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png 424w, https://substackcdn.com/image/fetch/$s_!oFWs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png 848w, https://substackcdn.com/image/fetch/$s_!oFWs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png 1272w, https://substackcdn.com/image/fetch/$s_!oFWs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F411cb4b1-1e55-468a-b53f-b1893fda6bed_800x537.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">image by&nbsp;author</figcaption></figure></div><p>Keep in mind that the ability to tell stories effectively with data doesn&#8217;t come naturally without practice. But it is one of the key skills that distinguishes a great DS from good ones.</p><p>The best way to practice is by doing. When there&#8217;s an opportunity to present, seize it, do dry runs with a teammate or your manager and get feedback before and after the presentation.</p>]]></content:encoded></item><item><title><![CDATA[One Mindset Shift That Will Make You a Better Data Scientist]]></title><description><![CDATA[Actually, any good employee should adopt this mindset]]></description><link>https://www.divingintodata.com/p/one-mindset-shift-that-will-make-you-a-better-data-scientist-a015f8000ad7</link><guid isPermaLink="false">https://www.divingintodata.com/p/one-mindset-shift-that-will-make-you-a-better-data-scientist-a015f8000ad7</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Mon, 15 Apr 2024 14:34:11 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/7555149f-4564-4658-8ccf-0e752f288f15_800x533.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>After transitioning out of quant finance, my first experience as a data scientist was in consulting. Most of the feedback I received in my early days at McKinsey was not related to my code or technical skills, but rather consisted of advice like &#8220;you need to tie your work to the higher-level priority of the company/organization&#8221;, &#8220;you should add more crisp insights&#8221; or &#8220;you need to be more of a thought partner&#8221;.</p><p>At the time, being one of a few data scientists in a sea of consulting generalists, a lot of this feedback initially felt like fuzzy consulting jargon to me and I would have rather had someone critique my code. Now being a manager myself and looking back, I realize that these seemingly disconnected points are all related to one thing&#8202;&#8212;&#8202;the mindset I was missing back when I was a junior IC focused on polishing my technical skillset.</p><p>I was purely focused on execution instead of acting like an owner of the problem; however, thinking that my job was only to perform a task well was a mistake in hindsight. Over the years since then, I&#8217;ve grown convinced that an ownership mentality is one of the key things that sets high performers apart from their peers.</p><p>If you are thinking to yourself &#8220;this sounds very vague and abstract&#8221;, you are not alone. From my observation, most ICs struggle a lot with the feedback &#8220;you should adopt more ownership mindset&#8221;; which is probably why most managers prefer to give more tactical feedback about the &#8220;symptoms&#8221; mentioned above and not the core issue itself.</p><p>To make it less abstract and more actionable, I will attempt to break down the &#8220;ownership mindset&#8221; (or lack thereof) into the three most common manifestations I have observed among junior DS and talk about mindset shifts that can help you act more like an owner.</p><h2><strong>From jumping into &#8220;How we should do this?&#8221; to starting with &#8220;Why are we doing&nbsp;this?&#8221;</strong></h2><p>In order to come up with the best &#8220;How&#8221;, you need to first understand the &#8220;Why&#8221;. So instead of directly jumping into the solution, you should dwell a little longer in the problem space and truly understand why this work matters and what decisions or business outcomes it is supposed to influence.</p><p>Remember, every analytics problem started as an abstract business problem. But very likely, as a junior IC, when a project is communicated to you from your manager or stakeholder, some thinking has already been done on your behalf (i.e. your manager already did some scoping based on his/her understanding of the business ask). I would encourage you to probe for the original motivation behind the scoped-out analysis task. Understanding this will help you to make a couple of crucial decisions&#8202;&#8212;&#8202;how to prioritize this piece of work against everything else on your plate, who your key partner is, and whether you can propose a more efficient way to solve the original question.</p><p>For example, very often data scientists get what I call &#8220;<a href="https://medium.com/towards-data-science/6-essential-steps-to-building-a-great-data-culture-e529d4dcad7e">data Siri questions</a>&#8221; like &#8220;how many active users do we have?&#8221;. Once you pull that stat, you often get follow up questions (e.g. geographic breakdown of the users etc.). Before you know it, what seemed to be a 5-minute ad hoc data pull turns into 100 seemingly disconnected data questions.</p><p>The most efficient way to deal with these situations is actually to start from the &#8220;why&#8221;. When you understand that the PMs are trying to launch a new feature on IOS and need data to decide which countries to prioritize based on engagement, then you will be able to make suggestions on exactly what data cuts will be most insightful to answer that question; or, if you dig even one level deeper, you might pose the question whether engagement is even the right metric to prioritize countries. As a result of this kind of approach, you will also be viewed more like a partner who should be involved in the original scoping of the project instead of someone who only helps to pull data.</p><h2>From &#8220;What do I NEED to do?&#8221; to &#8220;What CAN I do to&nbsp;help?&#8221;</h2><p>Most junior ICs are passively waiting to be assigned to a project with a clear a scope and outlined steps so they can focus on executing. It&#8217;s not surprising because most of us grow up having someone telling us what to do, usually parents and teachers in school. Even in college, there&#8217;s a syllabus outlining the exact things we need to do to get an A. The good news is that managers usually don&#8217;t expect junior ICs to proactively scout and scope projects independently. But if you want to stand out from your peers and grow to the next level, that&#8217;s something you should be doing.</p><p>Instead of passively waiting for an assignment and only doing what is required of you, go above and beyond by paying attention in cross functional meetings and conversations you are involved in. If you tune your senses to spot gaps and needs proactively, you will notice the lack of data is frequently the blocker of progress and decision making, and that&#8217;s something the data science team is in a unique position to help with.</p><p>Spotting gaps and figuring out a solution to plug the gap is one of the crucial skills you need to possess if you want to be a tech lead or manager one day. Building up this skill requires getting rid of the attitude of &#8220;this is not my problem&#8221; or &#8220;I have no dog in this fight&#8221; and instead acting as a true stakeholder in the company&#8217;s success. This may sometime require you to step out of your comfort zone and extend your current skillset, but that&#8217;s exactly how you grow your skillset proactively.</p><h2>From &#8220;This is the number&#8221; to &#8220;This is the takeaway&#8221;</h2><p>Many ICs struggle to distill succinct insights from analyses they do; raw data dumped in a spreadsheet or document is rarely a satisfactory deliverable. To quote from a leader I respect a lot:</p><blockquote><p>Everything we do is for the benefit of others. If others are not understanding or seeing the benefit of the work we do, our work is incomplete. Hence, communicating our work is as important as writing code and building the models and products to get the job done.</p></blockquote><p>A data scientist&#8217;s job is to &#8220;drive business decisions through analytical insights&#8221; not &#8220;to pull the numbers and leave it to others to distill the insights&#8221;. Not to mention, the &#8220;pulling numbers&#8221; piece is the easiest part for AI to replace.</p><p>Imagine you are the reader/consumer of the analysis instead of the one conducting it. Without any in-the-weeds knowledge about the analysis, can you quickly distill what it&#8217;s trying to do and what the takeaway is? Is it clear why engineers, PMs etc. should care about this work? If not, you have not succeeded at communicating insights through your analysis.</p><h2>Conclusion</h2><p>Remember you are the sole owner of your career and every project that powers it. So taking the initiative to make it more successful is your responsibility and yours alone. Adopting this mindset definitely takes time and practice; but once you master it, it will benefit not only your career, but also your personal life once you start applying the same mindset to everything you do.</p>]]></content:encoded></item><item><title><![CDATA[How to Make Yourself More Layoff-Proof as a Data Scientist]]></title><description><![CDATA[What tech layoffs taught me in 2023]]></description><link>https://www.divingintodata.com/p/how-to-make-yourself-more-layoff-proof-as-a-data-scientist-0f92d75ac9f0</link><guid isPermaLink="false">https://www.divingintodata.com/p/how-to-make-yourself-more-layoff-proof-as-a-data-scientist-0f92d75ac9f0</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Wed, 17 Jan 2024 14:41:37 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/84e3a738-1929-4860-b44c-001cc165e9a3_800x533.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2076!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2076!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!2076!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!2076!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!2076!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2076!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2076!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!2076!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!2076!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!2076!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F414d76a0-aeaa-432c-aff8-5387118c4457_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@magnetme?utm_source=medium&amp;utm_medium=referral">Magnet.me</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>What tech layoffs taught me in&nbsp;2023</h4><p>Just when we thought that life is going back to normal after the pandemic outbreak, a wave of tech layoff caught all of us tech workers by surprise. In 2023, more than 240K tech workers were laid off across more than 1,000 companies; the most recent layoff at Google and Discord indicates that it&#8217;s continuing in 2024. Both my current company and my pervious company had more than one round of layoff during the last year, with some of my friends impacted.</p><p>Even though this is the not the first time layoffs happened in the industry, the cuts were deeper than in the past. Past layoffs had often focused on functions such as Sales and Recruiting that were immediately affected by slowdowns in the business and hiring. This time, however, the impact was across the board including technical functions like software engineers and data scientists.</p><p>It&#8217;s human nature wanting to distill patterns and learnings from things, especially as a data scientist (after all, that&#8217;s one of the key aspects of your job). So like a good data scientist, I sat down at the beginning of the year to conduct a &#8220;post mortem&#8221;.</p><p><strong>While it&#8217;s hard to predict beforehand which team/who will be impacted by a layoff (unless you are one of the decision makers), I think as a data scientist, there are things you can do to make yourself more &#8220;layoff-proof&#8221;. </strong>And I will share a few here:</p><h4>Stay up to date about the company and org&#8217;s strategic direction</h4><p><strong>Why? </strong>When it comes to layoffs, the decision about which teams/individuals to let go is usually highly dependent on the company&#8217;s strategy and prioritization at the moment. For example, if a company for several quarters has been trying to make dashboard building and metric reporting more self-serviceable, when a round of layoff comes, any team that focuses on dashboard making and metric reporting will be considered.</p><p>Remember, things don&#8217;t happen overnight; there are signs that you can take as a heads up. Priority shifts usually can&#8217;t be implemented overnight or even in one quarter. In the example above, before letting go of the whole team of DS who are building dashboards/reporting metrics, the company needs to make sure the self-serviceable vision it has in mind is actually happening. So usually it will take a couple of months or quarters to build tools and set up workshops to democratize the work.</p><p>Hence it&#8217;s important to NOT just keep your head down and focus on things in your scope, but to understand what direction the company is heading in terms of its strategy and prioritization to deduce whether your work is, and will remain, important for the company. If the answer is no, you should really think long and hard about why you are still working on it and what you should be working on instead and have a discussion with your manager about it.</p><p><strong>How? </strong>There are countless ways to stay close to strategic decisions and priorities. If you have a good manager, they should be sharing organizational context with you and the team; if they don&#8217;t do this, ask them proactively to share relevant strategic context. In situations where your manager is not capable of doing so (which is more common than you think), there are other options available; for example, All Hands meetings at all levels (company wide or org wide) are one of the best ways to get this information so you should definition try to attend those. Other ways (depending on your company&#8217;s culture) include asking in relevant Slack channels (e.g. leadership Q&amp;A), talking to other managers in the org or your skip-level manager (use those skip-level 1v1s wisely instead of just treating them as a coffee chat!).</p><h4>Make sure your scope is not too narrow and you work with different teams</h4><p><strong>Why? </strong>Continuing with the example above, if you are on the team that ONLY works on dashboard building and metric reporting, when those are democratized, your team&#8217;s value is hard to justify. But if the scope of the team is broader, it can adjust and shift its focus to one of the remaining areas it covers. At the very least, it will give the decision makers pause before completely cutting the team.</p><p><strong>How? </strong>If you are in a position to influence the team&#8217;s scope and roadmap, make sure the team is not ONLY focusing on one thing, especially not on things that are easy to democratize or automate. Note that I&#8217;m not saying you should spread the team or yourself thin to work on EVERYTHING; focus is still important. But unless you only have one or two people on the team, the team should be able to take on a slightly broader scope and get more opportunities to pivot in dire situations. If you are not able to influence the team&#8217;s scope and it is already very narrow, you can branch out and establish relationships with other teams to get opportunities, either officially or unofficially, to work on things that might be slightly outside of your team&#8217;s immediate scope.</p><h4>Consider staying close to the core&nbsp;business</h4><p><strong>Why? </strong>When it comes to a business, there is usually the core business (think about what a company is known for, e.g. Search for Google, Rides and Eats for Uber) and then there are the shiny new things or bets. Take Uber as an example; it started with Uber Rides then expanded into Uber Eats, which is a fairly direct extension of the core business and closely intertwined (e.g. shared supply as many Uber drivers are active on both platforms). Both businesses achieved relevant market share and became core revenue and profit drivers for the company</p><p>Uber also launched a number of new business lines that were much more experimental such as self-driving, freight, Uber Works (a staffing marketplace) and <a href="https://www.uber.com/blog/new-york-city/uber-copter/">Uber Copter</a> (helicopter rides). When those initiatives were announced, I have no doubt they were painted as top priorities and important bets for the company. However, compared to Uber Eats, which quickly became a core pillar, 1) none of these new bets contributed meaningfully to the top or bottom line within the first few years of their existence, and 2) they were much less intertwined with the core business and those easier to spin off or wind down. At the end of the day, businesses have stakeholders to answer to (e.g. Wall Street in the case of public companies), and when times get tough, non-core activities are the first ones to be scaled back; so it&#8217;s not surprising that over time, Uber ultimately shut down or scaled back most of these bets, while Rides and Eats jobs were less affected besides a temporary COVID cut-back.</p><p><strong>How? </strong>This one is a personal choice; of course it&#8217;s exciting to work on new initiatives and the sexy new things, and these can sometimes offer career-defining growth opportunities, but if you want to increase your job security, it can be beneficial to stay close to the cash cow of your company.</p><h4>Make sure you advertise for your own&nbsp;work</h4><p><strong>Why? </strong>In most layoff situations, the decision and information is kept confidential and only a few key individuals are in the loop. So maybe your manager is blindsided, maybe your skip-level manager as well. But most likely <em>SOMEONE</em> in the command chain <em>IS</em> helping with the decision making. Imagine you are that someone and you look at a team in your org and find yourself asking, &#8220;hmm.. what does this person do again exactly?&#8221;; I bet you will put them on the top of the cut list.</p><p><strong>How? </strong>I know this doesn&#8217;t come naturally to most people, especially people in more technical roles. But I would really suggest that you take any opportunity you get to showcase your work to leadership. Every organization provides different opportunities of doing so&#8202;&#8212;&#8202;special topics during All Hands, deep dive on certain topics etc. Seize those opportunities when you can to make people remember your work and how your work is contributing to the company&#8217;s top priorities.</p><p>But remember, while everything I mentioned above aims to reduce the chance of you getting impacted by a layoff, NOTHING can guarantee that you won&#8217;t be impacted. Even if you prepare as much as possible in advance and tried everything to de-risk yourself, life happens and you might find yourself out of a job. So I have one last piece of advice I would like to give:</p><h4>Never turn down an opportunity</h4><p><strong>Why? </strong>You may think you love your job, you love the company and there&#8217;s no chance you will look for a new job. But sometimes it&#8217;s not 100% within your control; when a layoff hits out of the blue, you might look back and wish you had entertained the recruiter outreach a while back.</p><p>Plus, even though you don&#8217;t want to always go through the whole painful recruiting process, it never hurts to at least take the intro calls to understand what opportunities are out there and how much your current skillset is valued in the market.</p><p><strong>How? </strong>Pretty simple, when recruiters reach out on LinkedIn and the opportunity looks interesting, just reply to them, take an intro call and see what they say. It also doesn&#8217;t hurt to keep in touch with folks in your network so you hear about new opportunities as they become available.</p><p>I hope these lessons I learned from the last few years of layoffs can help you in some way. If you are interested in learning more about data science related topics, check out some of my other articles.</p><p><strong><a href="https://towardsdatascience.com/soft-skills-is-what-sets-you-apart-in-your-data-science-interviews-d927872f07e6" title="https://towardsdatascience.com/soft-skills-is-what-sets-you-apart-in-your-data-science-interviews-d927872f07e6">Soft Skills Is What Sets You Apart in Your Data Science Interviews</a></strong><a href="https://towardsdatascience.com/soft-skills-is-what-sets-you-apart-in-your-data-science-interviews-d927872f07e6" title="https://towardsdatascience.com/soft-skills-is-what-sets-you-apart-in-your-data-science-interviews-d927872f07e6"><br></a><em><a href="https://towardsdatascience.com/soft-skills-is-what-sets-you-apart-in-your-data-science-interviews-d927872f07e6" title="https://towardsdatascience.com/soft-skills-is-what-sets-you-apart-in-your-data-science-interviews-d927872f07e6">How to up-level your structured problem solving skills and communication skills</a></em><a href="https://towardsdatascience.com/soft-skills-is-what-sets-you-apart-in-your-data-science-interviews-d927872f07e6" title="https://towardsdatascience.com/soft-skills-is-what-sets-you-apart-in-your-data-science-interviews-d927872f07e6">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1">5 Mistakes I Wish I Had Avoided in My Data Science Career</a></strong><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1"><br></a><em><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1">I learned these lessons the hard way so you don&#8217;t have to</a></em><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba" title="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba">5 Lessons McKinsey Taught Me That Will Make You a Better Data Scientist</a></strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba" title="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba"><br>towardsdatascience.com</a></p>]]></content:encoded></item><item><title><![CDATA[Soft Skills Is What Sets You Apart in Your Data Science Interviews]]></title><description><![CDATA[How to up-level your structured problem solving skills and communication skills]]></description><link>https://www.divingintodata.com/p/soft-skills-is-what-sets-you-apart-in-your-data-science-interviews-d927872f07e6</link><guid isPermaLink="false">https://www.divingintodata.com/p/soft-skills-is-what-sets-you-apart-in-your-data-science-interviews-d927872f07e6</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Sun, 24 Dec 2023 16:54:35 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/be00a922-ab61-4812-ac10-5d07c5875f95_800x533.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_e0d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_e0d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!_e0d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!_e0d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!_e0d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_e0d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_e0d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!_e0d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!_e0d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!_e0d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa81b0643-3fe0-43f6-b5cd-b65ec530f80a_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@jasongoodman_youxventures?utm_source=medium&amp;utm_medium=referral">Jason Goodman</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>How to up-level your structured problem solving skills and communication skills</h4><p>So you have brushed up on ML concepts, practiced Python and SQL for months, you think you are done with interview prep. But you might be missing the most important and hardest-to-prepare-for part of the interview&#8202;&#8212;&#8202;problem-solving skills. And based on my experience of interviewing others as well as going through interviews myself, I can confidently tell you, this part is more than often what makes or breaks your interviews.</p><p>This is the fifth article for the data science interview guide, in previous articles, I have mostly touched on technical concepts that commonly show up in interviews. For reference, the previous articles are listed below:</p><ol><li><p><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">Part I. Distribution</a></p></li><li><p><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5">Part II. Probability</a></p></li><li><p><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">Part III. Basic Supervised Learning Models</a></p></li><li><p><a href="https://medium.com/towards-data-science/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777">Part IV. Random Forest</a></p></li></ol><p>But contrary to most people, I don&#8217;t believe technical portion of the interview is the most difficult part; I think the soft skills are the hardest to grasp, learn or teach when it comes to interview prep. As a manager, that&#8217;s the most important thing I look for when I&#8217;m hiring for my team. Because let&#8217;s face it, technical skills are easy to brush up on and learn (not only for humans, but also for machines&#8202;&#8212;&#8202;ChatGPT can implement code pretty well these days), but the ability to understand, solve business problems and communicate effectively is not something you can build up in a day or two.</p><p>This portion of the interview is also the most unpredictable as there&#8217;s less of a template interviewers follow; instead, a lot of times the interview would go in the direction you take it. Before you get discouraged, I have summarized a couple of ways the soft skills are tested in interviews and tips that can help you prepare for this part of the interview.</p><p>There are in general two modules of the interview that fit into the soft-skill category&#8202;&#8212;&#8202;case study (which is to test your ability to carry out structured problem solving) and behavioral interview (which tests your communication and again, structured thinking ability).</p><h4>Ability to translate business problems into data science problems&#8202;&#8212;&#8202;case&nbsp;study</h4><p>This portion of the interview is usually a case that&#8217;s very similar to the consulting case interview, with a slight data science flavor added on top. It&#8217;s usually a RCA (root cause analysis) type of question for a metric movement or business decision. Examples are questions like &#8220;our daily active user decreased by 10% in the past week, how to debug what the cause is?&#8221; OR &#8220;we want to put electric chargers for our fleet around SF, how should we make the decision about where to put them?&#8221;</p><p><strong>What is it testing? </strong>These questions are testing your ability to come up with a problem-solving framework and your ability to explain the framework. It&#8217;s extremely important to have one in your day-to-day work as a data scientist so you can make sure that your solution is thorough, comprehensive and can easily make sense to others.</p><p><strong>What do interviewers want to see?</strong> Keep in mind there&#8217;s no right or wrong answer for this kind of questions, only organized or unorganized ones. The interviewers really want to see how you organize your thoughts and walk people through the problem-solving process. So make sure you verbalize all the assumptions you are making, all the details you are exploring and ask all the clarifying questions you might need to ask.</p><p>It&#8217;s also important to show your ability to take feedback and work with others. Sometimes your interviewer will give you some guidance or hint about where they want you to take the case</p><p><strong>How to prepare? </strong>The best way to prepare for a case interview is practice, practice and practice. I suggest getting yourself one or two books related to consulting case interviews or PM case interviews, books like <em><strong>cracking the PM interview </strong></em>and <em><strong>case interview secrets </strong></em>come to mind as recommendations. But just reading through the examples the trying to force memorize the frameworks in the books won&#8217;t help as much. The most effective thing to do in my opinion is to take sample cases in the book and mock the case with a friend. Very likely your answer will not be identical as the &#8220;sample solution&#8221;, but ask your friend to check &#8220;is my thinking process easy to follow?&#8221;, &#8220;was it clear how to arrived at each step in the process?&#8221;.</p><h4>Cultural fit&#8202;&#8212;&#8202;communication skills and working&nbsp;style</h4><p><strong>What is it testing? </strong>Interviewers want to see that you have the qualities that the team is looking for aside from hard skills. And if the interviewer is a hiring manager, they would want to see that they can effectively communicate and work with you.</p><p><strong>What do interviewers want to see?</strong> Depending on the team, interviewers/hiring managers may care about/want to test different things. But in most cases, I have seen interview questions aiming to prob about your ability to learn, to adapt to changing scope, to collaborate with others and to manage stakeholders.</p><p><strong>How to prepare? </strong>A couple things can be helpful for this portion of the interview. 1) Make sure you are super familiar with any project you have on your resume, because the interviewer will likely ask about details of different projects on your resume to get a sense of your working style. And 2) make sure you practice the top-down communication style and follow the <a href="https://www.indeed.com/career-advice/interviewing/how-to-use-the-star-interview-response-technique">STAR framework</a> when talking about your projects.</p><p>Finally, always remember, interviews are a two-way assessment; you are gauging the company&#8217;s fit for you as much as they you, and the behavioral portion of the interview is the best way to test whether YOU like THEM. The people who interview you will likely be the group of people you work really closely with; so if you feel like the &#8220;vibe is off&#8221; (which trust me, happens more than you would expect), maybe it&#8217;s a good signal you should take into the consideration when deciding between offers.</p><p><strong>Preparing for a data science interview and want to read more about interview preparation? Here are some articles you might enjoy!</strong></p><p><strong><a href="https://towardsdatascience.com/acing-the-ml-portion-of-mckinsey-data-science-interview-d816862733fc" title="https://towardsdatascience.com/acing-the-ml-portion-of-mckinsey-data-science-interview-d816862733fc">Acing the ML Portion of McKinsey Data Science Interview</a></strong><a href="https://towardsdatascience.com/acing-the-ml-portion-of-mckinsey-data-science-interview-d816862733fc" title="https://towardsdatascience.com/acing-the-ml-portion-of-mckinsey-data-science-interview-d816862733fc"><br></a><em><a href="https://towardsdatascience.com/acing-the-ml-portion-of-mckinsey-data-science-interview-d816862733fc" title="https://towardsdatascience.com/acing-the-ml-portion-of-mckinsey-data-science-interview-d816862733fc">A detailed guide for the what, why, and how of the ML part of consulting interviews</a></em><a href="https://towardsdatascience.com/acing-the-ml-portion-of-mckinsey-data-science-interview-d816862733fc" title="https://towardsdatascience.com/acing-the-ml-portion-of-mckinsey-data-science-interview-d816862733fc">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">Productivity Tips for Data Scientists</a></strong><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27"><br></a><em><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">How to work better, smarter but not necessarily harder as a data scientist</a></em><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">towardsdatascience.com</a></p>]]></content:encoded></item><item><title><![CDATA[Acing the ML Portion of McKinsey Data Science Interview]]></title><description><![CDATA[A detailed guide for the what, why, and how of the ML part of consulting interviews]]></description><link>https://www.divingintodata.com/p/acing-the-ml-portion-of-mckinsey-data-science-interview-d816862733fc</link><guid isPermaLink="false">https://www.divingintodata.com/p/acing-the-ml-portion-of-mckinsey-data-science-interview-d816862733fc</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Sat, 15 Oct 2022 01:19:51 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/d858f9c5-2a77-4968-a047-2494b95bdf43_800x533.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!V2Dn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!V2Dn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!V2Dn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!V2Dn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!V2Dn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!V2Dn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!V2Dn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!V2Dn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!V2Dn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!V2Dn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf995a93-dd9c-40e4-818a-da60ab93309b_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@austinchan?utm_source=medium&amp;utm_medium=referral">Austin Chan</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>A detailed guide for the what, why, and how of the ML part of consulting interviews</h4><p>After I wrote about my <strong><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">decision to join McKinsey</a></strong> as well as the <strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba">most valuable lessons I learned there about data science</a></strong>, I have also shared the reasons for my <strong><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5">decision to eventually leave the firm</a></strong>, I have gotten a lot of messages from readers of those articles asking about HOW to prepare for data science interviews for consulting firms. So I thought I would address all of these questions with several articles talking about my consulting interview process and my preparation for it.</p><p>My interview with McKinsey was 3 years ago. As the practice evolve, some of the interview processes might have changed a little bit. But the general idea should remain the same and the preparation process shouldn&#8217;t differ that much.</p><p>So let&#8217;s talk about what consulting firms usually look for in candidates and how to prepare for the interviews. There are different sections of the interview&#8202;&#8212;&#8202;general topics are ML knowledge, case study and cultural fit. I will address them in separate articles to take care of everyone&#8217;s attention span. You might be wondering how come SQL and Python (which are commonly tested by other companies for DS interviews) are no on the list. I still strongly recommend knowing at least the basics about SQL and Python before your interview in case the interview format has changed since mine. But McK generally believes that the super technical skills (like coding) can be learned on the job as long as you have some coding experience. What they rather spend time testing are your &#8220;soft skills&#8221; like ability to learn, structured problem solving and the analytical way of thinking.</p><h4><strong>Why It Is&nbsp;Tested</strong></h4><p>In the tech world, usually Machine Learning Engineers (MLE) are the ones that build models and data scientist work mostly on analyses and insight generation. But as a DS consultant, you are viewed as a &#8220;full stack&#8221; DS, meaning you need to be able to cover things from data pipelining to ML modeling, all the way to insights generation and &#8220;storytelling&#8221; with data.</p><p>So consulting firms want to make sure that you have enough ML knowledge to work on or even lead modeling projects.</p><h4><strong>How It Is&nbsp;Tested</strong></h4><p>Like I mentioned in my interview <a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">series about ML</a>, there are generally two ways to test ML knowledge I haven seen&#8202;&#8212;&#8202;resume based or theory based.</p><p>Most consulting interviews are resume based from what I have heard (and experienced). That means two things&#8202;&#8212;&#8202;you need to have some modeling experience and you need to know how to talk about it.</p><h4><strong>How to Prepare for&nbsp;It</strong></h4><p>In order to be able to talk about modeling experience, you need to, well, have some modeling experience. You can get modeling experience through your current work by getting on modeling-intensive projects; if that&#8217;s not an option, you can always utilize websites like <a href="https://www.kaggle.com/">Kaggle</a> to get some modeling experience through side projects.</p><p>When it come to learning the <strong>basic theoretical</strong> ML knowledge. The best book to use in my opinion is <em><strong>The Analytics Edge </strong></em>written by MIT operations research professor Dimitris Bertsimas.</p><p>It covers things from basic concepts like linear regression, CART model all the way to more complicated models like the random forest. It doesn&#8217;t cover the super advanced models like neural networks, reinforcement learning etc. But based on my experience, I rarely used those models in my consulting life, so I will be surprised if consulting interviews put a lot of weight on those.</p><p>If I were to re-read this book from scratch for an interview prep, I would start from Chapter VIII, where each model is dissected and explained in details, before moving onto some of the previous chapters which demonstrate case studies of the models.</p><p>If you lack model-building experience outside of academia and Kaggle environment, you might need to also get some knowledge about the <strong>operation side of things</strong> of ML. I personally found Educative.io&#8217;s <a href="https://www.educative.io/module/grokking-ml-interview">Grokking ML Interview</a> course very helpful in that regard. It gets into the details about <strong>operational aspects</strong> of ML like how to choose metrics and online/offline model evaluation.</p><p>If you are relatively familiar with machine learning foundations and just need a <strong>brush up</strong> on it before the interview, I have used Springboard&#8217;s <a href="https://www.springboard.com/blog/data-science/machine-learning-interview-questions/">51 Essential Machine Learning Interview Questions and Answers</a> as a <strong>quick refresher</strong>.</p><h4>Final Advice</h4><p>Like I mentioned in my previous articles, one thing consulting firms REALLY care about is your ability to communicate, especially when it comes to complicated analytical concepts. Data scientist in consulting firms more than often need to work with clients who are NOT from an analytics background; so it&#8217;s essential that you showcase your ability to do just that. When explaining ML concepts and models, try to use as much plain English as possible instead of obscure jargons and make sure you can really explain them without getting into the details of the calculation; because you will encounter clients who ask you to &#8220;explain how clustering works without the math&#8221;.</p><h4>Want to Read More About DS Interviews? Here Are Some Recommendations:</h4><p><strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">Concepts You Have to Know for Data Science Interviews&#8202;&#8212;&#8202;Part I: Distribution</a></strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50"><br></a><em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">Most frequently asked questions in data scientist interviews</a></em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5">Concepts You Have to Know for Data Science Interviews&#8202;&#8212;&#8202;Part II. Probability</a></strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5"><br></a><em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5">Most frequently asked questions in data scientist interviews</a></em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">Concepts You Have to Know for Data Science Interviews&#8202;&#8212;&#8202;Part III. Basic Supervised Learning Models</a></strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57"><br></a><em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">Most frequently asked questions in data scientist interviews for modeling</a></em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777">Concepts You Have to Know for Data Science Interviews&#8202;&#8212;&#8202;Part IV. Random Forest</a></strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777"><br></a><em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777">Most frequently asked questions in data scientist interviews</a></em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777">towardsdatascience.com</a></p>]]></content:encoded></item><item><title><![CDATA[Concepts You Have to Know for Data Science Interviews — Part IV. Random Forest]]></title><description><![CDATA[Most frequently asked questions in data scientist interviews]]></description><link>https://www.divingintodata.com/p/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777</link><guid isPermaLink="false">https://www.divingintodata.com/p/concepts-you-have-to-know-for-data-science-interviews-part-iv-random-forest-5c125e4b5777</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Mon, 18 Jul 2022 15:16:43 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/fa5b2692-4b3a-426c-9049-003dc266452f_800x533.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8qdt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8qdt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8qdt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8qdt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8qdt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8qdt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8qdt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8qdt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8qdt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8qdt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d2e0dfb-6e43-46c8-a9d3-df8de53f9b31_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@iriser?utm_source=medium&amp;utm_medium=referral">Irina Iriser</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>Most frequently asked questions in data scientist interviews</h4><p>This is the 4th article in the interview series. I&#8217;m hoping this series will function as a <strong>centralized starting point for aspiring data scientists</strong> in terms of interview preparation. So far, we have talked about the following concepts:</p><ol><li><p><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">Part I: Distribution</a></p></li><li><p><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5">Part II. Probability</a></p></li><li><p><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">Part III. Basic Supervised Learning Models</a></p></li></ol><p>In this article, I want to continue the journey down the ML lane and talk about <strong>advanced supervised learning models</strong>. More specifically, I will be focusing on <strong>Random Forest</strong> since it is probably the most commonly used and commonly asked in DS interviews among the more complicated/advanced ML models. In fact, I was asked about Random Forest in my interview with McKinsey as a data scientist; knowing how to explain the algorithm on the high level and intuitively will definitely help you stand out from the rest of the interviewees if there&#8217;s a modeling component in your interviews.</p><h4>What is Random&nbsp;Forest</h4><p>We talked about <a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57?source=your_stories_page-------------------------------------">CART model</a> in the previous article. As a reminder, CART stands for Classification and Regression Tree; and Random Forest, well, is a forest. The descriptive naming convention reveals the clear relationship between the two&#8202;&#8212;&#8202;Random Forest consists multiple decision trees.</p><p>Random forest is an ensemble method (ensemble methods is a category of ML methods that combines multiple learning algorithms to obtain better results, achieving that 1+1&gt;2 effect), utilizing multiple trees to avoid overfitting (<strong>decision trees are prone to overfit</strong>). Imagine each tree in the forest casting a vote and having a say in the final decision of the whole model; the final decision/prediction is achieved through the forest taking the majority vote.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9EsH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9EsH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png 424w, https://substackcdn.com/image/fetch/$s_!9EsH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png 848w, https://substackcdn.com/image/fetch/$s_!9EsH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png 1272w, https://substackcdn.com/image/fetch/$s_!9EsH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9EsH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9EsH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png 424w, https://substackcdn.com/image/fetch/$s_!9EsH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png 848w, https://substackcdn.com/image/fetch/$s_!9EsH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png 1272w, https://substackcdn.com/image/fetch/$s_!9EsH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F072c17d1-f879-4a17-a760-1430c6c896d4_800x727.png 1456w" sizes="100vw"></picture><div></div></div></a><figcaption class="image-caption">Image by&nbsp;Author</figcaption></figure></div><p>In order to avoid overfitting, the trees shouldn&#8217;t be correlated. How does the model avoid trees being correlated? You might ask. Random Forest makes sure of that by doing two things&#8202;&#8212;<strong>&#8202;when building each tree, randomly selecting a subset of training samples, and randomly selecting a subset of features</strong>. This &#8220;random subsetting&#8221; is often used in ensemble models and is commonly referred as <strong>&#8220;bagging&#8221;</strong> and is often used to <strong>reduce variance in trained models</strong>.</p><p>A little digression here since we are talking about &#8220;bagging&#8221;. &#8220;Boosting&#8221; is another method commonly used in ML models; in fact, Gradient Boosted Decision Trees is a high-performing cousin in the decision-tree family. I&#8217;m mentioning this because <strong>&#8220;Bagging&#8221; and &#8220;boosting&#8221; are often compared</strong>; I was asked in different interviews about the <strong>differences between them</strong>. What&#8217;s special about &#8220;boosting&#8221; is it can improve weak learners. Different from &#8220;bagging&#8221;, which builds trees in parallel and separately, the &#8220;boosting&#8221; process builds trees sequentially; so each tree can &#8220;learn&#8221; from the previous one&#8217;s mistakes and improve.</p><p>There are a lot of detailed explanations about &#8220;boosting&#8221; on Medium so I won&#8217;t go into any technicalities here. But it&#8217;s worth noting that due to this &#8220;sequential&#8221; nature, &#8220;boosting&#8221; algorithms train more slowly and are more prone to overfitting comparing to their &#8220;bagging&#8221; counterparts.</p><p>Back to Random Forest. If you remember what we talked about in the <a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">previous article</a>, CART&#8217;s biggest advantage is arguably its interpretability. Even though random forest is usually an <strong>improvement from CART in terms of performance</strong> (especially on test sets since it is less prone to overfit), it sacrifices interpretability to some extend in the process. As the number of trees grows, it will be harder and harder to plot each tree and see the features they used in splitting the data; thus it becomes harder to understand exactly how each tree was built. But it&#8217;s still possible to generate a plot for feature importance (across the trees in the forest). Most random forest packages come with such a plot that&#8217;s easy to access.</p><h4>How are these tested and what to watch out&nbsp;for</h4><p>I have already went through in the <a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">previous post</a> about how ML concepts are usually tested and the most important things to remember when you answer ML modeling questions. Click to that post (linked below) if you want to read more.</p><p><strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">Concepts You Have to Know for Data Science Interviews&#8202;&#8212;&#8202;Part III. Basic Supervised Learning Models</a></strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57"><br></a><em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">Most frequently asked questions in data scientist interviews for modeling</a></em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57">towardsdatascience.com</a></p><p>The only thing to add about random forest, or any complicated ML algorithms in general, is it&#8217;s important to be able to explain the algorithms in layman terms. What I mean by that is the interviewers usually are interested in testing your <strong>understanding of the algorithm</strong>, but are <strong>not looking for a memorized version of the wikipedia page of it</strong>.</p><p>Little tip at the end, the best audience for practicing your intuitive explanation of ML algorithms is friends who are not in the analytics field; you can quickly tell if your description of the algorithm is making any sense.</p><p><strong>Interested in reading more about data science career tips? I might have something for you:</strong></p><p><strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba" title="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba">5 Lessons McKinsey Taught Me That Will Make You a Better Data Scientist</a></strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba" title="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba"><br>towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5">Why I Left McKinsey as a Data Scientist</a></strong><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5"><br></a><em><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5">Things you should consider before starting as a data science consultant</a></em><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47">The Ultimate Interview Prep Guide for Data Scientists and Data Analysts</a></strong><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47"><br></a><em><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47">What helped me interview successfully with FANG as well as unicorns</a></em><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47">towardsdatascience.com</a></p>]]></content:encoded></item><item><title><![CDATA[Concepts You Have to Know for Data Science Interviews — Part III. Basic Supervised Learning Models]]></title><description><![CDATA[Most frequently asked questions in data scientist interviews for modeling]]></description><link>https://www.divingintodata.com/p/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57</link><guid isPermaLink="false">https://www.divingintodata.com/p/concepts-you-have-to-know-for-data-science-interviews-part-iii-basic-supervised-learning-models-5115673f57</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Fri, 24 Jun 2022 15:11:33 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/a58fbdb6-7417-47ac-abec-d79642cd8cd8_800x467.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KL7x!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KL7x!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KL7x!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KL7x!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KL7x!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KL7x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KL7x!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KL7x!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KL7x!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KL7x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2162163-90ca-4fd2-8154-5c775b7d2ae2_800x467.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@jleeems?utm_source=medium&amp;utm_medium=referral">Jason Leem</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>Most frequently asked questions in data scientist interviews for&nbsp;modeling</h4><p>This is the 3rd article in my interview series. I&#8217;m hoping this series will function as a centralized starting point for aspiring data scientists in terms of interview preparation. The first two articles are here if you are interested:</p><ol><li><p><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">Concepts You Have to Know for Data Science Interviews&#8202;&#8212;&#8202;Part I: Distribution</a></p></li><li><p><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5">Concepts You Have to Know for Data Science Interviews&#8202;&#8212;&#8202;Part II. Probability</a></p></li></ol><p>In this article, I want to talk about arguably the most interesting part for a lot of aspiring data scientists&#8202;&#8212;&#8202;ML. ML is a super complicated topic, so I won&#8217;t even attempt to cover EVERYTHING about it in one post. In fact, I will separate the ML part into several posts and talk about different aspects of ML and different categories of models in each of them. In this post, I will focus only on the <strong>basic supervised learning models</strong>.</p><p>There are countless types of models out there when it comes to ML, from simple models like <strong>linear and logistic regression</strong> all the way to convoluted ones like <strong>deep learning and reinforcement learning</strong>. I will post by post cover the important ones that will show up in interviews.</p><p>First things first, it&#8217;s important to know that unless you are targeting super modeling-intensive roles such as machine learning engineer (MLE) or research scientists, <strong>ML usually is NOT the biggest focus for interviews</strong>. The interviewers usually ONLY want to see that you have a basic level of understanding of different models and modeling techniques; you WON&#8217;T be asked to whiteboard proofs or defend the choice for the number of layers in a neural network you built.</p><h4><strong>In general, ML models/techniques can be separated into two categories&#8212; Supervised vs. unsupervised learning.</strong></h4><p>The difference between supervised and unsupervised learning is one of the most fundamental concepts data scientists should be familiar with. <strong>Supervised learning</strong> is the general category of all the ML techniques that <strong>utilizes labeled datasets, </strong>whereas <strong>unsupervised learning</strong> techniques work with <strong>unlabeled datasets</strong>. A concrete example: If you want to build a model to distinguish dogs from cats, you would be using supervised learning because there should be a clearly correct answer to each data point whether it is a dog or a cat (the categories you want to predict). Whereas if you just want to cluster house pets together, you would use unsupervised learning. There&#8217;s not a clearly &#8220;correct&#8221; answer for each data point&#8217;s label&#8202;&#8212;&#8202;a small dog may be grouped into the same group as other small cats instead of big dogs. This article will focus on most commonly tested supervised learning techniques; I will go into more <strong>advanced supervised learning models</strong> and <strong>unsupervised learning</strong> in the following articles.</p><h4><strong>Linear regression</strong></h4><p>This is arguably the most rudimentary model used in ML (if you think it &#8220;counts&#8221; as ML). Linear regression is the most common method of <strong>supervised learning</strong> that&#8217;s most suitable to predict a continuous variable (as opposed to categorical variables).</p><p>Linear regression essentially is trying to fit the best line through all training data points. The definition of &#8220;the best line&#8221; varies depending on the choice of the loss function. The most commonly used one is &#8220;ordinary least squares (OLS)&#8221;, which finds the best line by minimizing the sum of squared errors. The error is defined as <strong>r&#7522; = y&#7522; -y^</strong>, where <strong>y^</strong> is the predicted value by the model and <strong>y&#7522; </strong>is the observed (true) value. The value you want to minimize is the sum of the aforementioned error terms</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OGLc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OGLc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png 424w, https://substackcdn.com/image/fetch/$s_!OGLc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png 848w, https://substackcdn.com/image/fetch/$s_!OGLc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png 1272w, https://substackcdn.com/image/fetch/$s_!OGLc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OGLc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!OGLc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png 424w, https://substackcdn.com/image/fetch/$s_!OGLc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png 848w, https://substackcdn.com/image/fetch/$s_!OGLc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png 1272w, https://substackcdn.com/image/fetch/$s_!OGLc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b8b2ad2-0770-4ab3-b17e-ae2d021f50c6_128x64.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Sum of squared error&nbsp;terms</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!abBg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!abBg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png 424w, https://substackcdn.com/image/fetch/$s_!abBg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png 848w, https://substackcdn.com/image/fetch/$s_!abBg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png 1272w, https://substackcdn.com/image/fetch/$s_!abBg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!abBg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!abBg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png 424w, https://substackcdn.com/image/fetch/$s_!abBg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png 848w, https://substackcdn.com/image/fetch/$s_!abBg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png 1272w, https://substackcdn.com/image/fetch/$s_!abBg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F593226a2-555a-49cc-9ce6-4efd3d3fe1be_800x666.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Linear Regression Illustration from <a href="https://en.wikipedia.org/wiki/Regression_analysis#/media/File:Normdist_regression.png">wikipedia</a></figcaption></figure></div><h4>Logistic regression</h4><p>The biggest difference between logistic regression and linear regression is that while linear regression models a continuous variable, logistic regression models the <strong>probability of an event </strong>(an event with binary outcomes), hence the output is <strong>between 0 and 1</strong>.</p><p>It&#8217;s worth noting that, even though commonly used for classification, the logistic regression itself is NOT a classifier. But it can be turned into a classifier when layered with a threshold on top (usually 0.5). With a cutoff determined, the probability can be turned into a binary output. As an example, if you are trying to predict whether an e-mail is spam or not, a logistic regression with a threshold of 0.5 would classify any e-mail with a predicted &#8220;probability of being spam&#8221; &#8805; 50% as spam.</p><p>Understanding that the classifier consists of two parts&#8202;&#8212;&#8202;a threshold and the logistic regression&#8202;&#8212;&#8202;can help you understand why choosing a proper cutoff is an essential but usually overlooked part of model tuning when it comes to classifiers powered by logistic regression.</p><h4>CART</h4><p>CART is short for categorization and regression trees; as the name suggest, it is a <strong>supervised learning</strong> technique that can be used to predict categorical or continuous variables. CART is usually talked about together with <strong>Random Forest</strong>, because CART is the simpler version; it&#8217;s a single tree instead of, well, a forest (I will cover Random Forest in the next article).</p><p>CART&#8217;s biggest advantage when compared to regressions, or even random forest, is its interpretability, because it&#8217;s possible to plot the feature split used to build the tree. The illustration below showcases a super simple CART model. Illustrations like this can be easily plotted by most CART packages you use to fit the model and it can help you visualize how the model is built in the background. Another neat thing is that it shows you what <strong>features were used first to split </strong>the model; so it also illustrates feature importance.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dCfh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dCfh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png 424w, https://substackcdn.com/image/fetch/$s_!dCfh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png 848w, https://substackcdn.com/image/fetch/$s_!dCfh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png 1272w, https://substackcdn.com/image/fetch/$s_!dCfh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dCfh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dCfh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png 424w, https://substackcdn.com/image/fetch/$s_!dCfh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png 848w, https://substackcdn.com/image/fetch/$s_!dCfh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png 1272w, https://substackcdn.com/image/fetch/$s_!dCfh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3bf4b7a-29b7-4c76-8338-648cf80a4ec1_800x726.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Illustration by&nbsp;Author</figcaption></figure></div><h4><strong>How are these tested and what to watch out&nbsp;for</strong></h4><p>In general, there are two approaches to test these fundamental ML modeling concepts&#8202;&#8212;&#8202;resume-driven or theory-driven.</p><ul><li><p><strong>Resume-driven</strong>: the interview questions will be based on your resume. So make sure you take a couple of hours to take a trip down memory lane and refresh your understanding of the modeling experiences you mention in your resume. Depending on the interviewer and his/her background in machine learning, they might ask in-depth questions about certain algorithms you mention.</p></li><li><p><strong>Theory-driven</strong>: If you lack modeling experience, or modeling shows up in a case study portion of the interview, the questions will be theoretical and hypothetical. The interviewers will ask you what you WOULD DO in a certain situation. And they might throw you a curve ball by planting some problems with the dataset and see if you know how to deal with difficult data problems in modeling (we will cover that in detail in my upcoming article talking about model training).</p></li><li><p><strong>Things to remember: </strong>When talking about your modeling experience, make sure you showcase that you understand <strong>interpretability</strong> is the most important thing when building a model. There are countless ways to visualize model performance and feature importance for almost every commonly-used model. Stay tuned for an article on that too. Another thing to remember is that <strong>data cleaning</strong> and <strong>feature engineering </strong>are as important, if not more so than the modeling itself. Otherwise your model will suffer from the infamous &#8220;garbage in garbage out&#8221; symptom.</p></li></ul><p><strong>Interested in reading more about data science career tips? I might have something for you:</strong></p><p><strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba" title="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba">5 Lessons McKinsey Taught Me That Will Make You a Better Data Scientist</a></strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba" title="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba"><br>towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5">Why I Left McKinsey as a Data Scientist</a></strong><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5"><br></a><em><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5">Things you should consider before starting as a data science consultant</a></em><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5">towardsdatascience.com</a></p>]]></content:encoded></item><item><title><![CDATA[4 Lessons I Learned From Writing An Article That Made Me More Than $1,000]]></title><description><![CDATA[I can&#8217;t promise you can do the same, but it&#8217;s worth a try]]></description><link>https://www.divingintodata.com/p/4-lessons-i-learned-from-writing-an-article-that-made-me-more-than-1-000-53be4b720669</link><guid isPermaLink="false">https://www.divingintodata.com/p/4-lessons-i-learned-from-writing-an-article-that-made-me-more-than-1-000-53be4b720669</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Fri, 17 Jun 2022 17:03:03 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/602e1c83-9538-4281-97ce-90defbb3fda3_800x533.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jI0c!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jI0c!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jI0c!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jI0c!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jI0c!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jI0c!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jI0c!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jI0c!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jI0c!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jI0c!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a1a71a6-f870-4161-aa96-0971d8c224df_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@mathieustern?utm_source=medium&amp;utm_medium=referral">Mathieu Stern</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>And maybe you can do the&nbsp;same</h4><p>Let me explain the subtitle first. This is not one of those feel-good posts where I PROMISE new writers that the success can be replicated; I personally hate those because they ignore the <a href="https://en.wikipedia.org/wiki/Survivorship_bias">survivorship bias</a> and the luck factor in success and suffer from the &#8220;<a href="https://en.wikipedia.org/wiki/Just-world_hypothesis#:~:text=The%20just%2Dworld%20hypothesis%20or,fitting%20consequences%20for%20the%20actor.">fair world&#8221; fallacy</a>. But before you lose hope and click away, while the tips here CANNOT guarantee <strong>views and income on your article</strong>, they DO increase your <strong>CHANCE of being successful</strong> on the platform.</p><p>So, I have been writing on Medium since the beginning of 2021. My very first article was born out of the frustration that there was no good tutorial online about how to make an API call to get data, even though I have personally run into this problem multiple times as a data scientist. So I decided to take the issue into my own hands and <strong><a href="https://towardsdatascience.com/this-tutorial-will-make-your-api-data-pull-so-much-easier-9ab4c35f9af">wrote an article documenting my learnings</a></strong>, and the rest is history. My best performing article to this day is my article about the <strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba">data science lessons McKinsey has taught me during my time working there</a></strong>. It made me over $1000 since I published it in May last year. In fact, it generated over $1000 in the first 2 months after it was published.</p><p>So what have I learned from writing in the past year with several &#8220;viral&#8221; articles?</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NYBj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NYBj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png 424w, https://substackcdn.com/image/fetch/$s_!NYBj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png 848w, https://substackcdn.com/image/fetch/$s_!NYBj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png 1272w, https://substackcdn.com/image/fetch/$s_!NYBj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NYBj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NYBj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png 424w, https://substackcdn.com/image/fetch/$s_!NYBj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png 848w, https://substackcdn.com/image/fetch/$s_!NYBj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png 1272w, https://substackcdn.com/image/fetch/$s_!NYBj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F48e6ecd3-91b3-49df-a3cf-382f613fe039_800x401.png 1456w" sizes="100vw"></picture><div></div></div></a><figcaption class="image-caption">The stats of my best-performing article; screenshot by&nbsp;author</figcaption></figure></div><h4><strong>I. Write consistently</strong></h4><p>I know this one is a clich&#233;, but I still want to repeat it because it&#8217;s definitely true. The internet is a weird place, you can never know for certain which article will be popular as an author. I have had articles that I was very excited about sink like a stone with absolutely no ripple; and vice versa, some articles that I thought would only speak to a small audience found broad success. So the best way to write a viral article is to write a lot of good articles and see which one will get the attention it deserves. It&#8217;s almost impossible for your very first article to go viral; it usually takes a bit of work to accumulate readership so your article gets the baseline level of views as a start and has a fair chance at getting more attention from there. Looking at stats of the articles right before the viral article, there&#8217;s undeniably a build up and it takes time and consistency.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FOTT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe701186-5fc6-4b0a-93a6-732b157c3620_800x430.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FOTT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe701186-5fc6-4b0a-93a6-732b157c3620_800x430.png 424w, https://substackcdn.com/image/fetch/$s_!FOTT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe701186-5fc6-4b0a-93a6-732b157c3620_800x430.png 848w, https://substackcdn.com/image/fetch/$s_!FOTT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe701186-5fc6-4b0a-93a6-732b157c3620_800x430.png 1272w, https://substackcdn.com/image/fetch/$s_!FOTT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe701186-5fc6-4b0a-93a6-732b157c3620_800x430.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FOTT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe701186-5fc6-4b0a-93a6-732b157c3620_800x430.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/be701186-5fc6-4b0a-93a6-732b157c3620_800x430.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FOTT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe701186-5fc6-4b0a-93a6-732b157c3620_800x430.png 424w, https://substackcdn.com/image/fetch/$s_!FOTT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe701186-5fc6-4b0a-93a6-732b157c3620_800x430.png 848w, https://substackcdn.com/image/fetch/$s_!FOTT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe701186-5fc6-4b0a-93a6-732b157c3620_800x430.png 1272w, https://substackcdn.com/image/fetch/$s_!FOTT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe701186-5fc6-4b0a-93a6-732b157c3620_800x430.png 1456w" sizes="100vw"></picture><div></div></div></a><figcaption class="image-caption">Screenshot by&nbsp;Author</figcaption></figure></div><h4><strong>But talk is cheap; the harder problem is HOW to actually write consistently.</strong></h4><p>With English as my second language, I have always found writing hard. There&#8217;s a mental hurdle for me when it comes to writing because we are naturally scared of things we are not good at or familiar with. What I have found helpful is to open my laptop everyday to write SOMETHING, ANYTHING to get over that mental hurdle. If you have great ideas that day, write a paragraph or two, or even a whole article; if your writer&#8217;s juice is not flowing that day, write once sentence or two, or even just outline some ideas. ANYTHING counts.</p><p>Just remember, you don&#8217;t need to finish each article in a single setting, so write down any piecemeal inspiration you have WHEN you have them. Getting started is the hardest part; building the habit of writing into your daily routine is the key to having successful article(s).</p><h4>II. Get your article the initial&nbsp;exposure</h4><p>I publish with several different publications and occasionally self-publish as well. Depending on which editor/publication picks up the article, it might get very different treatment in the publishing process and trust me, it makes a noticeable difference.</p><p>I have noticed that if the article gets an initial boost from the editor&#8217;s applause or immediate posting across social media, it gets more attention faster than if the social media post only follows after a week of the article&#8217;s posting date, or if there&#8217;s no social media promotion at all. It makes sense, that snow ball effect is real. Having built several machine learning models myself, I wouldn&#8217;t be surprised if Medium&#8217;s decision algorithm for whether to promote an article places a high weight on the article&#8217;s initial traction (views and reactions like applause or comment). The more exposure your article gets at the beginning (getting posted on not only Medium, but also other platforms like LinkedIn), the more traffic it will receive, and the more signal it will send to Medium&#8217;s algorithm: &#8220;Hey, push me to more people on the platform because I&#8217;m a good article and people will read me&#8221;.</p><p>Now you understand the why, let&#8217;s talk about the how. If you publish with publications, try to work with the editors who help you publish and remind them to promote your article through their social media platforms if there are any. Also try to promote the article yourself on different platforms like LinkedIn, Facebook, Twitter etc.</p><h4><strong>III. Find your niche and your publication(s)</strong></h4><p>By finding your niche, I don&#8217;t just mean finding the general area you want to write about; of course, that&#8217;s the first step. I mean within that general area, find the gap that nobody has written a lot about that fits your expertise and/or interest.</p><p>In my case, I knew data science is the general area I want to write about because of my interest and background. There is a fair amount of authors writing about ML, AI and the technical side of data science, but I realized that career development and interview guides in data science are lesser touched areas. Since I have been through several rounds of recruiting in data science myself that happens to be the area where I have a good amount of experience to share and help my readers. As you write more articles in the niche you find, you will continue attracting readers who are interested in the topic and gain followers that way. Which means your future articles in the same area will get a certain baseline level of reads from your followers.</p><p>Equally importantly, finding one or more publications that you frequently work with will help you leverage their existing reader bases (usually a lot bigger than your own). I published most of my articles through Towards Data Science since it fits my focus area, and was able to get good traction on my initial articles despite having virtually no followers at the time. Going back to the point mentioned above, having publication(s) you are familiar with will also help you establish and maintain a relationship with editors that can help you promote your articles.</p><h4><strong>IV. Name dropping still works, for better or for&nbsp;worse</strong></h4><p>Why do you think so many YouTubers and Medium writers use prestigious companies&#8217; names in their titles (Google, Amazon, Uber, McKinsey, you name it)? Because it gets views. Not only does the name dropping get people&#8217;s attention right away, it helps the articles get picked up by search engines and other curations mechanisms. Believe or not, Google will even curate and push it to potentially interested parties if the articles gets enough attention. I know this because one McKinsey alumni I have no connection with read the article in question from his Google Chrome recommendation section and reached out to me (it&#8217;s probably because he indicated his interest in McKinsey related topics somehow).</p><p>So far, I have written 3 McKinsey-related articles and on average, they outperformed my other articles. So if you have any ideas in the pipeline that are related to your experience with big companies, don&#8217;t be shy to use the name in the title. With that being said, don&#8217;t be one of those annoying people who use big names as click baits when the content has nothing to do with those companies. You don&#8217;t want to throw away your integrity and credibility just to get a couple more clicks.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iGIo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iGIo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png 424w, https://substackcdn.com/image/fetch/$s_!iGIo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png 848w, https://substackcdn.com/image/fetch/$s_!iGIo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png 1272w, https://substackcdn.com/image/fetch/$s_!iGIo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iGIo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!iGIo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png 424w, https://substackcdn.com/image/fetch/$s_!iGIo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png 848w, https://substackcdn.com/image/fetch/$s_!iGIo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png 1272w, https://substackcdn.com/image/fetch/$s_!iGIo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ad780de-6d48-44c0-8b7b-78d64dcac6da_800x348.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Screenshot by&nbsp;author</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rXth!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rXth!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png 424w, https://substackcdn.com/image/fetch/$s_!rXth!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png 848w, https://substackcdn.com/image/fetch/$s_!rXth!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png 1272w, https://substackcdn.com/image/fetch/$s_!rXth!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rXth!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rXth!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png 424w, https://substackcdn.com/image/fetch/$s_!rXth!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png 848w, https://substackcdn.com/image/fetch/$s_!rXth!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png 1272w, https://substackcdn.com/image/fetch/$s_!rXth!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0645160e-b66a-46fd-a98c-657fe1bc35be_800x264.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Screenshot by&nbsp;author</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ALhs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ALhs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png 424w, https://substackcdn.com/image/fetch/$s_!ALhs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png 848w, https://substackcdn.com/image/fetch/$s_!ALhs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png 1272w, https://substackcdn.com/image/fetch/$s_!ALhs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ALhs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ALhs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png 424w, https://substackcdn.com/image/fetch/$s_!ALhs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png 848w, https://substackcdn.com/image/fetch/$s_!ALhs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png 1272w, https://substackcdn.com/image/fetch/$s_!ALhs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e67fefa-aa96-4863-bef3-a815e53f9514_800x297.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Screenshot by&nbsp;author</figcaption></figure></div><p>I hope these tips are helpful to new writers out there and don&#8217;t be shy to reach out and connect or</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qE1s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qE1s!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png 424w, https://substackcdn.com/image/fetch/$s_!qE1s!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png 848w, https://substackcdn.com/image/fetch/$s_!qE1s!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png 1272w, https://substackcdn.com/image/fetch/$s_!qE1s!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qE1s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qE1s!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png 424w, https://substackcdn.com/image/fetch/$s_!qE1s!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png 848w, https://substackcdn.com/image/fetch/$s_!qE1s!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png 1272w, https://substackcdn.com/image/fetch/$s_!qE1s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F854fd06d-b44d-44d7-8e2e-697f2398f8bc_440x110.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><strong>If you are interested reading the articles that performed well for me, here are some:</strong></p><p><strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba" title="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba">5 Lessons McKinsey Taught Me That Will Make You a Better Data Scientist</a></strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba" title="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba"><br>towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47">The Ultimate Interview Prep Guide for Data Scientists and Data Analysts</a></strong><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47"><br></a><em><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47">What helped me interview successfully with FANG as well as unicorns</a></em><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5">Why I Left McKinsey as a Data Scientist</a></strong><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5"><br></a><em><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5">Things you should consider before starting as a data science consultant</a></em><a href="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5" title="https://towardsdatascience.com/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5">towardsdatascience.com</a></p>]]></content:encoded></item><item><title><![CDATA[5 Tips That Will Help You Become a Data Science Manager]]></title><description><![CDATA[So you want to be promoted? So does everyone else. How can you stand out from your peers?]]></description><link>https://www.divingintodata.com/p/5-tips-that-will-help-you-become-a-data-science-manager-bd4b6981c024</link><guid isPermaLink="false">https://www.divingintodata.com/p/5-tips-that-will-help-you-become-a-data-science-manager-bd4b6981c024</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Tue, 24 May 2022 19:19:37 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/6c04b7bf-c2cd-4917-be37-9c60dd907249_800x533.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6XTe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6XTe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6XTe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6XTe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6XTe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6XTe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6XTe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6XTe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6XTe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6XTe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1603665-fc75-4a2e-9cce-9cc19f6e72d1_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/es/@hjwinunsplsh?utm_source=medium&amp;utm_medium=referral">Jungwoo Hong</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>So you want to be promoted? So does everyone else. How can you stand out from your&nbsp;peers?</h4><p>On top of my passion for data science, I have always been interested in people management. I have always wanted to help others achieve their career goals (hence the Medium account), and I love to get the big picture and learn about/help decide how a team fits into the company&#8217;s broader ecosystem.</p><p>When I started my new job a year and half ago, I was hoping for a promotion to manager soon since I had a couple of years of individual contributor (IC) experience under my belt and I thought I would be able to find some general guidance online about how to make this transition possible. But interestingly enough, while the internet is filled with <a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47">interview guides</a> and <a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1">general career advice</a>, there&#8217;s not much written about how to make the IC &#8594; manager transition. Everyone&#8217;s path to manager will be slightly different since there are a lot of variables in play here. But thanks to my manager&#8217;s coaching and guidance, I was able to distill some general lessons from my experience and hopefully will be able to help out some aspiring DS managers.</p><h4><strong>Take initiative</strong></h4><p>One of the most valuable lessons I have learned in this process is&#8202;&#8212;&#8202;<strong>you don&#8217;t get a promotion and THEN start to perform at the next level; you perform at the next level IN ORDER TO get a promotion</strong>.</p><p>So when you notice a gap somewhere, even if it doesn&#8217;t necessarily fall into your current role description, don&#8217;t be afraid to bring it up to your manager and discuss whether you can/should take initiative to help plug the gap. You manager will for sure appreciate ICs who can notice those things and can help brainstorm solutions.</p><p>The best way to notice gaps and in turn take initiative is to be a good listener and constantly communicate with your partners &amp; stakeholders about their teams&#8217; work and pain points. Generally, the more you know about other teams&#8217; work and how they collaborate with each other, the easier it is for you to identify ways to improve things.</p><h4><strong>Mentor a&nbsp;peer</strong></h4><p>One of the biggest responsibilities of a manager is to help other team members in terms of prioritization, guidance and general mentorship. So you should start practicing these when you are thinking about making the transition.</p><p>If you have <strong>new members joining the team</strong>, great, offer to be an <strong>onboarding buddy</strong> to guide them through their first few weeks. See if you are able to provide general context about stakeholders, tech stack and projects of the team and help with more detailed questions. This will also help you get a better understanding of your knowledge gaps and help you improve as an IC.</p><p>If there are no new members joining the team in the near term, you can still help out current peers on the team when they need someone to <strong>brainstorm a solution</strong> or when they don&#8217;t know which team to reach out to about a question.</p><h4><strong>Step out of your immediate scope</strong></h4><p>As a manager, you will need to <strong>elevate from a single IC&#8217;s scope and instead be in charge of several IC&#8217;s work streams</strong> on a higher level. Take this potential transition period as your risk-free practice.</p><p>If you have bandwidth outside of your day-to-day work, talk to your teammates to learn more about their projects. Occasionally you will find opportunities for collaboration between team members to improve team efficiency; these would be great suggestions to bring to your manager.</p><p>The same thing is also possible on an inter-team level. If you are able to pay more attention to <a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">conversations that are seemingly unrelated to your immediate scope</a>, you will be able to quickly make connections between different projects and initiatives and help improve the efficiency of collaboration across teams.</p><h4><strong>Get involved in team-level activities</strong></h4><p>Managers always appreciate a helping hand in team-level activities such as <strong>sprint planning, quarterly planning</strong> or even just timely suggestions. This will be a great opportunity to test your knowledge about other team members&#8217; work and other teams&#8217; requests for your team. It will also give you some <strong>exposure to the manager&#8217;s plan and vision</strong> for the team. Similarly, volunteering for <strong>culture initiatives</strong> is a great way to practice thinking about the team as a whole and start building a muscle that you will need to flex as a manager.</p><p>I have seen a lot of ICs with great ideas that they implement in their own work; whether it&#8217;s having a product design doc for the data product they develop, or having an SLA agreement with their partner teams. What would be even better is to codify the best practices you use in your own work so the whole team could benefit from it. This is in the same spirit as &#8220;step out of your own scope&#8221;; adopt a manager mindset and help out the whole team.</p><h4><strong>Have open, timely feedback conversations with your&nbsp;manager</strong></h4><p>As a data scientist, you should know the importance of metrics to track your progress. So how do you establish a metric for your transition to a manager? Having open and timely conversations with your manager is key here.</p><p>Every company has slightly different leveling guides for different roles. So it&#8217;s very important to understand the criteria you need to meet as a manager. Ask for the <strong>leveling guide</strong> when you have the initial career development conversation with your manager. And make sure you mention <strong>your aspiration to be a manager</strong> as soon as possible (don&#8217;t be shy) as well as your aspired timeline that you are working towards. At the same time, ask your manager for <strong>candid feedback</strong> with regards to their assessment of your readiness to become a manager, and any gaps that they think you need to address. Similar to the &#8220;build in public&#8221; mentality, don&#8217;t be afraid to put yourself out there for those type of things; even if you fail, it&#8217;s a great learning experience.</p><p>In followup career development check-ins, ask your manager to provide feedback for you against the leveling guide. This will provide clarity for both you and your manager in terms of your progress toward the goal.</p><h4><strong>Summary</strong></h4><p>Not sure if you have noticed, but communication is key in a lot of the things mentioned above. Communicate openly and often with <strong>your teammates, your stakeholders and your manager</strong>. It&#8217;s also important to get <strong>your manager&#8217;s support</strong> in this whole process. I wouldn&#8217;t have been able to do any of these without my awesome manager Dennis&#8217; help and mentorship. Remember, your manager will be your biggest advocate in all of those promo conversations, so it&#8217;s important to bring them on the journey and get their honest feedback along the way.</p><p>Besides that, remember to <strong>step out of your immediate scope</strong> to so you can <strong>get involved with more team-level work, mentor a peer, </strong>or just <strong>take more initiative in general.</strong></p><p><strong>Want to read more about data science career? Here are some great articles for you.</strong></p><p><strong><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">Productivity Tips for Data Scientists</a></strong><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27"><br></a><em><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">How to work better, smarter but not necessarily harder as a data scientist</a></em><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1">5 Mistakes I Wish I Had Avoided in My Data Science Career</a></strong><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1"><br></a><em><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1">I learned these lessons the hard way so you don&#8217;t have to</a></em><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/how-to-pick-the-right-career-in-the-data-world-1cec8a084767" title="https://towardsdatascience.com/how-to-pick-the-right-career-in-the-data-world-1cec8a084767">How To Pick The Right Career In The Data World</a></strong><a href="https://towardsdatascience.com/how-to-pick-the-right-career-in-the-data-world-1cec8a084767" title="https://towardsdatascience.com/how-to-pick-the-right-career-in-the-data-world-1cec8a084767"><br></a><em><a href="https://towardsdatascience.com/how-to-pick-the-right-career-in-the-data-world-1cec8a084767" title="https://towardsdatascience.com/how-to-pick-the-right-career-in-the-data-world-1cec8a084767">Data Scientist, Data Analyst, or Data Engineer? How do you know which one is right for you?</a></em><a href="https://towardsdatascience.com/how-to-pick-the-right-career-in-the-data-world-1cec8a084767" title="https://towardsdatascience.com/how-to-pick-the-right-career-in-the-data-world-1cec8a084767">towardsdatascience.com</a></p>]]></content:encoded></item><item><title><![CDATA[Concepts You Have to Know for Data Science Interviews — Part II. Probability]]></title><description><![CDATA[Most frequently asked questions in data scientist interviews]]></description><link>https://www.divingintodata.com/p/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5</link><guid isPermaLink="false">https://www.divingintodata.com/p/concepts-you-have-to-know-for-data-science-interviews-part-ii-probability-5c8830f13fb5</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Mon, 16 May 2022 18:13:54 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/affcd9bd-cbd7-4ec9-9908-a933a2b8a111_800x534.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JZYZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JZYZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg 424w, https://substackcdn.com/image/fetch/$s_!JZYZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg 848w, https://substackcdn.com/image/fetch/$s_!JZYZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!JZYZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JZYZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JZYZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg 424w, https://substackcdn.com/image/fetch/$s_!JZYZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg 848w, https://substackcdn.com/image/fetch/$s_!JZYZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!JZYZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef1d5b3-8fce-419d-8778-4328ca5f8e05_800x534.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@thisisengineering?utm_source=medium&amp;utm_medium=referral">ThisisEngineering RAEng</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>Most frequently asked questions in data scientist interviews</h4><p>In the <a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">last article</a> in the series (<a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">Concepts You Have to Know for Data Science Interviews&#8202;&#8212;&#8202;Part I: Distribution</a>), I touched on the basics of distributions&#8202;&#8212;&#8202;the most important distributions and their characteristics that might show up in data science interviews. In this article, I want to continue the tutorial with common probability questions that companies like to ask DS candidates.</p><p>Probability is a complicated subject that might be hard to master in a short period of time if you really want to understand it in depth. In fact, it is a required course that spans a whole semester for math majors in college. So this article will definitely NOT make you an expert in probability, but instead will give you an idea of the most commonly tested areas within this topic.</p><p><strong>Conditional Probability/ Bayes&#8217; Theorem</strong></p><p>Conditional probability is THE most tested type of probability problems during DS interviews. It&#8217;s a helpful skill because in our day-to-day job, a lot of analytics questions we run into involve conditional probability&#8202;&#8212;&#8202;for example, what&#8217;s the probability of <strong>having a disease </strong>given <strong>the test of the disease is positive. </strong>Conditional probability is usually calculated with Bayes&#8217; Theorem, the formula is shown below:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xvWC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6da51d79-580c-44a1-b257-4713127e195f_428x154.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xvWC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6da51d79-580c-44a1-b257-4713127e195f_428x154.png 424w, https://substackcdn.com/image/fetch/$s_!xvWC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6da51d79-580c-44a1-b257-4713127e195f_428x154.png 848w, https://substackcdn.com/image/fetch/$s_!xvWC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6da51d79-580c-44a1-b257-4713127e195f_428x154.png 1272w, https://substackcdn.com/image/fetch/$s_!xvWC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6da51d79-580c-44a1-b257-4713127e195f_428x154.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xvWC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6da51d79-580c-44a1-b257-4713127e195f_428x154.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6da51d79-580c-44a1-b257-4713127e195f_428x154.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xvWC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6da51d79-580c-44a1-b257-4713127e195f_428x154.png 424w, https://substackcdn.com/image/fetch/$s_!xvWC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6da51d79-580c-44a1-b257-4713127e195f_428x154.png 848w, https://substackcdn.com/image/fetch/$s_!xvWC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6da51d79-580c-44a1-b257-4713127e195f_428x154.png 1272w, https://substackcdn.com/image/fetch/$s_!xvWC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6da51d79-580c-44a1-b257-4713127e195f_428x154.png 1456w" sizes="100vw"></picture><div></div></div></a><figcaption class="image-caption"><a href="https://en.wikipedia.org/wiki/Bayes%27_theorem">Wikipedia</a></figcaption></figure></div><p><strong>A,B &#8594; </strong>Events</p><p><strong>P(A|B)</strong> &#8594; probability of A given B is true</p><p><strong>P(B|A) &#8594; </strong>probability of B given A is true</p><p><strong>P(A), P(B) &#8594; </strong>the independent probabilities of A and B</p><p><strong>Independence</strong> is an important concept when learning about conditional probability. It&#8217;s worth noting that if P(A|B) = P(A), meaning &#8220;the probability of A given B&#8221; is the same as &#8220;probability of A&#8221;, then the two events A and B are independent.</p><p>A good example of independent events is the result of each coin flip; for a fair coin:</p><p><strong>A: flip a head in the 2nd flip &#8594; P(A) = 1/2</strong></p><p><strong>B: flip a tail in the 1st flip &#8594; P(B) = 1/2</strong></p><p><strong>A|B: flipping a tail in the 2nd flip given flipped a head in the 1st flip &#8594; P(A|B) = 1/2</strong></p><p>Knowing that you flipped a head in the 1st flip does NOT change the probability of flipping a tail next, so those two events are independent.</p><p>In interviews, a strong hint that you should be considering using Bayes&#8217; Theorem is the key phrases like &#8220;given that&#8221; or &#8220;conditioning on&#8221;.</p><p><strong>Total Probability</strong></p><p>The law of total probability is stated as below:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NmMj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NmMj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png 424w, https://substackcdn.com/image/fetch/$s_!NmMj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png 848w, https://substackcdn.com/image/fetch/$s_!NmMj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png 1272w, https://substackcdn.com/image/fetch/$s_!NmMj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NmMj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NmMj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png 424w, https://substackcdn.com/image/fetch/$s_!NmMj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png 848w, https://substackcdn.com/image/fetch/$s_!NmMj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png 1272w, https://substackcdn.com/image/fetch/$s_!NmMj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59979145-75a2-45ac-9b8e-eb245442bd08_792x72.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption"><a href="https://en.wikipedia.org/wiki/Law_of_total_probability">Wikipedia</a></figcaption></figure></div><p><em><strong>&#8220;Bx&#8221;</strong></em> and &#8220;<em><strong>By&#8221;</strong></em> are disjoint events that in combination represent the whole selection universe. Suppose a group consist of people from either SF or Seattle. We know that 40% of the group are from SF (<strong>P(</strong><em><strong>Bx</strong></em><strong>)</strong>) and 60% are from Seattle (<strong>P(By)</strong>); the probability of Seattle being rainy today is 70% (<strong>P(A|By</strong><em><strong>)</strong></em>) and SF being rainy today is 20% (<strong>P(A|Bx)</strong>). What is the probability that we randomly call a person within the group and their city is rainy today (<strong>P(A)</strong>)?</p><p>Using the formula above, we can easily get P(A) = 0.4*0.2+0.6*0.7 = 0.5</p><p>The total probability is usually used in calculation for conditional probability because for the conditional probability shown below, often P(B) is not directly given but has to be calculated using the law of total probability.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qDil!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qDil!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png 424w, https://substackcdn.com/image/fetch/$s_!qDil!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png 848w, https://substackcdn.com/image/fetch/$s_!qDil!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png 1272w, https://substackcdn.com/image/fetch/$s_!qDil!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qDil!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qDil!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png 424w, https://substackcdn.com/image/fetch/$s_!qDil!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png 848w, https://substackcdn.com/image/fetch/$s_!qDil!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png 1272w, https://substackcdn.com/image/fetch/$s_!qDil!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc8f76f2-3cd0-4b53-abc6-8a056b25e02d_428x154.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption"><a href="https://en.wikipedia.org/wiki/Bayes%27_theorem">Wikipedia</a></figcaption></figure></div><p><strong>Binomial Probability</strong></p><p>We have talked about the binomial distribution in my previous article about <a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">distributions</a>. Binomial probability is using the binomial distribution to calculate the probability of having exactly n success among Y trials of an experiment that only has two outcomes (a good example is flipping a coin).</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E37v!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E37v!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png 424w, https://substackcdn.com/image/fetch/$s_!E37v!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png 848w, https://substackcdn.com/image/fetch/$s_!E37v!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png 1272w, https://substackcdn.com/image/fetch/$s_!E37v!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E37v!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!E37v!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png 424w, https://substackcdn.com/image/fetch/$s_!E37v!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png 848w, https://substackcdn.com/image/fetch/$s_!E37v!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png 1272w, https://substackcdn.com/image/fetch/$s_!E37v!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5695d09c-527b-44d8-9c0a-4c50c96d2e2f_800x225.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption"><a href="https://en.wikipedia.org/wiki/Binomial_distribution">Wikipedia</a></figcaption></figure></div><p><strong>k: </strong>number of trials with success</p><p><strong>n: </strong>number of total trials</p><p><strong>p: </strong>probability of success in a single trial</p><p><strong>n!</strong>: n factorial, calculation for this is n*(n-1)*(n-2)&#8230; *3*2*1</p><p><strong>k!</strong>: k factorial, calculation similar to above</p><p><strong>(n-k)!</strong>: you get the point</p><p>Recognizing that the question involves an event with only two outcomes is the key to deciding when to use the binomial formula to calculate probabilities.</p><p>One good example of this is: <strong>the probability of getting 4 heads out of 10 coin flips (assuming it&#8217;s an unfair coin, and for every flip, there&#8217;s a 1/4 probability of getting a head and 3/4 probability of getting a tail)</strong>.</p><p>In this example:</p><p>n = 10, k=4, p=1/4</p><p>The calculation for this would be 10!/(4!*6!)*(1/4)&#8308;*(3/4)&#8310; = 210*1/4&#8308;*(3/4)&#8310;= 0.15</p><p><strong>How are these tested and how to prepare for them</strong></p><p>These are tested in a lot of different ways. But the general theme is, instead of being in isolation, they are usually tested in combination. Like mentioned above, the Law of Total Probability can be easily incorporated into the Bayes&#8217; Theorem when asked in interviews.</p><p>It&#8217;s important to note that just memorizing the formulas for these will not be enough for interviews. Because in interviews, nobody will tell you when to use which formula. Determining which is the correct one to use is arguably the hardest part. The best way to master it is looking at a large number of sample questions and try NOT to look at the solution but try to solve them yourself instead. Getting familiar with the type of questions that require the use of these formulas is key. For sample questions, a simple Google search will give you plenty of results for each. If you want to have a more centralized &#8220;database&#8221; for probability questions, I highly recommend the <em><strong>Ace the Data Science Interview </strong></em>book by Nick Singh and Kevin Huo.</p><p><strong>Looking for more interview guide? Here are some articles that might be helpful!</strong></p><p><strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">Concepts You Have to Know for Data Science Interviews&#8202;&#8212;&#8202;Part I: Distribution</a></strong><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50"><br></a><em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">Most frequently asked questions in data scientist interviews</a></em><a href="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50" title="https://towardsdatascience.com/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/take-home-exercises-can-make-or-break-your-ds-interviews-5aac8b63f6d1" title="https://towardsdatascience.com/take-home-exercises-can-make-or-break-your-ds-interviews-5aac8b63f6d1">Take-Home Exercises can Make or Break Your DS Interviews</a></strong><a href="https://towardsdatascience.com/take-home-exercises-can-make-or-break-your-ds-interviews-5aac8b63f6d1" title="https://towardsdatascience.com/take-home-exercises-can-make-or-break-your-ds-interviews-5aac8b63f6d1"><br></a><em><a href="https://towardsdatascience.com/take-home-exercises-can-make-or-break-your-ds-interviews-5aac8b63f6d1" title="https://towardsdatascience.com/take-home-exercises-can-make-or-break-your-ds-interviews-5aac8b63f6d1">How to tackle your take-home exercise&#8202;&#8212;&#8202;arguably the most important part of the interview and the part that&#8217;s most in&#8230;</a></em><a href="https://towardsdatascience.com/take-home-exercises-can-make-or-break-your-ds-interviews-5aac8b63f6d1" title="https://towardsdatascience.com/take-home-exercises-can-make-or-break-your-ds-interviews-5aac8b63f6d1">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47">The Ultimate Interview Prep Guide for Data Scientists and Data Analysts</a></strong><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47"><br></a><em><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47">What helped me interview successfully with FANG as well as unicorns</a></em><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-data-scientists-and-data-analysts-18621db1da47">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-your-next-dream-data-job-be4b2c7f73a8" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-your-next-dream-data-job-be4b2c7f73a8">The Ultimate Interview Prep Guide for Your Next Dream Data Job</a></strong><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-your-next-dream-data-job-be4b2c7f73a8" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-your-next-dream-data-job-be4b2c7f73a8"><br></a><em><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-your-next-dream-data-job-be4b2c7f73a8" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-your-next-dream-data-job-be4b2c7f73a8">What helped me interview successfully with FANG and unicorns for jobs ranging from product manager to data scientist</a></em><a href="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-your-next-dream-data-job-be4b2c7f73a8" title="https://towardsdatascience.com/the-ultimate-interview-prep-guide-for-your-next-dream-data-job-be4b2c7f73a8">towardsdatascience.com</a></p>]]></content:encoded></item><item><title><![CDATA[Why I Left McKinsey as a Data Scientist]]></title><description><![CDATA[Things you should consider before starting as a data science consultant]]></description><link>https://www.divingintodata.com/p/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5</link><guid isPermaLink="false">https://www.divingintodata.com/p/why-i-left-mckinsey-as-a-data-scientist-30eec01504e5</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Tue, 26 Apr 2022 15:18:37 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/a3fc2006-2a46-4979-8bcd-aaab588c6335_800x533.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rjsi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rjsi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!rjsi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!rjsi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!rjsi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rjsi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rjsi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!rjsi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!rjsi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!rjsi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61821769-e591-42a6-933c-2618e08dd20f_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@bearsnap?utm_source=medium&amp;utm_medium=referral">Junseong Lee</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>Things you should consider before starting as a data science consultant</h4><p>In my previous articles about McKinsey, I talked about <strong><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">why I decided to join McKinsey as a data science consultant</a></strong> and why I think it&#8217;s a great career move for aspiring data scientists; I have also talked about <strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba">valuable lessons I have learned from being a data science consultant</a></strong>.</p><p>Given the prestige of the company and the great things I mentioned in those articles, it was a surprise to a lot of people that I chose to leave after two years. In this article, I want to share <strong>what drove my decision to leave </strong>and what you should know and take into consideration if you are also thinking about joining consulting as a data scientist.</p><p>And just to steal the thunder of the negative voices that appear every single time I mention McKinsey in my articles&#8202;&#8212;&#8202;the reasons I will be mentioning here are solely out of the consideration for <strong>personal and career development</strong>; as for debates about &#8220;whether the firm is ethical&#8221;, &#8220;whether consultants are useful for companies&#8221; etc., everyone is entitled to their own opinion and those are out of the scope of my discussion here.</p><p>So with that out of the way, let&#8217;s talk about why I left:</p><p><strong>It&#8217;s hard to feel the long-term impact of your work as a DS</strong></p><p>You probably hear A LOT OF consultants saying this is THE reason why they have left consulting. I used to think it&#8217;s just a convenient reason people use in interviews but it turned out to be one of the main reasons that drove me to leave in the end.</p><p>This is partially determined by the nature of consultancy. Consultants provide expertise and advice in certain fields to companies. At the end of each engagement, the proposed solution (usually a model or an analysis for data science engagements) will be wrapped in a bow and handed to the client to <strong>implement, test and/or maintain</strong>. Imagine you built a demand-forecasting model for a client as a data science consultant: You can <strong>assess the model&#8217;s performance using historic data</strong> before handing it off to the client, and if the partner manages to convince the client to extend the project to test the model, you might get several weeks worth of new data to assess impact. But most likely you won&#8217;t know its performance after it&#8217;s officially deployed by the client (or if it will be deployed and used at all), let alone its continued impact several years down the road.</p><p>Until this day, it&#8217;s still a debate whether the <strong>implementation phase should be part of consulting firms&#8217; scope</strong>. Some people believe if consultants start to implement the solutions proposed, they are taking on the role of managers while others think proposal of solutions without implementation is a waste of time and money. McKinsey started taking implementation more and more seriously during my tenure; but at the current stage of the industry, aspiring data science consultants should not get their hopes up that they will see their models in production and be able to observe longer-term impact.</p><p><strong>Not many chances to work with cutting-edge DS concepts</strong></p><p>Companies hire consultants for the expertise they DON&#8217;T have. So naturally, most clients that work with consultants for analytics needs are not the most advanced on the data science front. As a data science consultant, you are most likely NOT going to work on building a <strong>recommendation engine for Amazon</strong> or <strong>search optimization for Google</strong>.</p><p>There are projects here and there focusing on relatively cutting-edge techniques such as dynamic pricing, but the majority of the engagements will be utilizing simpler methods. Not to mention that clients who are just starting out on developing their analytics practice usually have more <strong>distrust for black-box models</strong> and prefer interpretability over it. As a data science consultant, you will have more opportunity to learn how to explain analytics concepts to non-analytical audiences (which in my opinion is an extremely valuable skill), but less opportunity to implement the newest practice in the DS field.</p><p>This is a more relevant concern for some than for others. If your interest as a data scientist lies in <strong>machine learning, AI and/or other advanced analytics areas</strong>, it&#8217;s worth keeping this in mind before deciding which projects you want to work on in consulting, or if consulting is the right industry to enter at all. However, if you are more interested in <strong>solving real-world business problems through data </strong>and/or modeling, consulting is the perfect place to start learning and practicing those skills.</p><p><strong>Scalability is not always top of mind and it&#8217;s hard to think about/take on long-term initiatives</strong></p><p>Since consulting engagements are usually short-term, even squeezed for time occasionally, clients expect to see deliverables within weeks. Due to short timelines and the fact that most times consultants are not held accountable for the sustainability of the deliverables after the handoff, <strong>efficiency</strong> usually takes priority over <strong>scalability</strong> when it comes to choosing approaches. A lot of the analytics deliverables, models or analyses end up being more of a &#8220;proof of concept&#8221; or &#8220;prototype&#8221; than a real fully-fledged product.</p><p>Ever since working in the tech industry, especially as a manager, I have learned the importance of <strong>long-term initiatives</strong>. These type of projects are usually initiated by managers and leaders who elevate themselves from their teams&#8217; immediate day-to-day scope and <strong>proactively address gaps</strong> in realizing the whole company&#8217;s vision. Consultants hardly ever need to think about long-term initiatives because they are hired to solve a (or a set of) problem(s); so the nature of the work is <strong>reactive instead of proactive</strong> and the scope of the work is normally pre-defined. Imagine you are hired to figure out how to place a piece of the puzzle, but you don&#8217;t have a say in the how the rest of the puzzle can be moved around; it can get frustrating and you don&#8217;t get to develop an understanding of the rest of the puzzle and thus never develop the skillset to move those pieces around.</p><p><strong>The long hours are real and the traveling can get frustrating</strong></p><p>The final reason is more on the personal level than on the professional level. Due the nature of the work and the industry (like lawyers, consultants bill their clients by the hour), consulting is <strong>not known for a great work-life balance</strong>. Churning out analyses/decks until midnight is the norm for most weekdays.</p><p>On top of the long hours, pre-pandemic, consultants need to travel to a different city from Monday to Thursday for most projects. You might be thinking to yourself, &#8220;that sounds exciting, I get to visit different cities!&#8221;, and that&#8217;s what I thought, too, at the beginning. However, the excitement of travel for work wears off pretty quickly, especially since you have limited time to explore the city and spend most nights working late in similar-looking business hotels.</p><p><strong>Summary:</strong></p><p>Like I mentioned in my previous article about <a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">why I decided to join McKinsey as a data scientist</a>, even though consulting is a great place to start your career as an aspiring data scientist, it might not be for everyone. Things to keep in mind about being a data scientist in consulting:</p><ul><li><p>Consulting might be a better fit for you if you are more of an &#8220;explorer&#8221; than a &#8220;builder&#8221; because for most projects, you will hand off the prototype to clients and won&#8217;t see it grow</p></li><li><p>In general, you will acquire more data story-telling skills than cutting-edge modeling skills</p></li><li><p>You will get more opportunities to engage in efficient problem-solving with short-term analytics projects; but less chance to do long-term planning for robust analytics ones</p></li></ul><p><em><strong>Interested in reading more about data science career? Here are some articles that you might be interested in:</strong></em></p><p><strong><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">Why I Joined McKinsey as a Data Scientist</a></strong><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d"><br></a><em><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">Should you ever be a data science consultant despite the 80-hour work weeks?</a></em><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba" title="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba">5 Lessons McKinsey Taught Me That Will Make You a Better Data Scientist</a></strong><a href="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba" title="https://towardsdatascience.com/5-lessons-mckinsey-taught-me-that-will-make-you-a-better-data-scientist-66cd9cc16aba"><br>towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1">5 Mistakes I Wish I Had Avoided in My Data Science Career</a></strong><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1"><br></a><em><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1">I learned these lessons the hard way so you don&#8217;t have to</a></em><a href="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1" title="https://towardsdatascience.com/5-mistakes-i-wish-i-had-avoided-in-my-data-science-career-6c22a44304a1">towardsdatascience.com</a></p>]]></content:encoded></item><item><title><![CDATA[An Introvert’s Survival Guide in The “Loud” Virtual Working World]]></title><description><![CDATA[How to improve productivity and be happier when WFH]]></description><link>https://www.divingintodata.com/p/an-introverts-survival-guide-in-the-loud-virtual-working-world-ff7c907a50cf</link><guid isPermaLink="false">https://www.divingintodata.com/p/an-introverts-survival-guide-in-the-loud-virtual-working-world-ff7c907a50cf</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Tue, 29 Mar 2022 14:13:44 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/3f84050e-c3ba-4cfe-b075-406a1ea1f0d5_800x512.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qfX2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qfX2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg 424w, https://substackcdn.com/image/fetch/$s_!qfX2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg 848w, https://substackcdn.com/image/fetch/$s_!qfX2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!qfX2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qfX2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qfX2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg 424w, https://substackcdn.com/image/fetch/$s_!qfX2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg 848w, https://substackcdn.com/image/fetch/$s_!qfX2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!qfX2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d19f7a4-ef8e-4c8c-8a72-a6a0d73fa7a7_800x512.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@tinaflour?utm_source=medium&amp;utm_medium=referral">Kristina Flour</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>How to improve productivity and be happier when&nbsp;WFH</h4><p>By now I&#8217;m sure everyone has heard extroverts&#8217; complaints about the whole WFH situation&#8202;&#8212;&#8202;the fun of meetings and being surrounded by people is taken away from them. But since introverts gain energy from the solitude, that means WFH should be easy for them, right?</p><p>Well, not really.</p><p>Like a lot of people, at the beginning of the pandemic I thought that WFH would be more introvert-friendly and less draining in comparison to an in-person working environment, and as an introvert myself, I was looking forward to that. Man, was I wrong: The virtual work environment can be very loud, arguably louder than being in the office at times.</p><p><em><strong>Work from home (WFH) I hoped for:</strong></em></p><p><strong>9am-11am: </strong>A couple of important meetings with small groups</p><p><strong>11am-12pm:</strong> Lunch time</p><p><strong>12pm on:</strong> Heads down coding/modeling/analyses time without interruption or distraction from office noise etc.</p><p><em><strong>Work from home in reality:</strong></em></p><p><strong>9am-forever: </strong>Meetings to eternity with 10+ people, with a couple of last-minute &#8220;can we hop on a call real quick&#8221; requests sprinkled on top.</p><p><strong>10-minute-intervals in between meetings: </strong>Trying to quickly get some heads-down work done</p><p>Remote work during COVID has unfortunately resulted in <a href="https://hbswk.hbs.edu/item/you-re-right-you-are-working-longer-and-attending-more-meetings">more frequent meetings and emails</a>. It definitely took a toll on my mental health when my calendar was suddenly filled with huge meetings where it was hard to get points cross without cutting people off; my slack beeped every 30 seconds; and the little red circle on my email app increased counts every minute. The virtual working environment quickly became too &#8220;loud&#8221; for me to be productive.</p><p><strong>Know what energizes you and what drains your energy as an introvert:</strong></p><p>Even though extroversion and and introversion are <strong><a href="https://link.springer.com/chapter/10.1007/978-981-13-7213-1_6">not binary</a></strong>, if you self-identify as an introvert, it&#8217;s likely that big groups drain your energy to some extent and you prefer <strong>smaller, more intimate</strong> interactions. It shouldn&#8217;t be a surprise that you need <strong>time and space</strong> to reflect and/or work on things that involve deep thinking and analysis thus it&#8217;s helpful to find ways to <strong>eliminate distractions</strong>.</p><p>For the past several months, I experimented with different ways to improve the remote-work situation trying to reduce the distraction and noise so I can be productive and keep my mental health in check.</p><h4><strong>Meetings</strong></h4><p><em><strong>Control your schedule by grouping all your meetings together</strong></em></p><p>For introverts, what&#8217;s more annoying that having tons of meetings everyday is having tons of meetings spanning the whole day with 30-minute breaks in between. Because those 30-minute breaks are not enough to do any deep work before you get pulled into another meeting, they are essentially &#8220;wasted&#8221;. And introverts get energy from being able to <a href="https://medium.com/taking-note/why-deep-work-matters-in-a-distracted-world-ee4a675375f0">dive into and focus on a topic</a>.</p><p><strong>What works better </strong>is trying to group all of your meetings together. So instead of having meetings filling up your whole day, have them fill your whole morning (or afternoon, or early morning and late afternoon, your choice) so you will still have chunks of time to do in-depth work without getting distracted outside of those meeting blocks. In order to make sure that nobody schedules over your focus time, block if off on your calendar as &#8220;DNS&#8221; (Do Not Schedule) or &#8220;Please ask before scheduling&#8221;.</p><p>And instead of having back-to-back 30-minute meetings, I would usually try to schedule 25-minute meetings: end the meeting 5 minutes early or start the meeting invite 5-minutes after the clock. This way you have enough time to take a break between meetings and reset.</p><p><em><strong>Have a pre-read and agenda for meetings</strong></em></p><p>It&#8217;s always hard to get your point across in big meetings, especially as an introvert. Not to mention you will often have participants in the meetings ask a marginally-related question and take everyone down a rabbit hole; as an introvert, I find it hard to then drag everyone back to the original topic.</p><p>To avoid situations like these, for the meetings I run, I prefer to send pre-reads and an agenda before the meetings detailing the topics we want to talk about and providing some background/context of the meeting. This will help decrease the amount of digression and help drive the points home. When the discussions get out of hands, it&#8217;s also easier to have the agenda serving as a lighthouse for people to re-focus on the agreed-upon goal of the meeting.</p><p><em><strong>Keep your meeting size small and focus on small-group interactions</strong></em></p><p>Giant meetings and 100-people slack channels can be a nightmare for introverts.</p><p>I prefer to keep my meetings small so I can ask targeted questions and make sure everyone&#8217;s opinion gets factored into the discussion. <em>When having cross-functional discussions with other teams, instead of inviting everyone, I only invite one or two key people on each team and count on them for intra-team communication from there</em>.</p><p>For giant virtual meetings, I find it easier to ask questions through the chat function of the meeting tool than trying to talk over people.</p><h4><strong>Other working&nbsp;tools</strong></h4><p><strong>Turn off your slack notification and check it on your own terms</strong></p><p>Instead of getting distracted by the sound of slack every 5 minutes, try turning off the notification and only check it when you want to take a break from the analysis you are working on. Even if you can&#8217;t help yourself but to check the message right away when a little red bubble with number &#8220;1&#8221; shows up on top of Slack, the absence of the notification sound will make you less stressed.</p><p><strong>Only dedicate certain times for checking/processing emails</strong></p><p>Depending on your company culture and people&#8217;s working habit around you, you may get tons of emails on top of slack messages. I find it the easiest to make sure people know that emails should be used for non-urgent communications whereas Slack could handle the more urgent ones. This way I can bulk check my emails once a day and not get distracted by them during my heads-down time.</p><p>Hopefully some of the tips here are helpful to my fellow introverts out there&#8202;&#8212;&#8202;the virtual world can be loud and noisy but you have the power to turn down its volume and find your own rhythm in it.</p><p><strong>Don&#8217;t know what to read next? The following might be interesting to you:</strong></p><p><strong><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">Why I Joined McKinsey as a Data Scientist</a></strong><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d"><br></a><em><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">Should you ever be a data science consultant despite the 80-hour work weeks?</a></em><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">Productivity Tips for Data Scientists</a></strong><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27"><br></a><em><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">How to work better, smarter but not necessarily harder as a data scientist</a></em><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">towardsdatascience.com</a></p><p><strong><a href="https://medium.com/illumination/why-i-wont-participate-in-the-great-resignation-for-my-side-hustle-e6ef6f9c85ab" title="https://medium.com/illumination/why-i-wont-participate-in-the-great-resignation-for-my-side-hustle-e6ef6f9c85ab">Why I Won&#8217;t Participate in the &#8216;Great Resignation&#8217; For My Side Hustle</a></strong><a href="https://medium.com/illumination/why-i-wont-participate-in-the-great-resignation-for-my-side-hustle-e6ef6f9c85ab" title="https://medium.com/illumination/why-i-wont-participate-in-the-great-resignation-for-my-side-hustle-e6ef6f9c85ab"><br></a><em><a href="https://medium.com/illumination/why-i-wont-participate-in-the-great-resignation-for-my-side-hustle-e6ef6f9c85ab" title="https://medium.com/illumination/why-i-wont-participate-in-the-great-resignation-for-my-side-hustle-e6ef6f9c85ab">and why you should reconsider, too, if you are thinking about it</a></em><a href="https://medium.com/illumination/why-i-wont-participate-in-the-great-resignation-for-my-side-hustle-e6ef6f9c85ab" title="https://medium.com/illumination/why-i-wont-participate-in-the-great-resignation-for-my-side-hustle-e6ef6f9c85ab">medium.com</a></p>]]></content:encoded></item><item><title><![CDATA[Concepts You Have to Know for Data Science Interviews — Part I: Distribution]]></title><description><![CDATA[Most frequently asked questions in data scientist interviews]]></description><link>https://www.divingintodata.com/p/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50</link><guid isPermaLink="false">https://www.divingintodata.com/p/concepts-you-have-to-know-for-data-science-interviews-part-i-distribution-f4c28da3fc50</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Tue, 15 Mar 2022 06:17:25 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b9274b7d-9f08-4dda-91d5-aeb1cb9d58e6_800x532.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5_YU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5_YU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5_YU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5_YU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5_YU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5_YU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5_YU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5_YU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5_YU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5_YU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6ebc3e3-a641-416d-b51c-c01f09ebbc27_800x532.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@edge2edgemedia?utm_source=medium&amp;utm_medium=referral">Edge2Edge Media</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>Office Hours</h4><h4>Most frequently asked questions in data scientist interviews</h4><p>It&#8217;s the beginning of the year and a lot of people are looking into new opportunities and new challenges. I remember from my multiple rounds of recruiting in the past that I regularly struggled to find a good comprehensive resource to prepare for my data science interviews, so I decided to put one together myself.</p><p>If you are starting a new round of recruiting or looking to up-skill your data science knowledge for the new year, hopefully you will find this cheatsheet/crash course helpful as a starting point. This is the first article of the data science interview series where I want to mention some basic distributions and basic concepts related to distributions that get asked frequently in data science interviews.</p><p>The concepts I will touch on in this series is by no means exhaustive, it&#8217;s an aggregated list for the most asked questions in data science interviews, or concepts good data scientists should be familiar with, based on my experience.</p><p><strong>Basic Statistics Concepts Related to Distributions</strong></p><p>This might seem too basic for some of you, but it&#8217;s important to make sure you truly understand the definitions, differences and use cases for simple statistic concepts like mean, median and mode. Several key concepts I see coming up time and time again in interviews:</p><ol><li><p>Mean is different from the median in the sense that the mean is more easily influenced by outliers and the skewness of the distribution. The mean will be the same as the median for symmetric distributions. The mean will be smaller than the median for left skewed distributions and will be bigger than the median for right skewed distributions.</p></li><li><p>Mode represents the &#8220;peak&#8221; of the distribution and it overlaps with both the mean and the median for symmetric distributions. For left-skewed distributions, the mode is bigger than both the median and the mean; and the mode is smaller than both for right-skewed distributions.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IYSV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IYSV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png 424w, https://substackcdn.com/image/fetch/$s_!IYSV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png 848w, https://substackcdn.com/image/fetch/$s_!IYSV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png 1272w, https://substackcdn.com/image/fetch/$s_!IYSV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IYSV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!IYSV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png 424w, https://substackcdn.com/image/fetch/$s_!IYSV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png 848w, https://substackcdn.com/image/fetch/$s_!IYSV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png 1272w, https://substackcdn.com/image/fetch/$s_!IYSV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6148eb6-eaeb-4c3b-a03b-21e2ad60e47a_800x604.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Graph from <a href="https://math.stackexchange.com/questions/969967/finding-the-most-likely-serve-speed-of-a-tennis-player">Stack Exchange</a> (Note: StackExchange images are all CC-licensed and allow commercial use)</figcaption></figure></div><p><strong>Distributions</strong></p><p>Distributions are a very important concept in statistics that gets tested in interviews a lot. There are a lot of distributions out there, but several important and common ones get mentioned and used over and over again in interviews:</p><ol><li><p><strong>The Normal Distribution</strong></p></li></ol><p>This is THE most important distribution to know as many things in life follow the Normal Distribution&#8202;&#8212;&#8202;height of adults, IQ of adults etc. It&#8217;s important to know that the mean, median and mode are the same for the Normal Distribution as it is <strong>symmetric</strong> and <strong>NOT skewed</strong>.</p><p>The Normal Distribution has<strong> symmetric variation</strong> around the mean that is defined as <strong>standard deviation</strong>. This means it&#8217;s equally likely for values to be higher or lower than the mean. But it&#8217;s less likely for values to vary from the mean by a larger magnitude than a smaller magnitude (regardless of the direction). In fact, for the Normal Distribution, 68.3% of values lie within 1 standard deviation away from the mean; 95.5% lie within 2 standard deviations away from the mean and 99.7% values are within 3 standard deviations from the mean.</p><p>The Normal Distribution is determined by only two parameters&#8202;&#8212;&#8202;the mean and the standard deviation. The mean determines the center of the distribution&#8202;&#8212;&#8202;where the peak (mean, median and mode) lies whereas the standard deviation determines the shape/spread of the distribution&#8202;&#8212;&#8202;the smaller the standard deviation, the tighter the distribution.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!C5tn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!C5tn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png 424w, https://substackcdn.com/image/fetch/$s_!C5tn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png 848w, https://substackcdn.com/image/fetch/$s_!C5tn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png 1272w, https://substackcdn.com/image/fetch/$s_!C5tn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!C5tn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!C5tn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png 424w, https://substackcdn.com/image/fetch/$s_!C5tn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png 848w, https://substackcdn.com/image/fetch/$s_!C5tn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png 1272w, https://substackcdn.com/image/fetch/$s_!C5tn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f6bedb-958a-460b-984d-904b55b2b7d8_800x619.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption"><a href="https://stats.stackexchange.com/questions/476677/understanding-standard-deviation-in-normal-distribution">stats.stackexchange.com</a></figcaption></figure></div><p><strong>2. Bernoulli Distribution</strong></p><p>The Bernoulli Distribution describes a single experiment that has <strong>ONLY two outcomes. </strong>We usually describe these two outcomes as success-1 and failure-0 for simplicity. The most common examples of this distribution are things like a flip of a coin, where a head would be a &#8220;success&#8221; and a tail would be a &#8220;failure&#8221;; or the weather outcome in terms of whether it will rain, where rain would be a &#8220;success&#8221; and no rain will be a &#8220;failure&#8221; (or the other way around if you really hate rain).</p><p>There is only one parameter for this distribution&#8202;&#8212;&#8202;the probability of success <em>p</em>. In the case of a fair coin, <em>p=</em>50%.</p><p><strong>3. Binomial distribution</strong></p><p>The Binomial Distribution is closely related to the Bernoulli distribution as it models the number of successes in repeated Bernoulli experiments.</p><p>The Binomial Distribution takes two parameters&#8202;&#8212;&#8202;the probability of success <em>p </em>and the number of repetition <em>n. </em>When <em>p </em>is greater than 0.5, the distribution is left-skewed; when it&#8217;s smaller than 0.5, the distribution is right-skewed and when <em>p = 0.5, </em>you guessed it, the distribution is symmetric.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c9XZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c9XZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png 424w, https://substackcdn.com/image/fetch/$s_!c9XZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png 848w, https://substackcdn.com/image/fetch/$s_!c9XZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png 1272w, https://substackcdn.com/image/fetch/$s_!c9XZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c9XZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!c9XZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png 424w, https://substackcdn.com/image/fetch/$s_!c9XZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png 848w, https://substackcdn.com/image/fetch/$s_!c9XZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png 1272w, https://substackcdn.com/image/fetch/$s_!c9XZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59a54d78-af0f-4579-ba5e-4d1c822554e7_776x560.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Left-skewed binomial distribution (<a href="https://stats.stackexchange.com/questions/176425/why-is-a-binomial-distribution-bell-shaped">Stack Exchange</a>)</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YfDV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YfDV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png 424w, https://substackcdn.com/image/fetch/$s_!YfDV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png 848w, https://substackcdn.com/image/fetch/$s_!YfDV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png 1272w, https://substackcdn.com/image/fetch/$s_!YfDV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YfDV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YfDV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png 424w, https://substackcdn.com/image/fetch/$s_!YfDV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png 848w, https://substackcdn.com/image/fetch/$s_!YfDV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png 1272w, https://substackcdn.com/image/fetch/$s_!YfDV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6870a56d-3f33-4488-ba2e-9c90ca4027cc_800x431.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption"><a href="https://www.dummies.com/article/business-careers-money/business/accounting/calculation-analysis/how-to-graph-the-binomial-distribution-145938">Symmetric binomial distribution</a></figcaption></figure></div><p>If you think the symmetric Binomial Distribution above looks a lot of a Normal Distribution, you are right. In fact, the Binomial Distribution can be approximated by the Normal Distribution when the number of experiments is big enough (with <em>n </em>being the number of experiment repetition and <em>p </em>being probability of success, the rule of thumb is when both np and n(1-p) are bigger than 10, the Binomial Distribution can be approximated by the Normal Distribution).</p><p>Other less common distributions that you might get asked about are the <strong>Poisson Distribution</strong> (it describes the number of events in a certain timeframe; the most common example is the number of customers arriving to a restaurant within a fixed time frame), <strong>Exponential Distribution</strong> (it is related to the Poisson Distribution since it describes the time interval between the events; in the customer example, it describes the time passed between each customer&#8217;s arrival within a fixed time window) and <strong>Gamma Distribution</strong> (it is similar to the Exponential Distribution in the sense that it describes the the total time until reaching a fix number of events; to continue the example, it describe the time you need to wait until having a certain number of customers arrived at the restaurant).</p><p><strong>How these are tested</strong></p><p>These concepts are usually NOT tested separately in the format of a school test, because interviewers want to see you can actually use those concepts to solve real world problems instead of memorizing the wikipedia pages about them. So the common questions asked are along the lines of:</p><ul><li><p>&#8220;What kind of distribution does XX (e.g. the length of Facebook Marketplace page visits etc.) have?&#8221;</p></li><li><p>Follow up question: &#8220;What&#8217;s the relationship of mean and median of this distribution?&#8221; or &#8220;Do you think the mean will be bigger or smaller than median for this distribution?&#8221;</p></li></ul><p>Hope this summary is helpful to your recruiting process. Keep in mind that you won&#8217;t be a stats expert by just reading several articles about it; it takes practice to master stats knowledge. So the best way to familiarize yourself with this is to utilize this knowledge whenever you can; for example, when you research about the company you are interested in, ask yourself what the most important metrics for this company would be and what you think their distribution looks like.</p><p>Don&#8217;t know what to read next? Here are some suggestions:</p><p><strong><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">Why I Joined McKinsey as a Data Scientist</a></strong><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d"><br></a><em><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">Should you ever be a data science consultant despite the 80-hour work weeks?</a></em><a href="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d" title="https://towardsdatascience.com/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">Productivity Tips for Data Scientists</a></strong><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27"><br></a><em><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">How to work better, smarter but not necessarily harder as a data scientist</a></em><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2">Top Qualities Hiring Managers Look For In Data Scientist Candidates</a></strong><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2"><br></a><em><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2">Some of these are arguably more important than writing efficient code</a></em><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2">towardsdatascience.com</a></p>]]></content:encoded></item><item><title><![CDATA[Why I Joined McKinsey as a Data Scientist]]></title><description><![CDATA[Should you ever be a data science consultant despite the 80-hour work weeks?]]></description><link>https://www.divingintodata.com/p/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d</link><guid isPermaLink="false">https://www.divingintodata.com/p/why-i-joined-mckinsey-as-a-data-scientist-2fb3b586fd0d</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Tue, 08 Feb 2022 16:18:49 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/42a46b03-c6ba-41cc-aace-9cf3b79f18e3_800x533.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fMeF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fMeF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!fMeF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!fMeF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!fMeF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fMeF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fMeF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!fMeF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!fMeF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!fMeF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5c59330b-14bb-43cf-8bed-821f994cc44b_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@huntersrace?utm_source=medium&amp;utm_medium=referral">Hunters Race</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>Should you ever be a data science consultant despite the 80-hour work&nbsp;weeks?</h4><p>Time really flies; we said goodbye to the crazy and unexpected 2021 and stepped into a new year&#8230;.more than a month ago! Reflecting on 2021, I realized that I have left the consulting life behind for over a year now.</p><p>Looking back, joining McKinsey was one of the best moves I made for my career (so far) and I truly learned a ton about data science and career development in general from the experience (if you want to read about the lessons I learned about data science, read my previous article <a href="https://medium.com/codex/why-you-should-always-turn-down-an-exit-interview-ce25440b4e8c">here</a>).</p><p>So as a memo to myself and people who are interested in the consulting world, I want to share some thoughts in this article about why I initially joined McKinsey; in my next article, I will then share why I ultimately decided to leave. Hopefully, in combination these articles will provide some insights for those who are considering <strong>data science consulting</strong> as a career move.</p><ol><li><p><strong>Consulting is a great place to look for your passion and learn about different industries and from different industry experts</strong></p></li></ol><p>After being a quant researcher in the finance industry for almost two years, I realized that I loved the analytics and data aspect of my job, but I was not a huge fan of the finance industry.</p><p>So I decided to take my transferrable data skills somewhere else. The question was, where? I only knew what I <strong>DIDN&#8217;T</strong> want to do, but was clueless about what I <strong>DID</strong> want to do. Did I want to work with geo-spatial data? Optimize consumer marketing campaigns or B2B sales funnels?</p><p><strong>Consulting is the perfect industry to go if you are in the same spot as I was&#8202;&#8212;&#8202;</strong><em><strong>knowing that you want to work with data but don&#8217;t know what KIND of data you want to work with and/or which INDUSTRY you are interested in working in</strong></em><strong>.</strong></p><p>Because consulting firms serve a wide range of clients, no matter where your interests and passion lie, you WILL find something that&#8217;s for you. Even within the same industry, you will get to work with a wide range of companies in terms of size, maturity, culture and other dimensions.</p><p>Because experienced consultants have worked with dozens of companies in the past, they often have developed best practices/playbooks in a certain area. So as someone who&#8217;s new to the data science field, consulting is one of the best places to &#8220;<strong>learn the ropes</strong>&#8221;.</p><p>2. <strong>Consulting is a good way to pivot your career</strong></p><p>The transition from a quant finance researcher to a data scientist was not an easy one despite the fact that there is a lot of overlap in terms of skillset; a lot of data science recruiters are unfamiliar with other fields (in my case the quant finance industry) and often don&#8217;t know how to correctly evaluate non-traditional backgrounds. Consulting hires from <strong>more diverse backgrounds</strong> than typical industry jobs, so it&#8217;s the perfect stepping stone and pivot opportunity if you want to make a career shift into data science.</p><p>Having a consulting background is definitely a plus if you are later applying for industry jobs; partially because of the prestige of most consulting firms and partially because of the skills and experience you get in a consultant role.</p><p>3. <strong>You learn to be agile, learn quickly and be a full-stack data scientist</strong></p><p>In consulting, every project will be different and every new engagement is like getting a new job. As a result, you learn to be extremely agile and adapt to different working situations, managers, teammates and stakeholders quickly. Some clients use Python and some prefer R; some clients&#8217; data is in databases and can be extracted through SQL, some have data only in CSVs or even PDFs. As a data science consultant, you often need to <strong>find creative ways to solve the seemingly impossible</strong> and <strong>up-skill yourself</strong> through learning on the job.</p><p>Because every project&#8217;s setup and team varies, as a data scientist, sometimes (if you are lucky) you get staffed with a whole team of data engineers; sometimes you have to BE the data engineer and try to process client&#8217;s &#8220;perfect&#8221; data from PDFs. Because of this, most data science consultants are <strong>&#8220;full-stack&#8221; data scientists</strong> who can work on the whole end-to-end process of data science projects covering data engineering, model building, all the way to tool building and insights generation.</p><p>These skills are extremely valuable for companies these days, especially for startups or companies that are just starting to build up their data science teams. Additionally, it was the perfect way for me to test out which <strong>career path in the data world</strong> I was actually interested in and wanted to specialize in (if you are not familiar with different data careers, read my previous article <a href="https://towardsdatascience.com/how-to-pick-the-right-career-in-the-data-world-1cec8a084767">here</a>).</p><p>4. <strong>Exit options are great and seeking them is even encouraged</strong></p><p>Unlike most industry companies, which try to avoid churn like the plague, consulting companies aren&#8217;t too worried about their employees leaving. It might be obvious why this is the case&#8202;&#8212;&#8202;when consulting alumni leave to join industry companies, they become potential clients for the consulting company.</p><p><em>Every industry company hires consultants for something at some point; who better to vouch for you over your competitors than your own alumni?</em></p><p>Because consulting firms embrace the ultimate departure of the employee, they create a lot of channels like internal job boards, newsletter etc. for their alumni to share job opportunities outside of consulting.</p><p>Also most companies LOVE to hire ex-consultants because they have <strong>experience adapting and deploying data science solutions</strong> in different companies in various industries. And consultants are used to being thrown into the deep end and learn things from the scratch fast.</p><p>For me, consulting was a great way to figure out what I wanted to do next while keeping my options open, and once I had figured that out, the McKinsey brand, network and support (e.g. paid &#8220;search&#8221; time to find a new job) helped me line up my next opportunity.</p><p>5.<strong> The network you build is incredible</strong></p><p>Since a lot of people leave consulting eventually, there are a lot of consulting alumni in industry companies&#8217; management and leadership teams. So it&#8217;s likely that there&#8217;s already a McK alum working at your dream company when you are thinking about leaving consulting; and networking with alumni is always easier than cold-emailing.</p><p>What&#8217;s also common for ex-consultants is to eventually start their own companies, building on their experiences across various industries and opportunities they discovered along the way. If your dream is to eventually build a company of your own, it&#8217;s also likely to find like-minded co-founders in consulting (or you can find and join an early-stage venture that matches your interests through the McKinsey network).</p><p><strong>Conclusion:</strong></p><p>I think you should absolutely join consulting (McK or other companies) as a data scientist if any of the following applies to you:</p><ul><li><p>You don&#8217;t know yet what kind of data you are passionate about as a data scientist</p></li><li><p>You don&#8217;t know which industry is right for you</p></li><li><p>You are hoping to pivot your career into data science from another field</p></li><li><p>You want to learn the tried and true best practices for data science (notice best practice doesn&#8217;t necessarily mean cutting edge; in fact, it&#8217;s almost always the contrary)</p></li></ul><p>Consulting might NOT be for you if:</p><ul><li><p>You want to go deep and specialize in a certain area of data science (e.g. you know want to ONLY focus on building ML models and be an expert in the area); this is possible in consulting, but might be easier to achieve in an industry role</p></li><li><p>You want to work on cutting edge methodologies in data science (I will explain why it&#8217;s hard for you to achieve this in consulting in my next article where I explain why I left McKinsey)</p></li></ul><p><em><strong>Don&#8217;t know what to read next? I might have some recommendations for you:</strong></em></p><p><strong><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">Productivity Tips for Data Scientists</a></strong><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27"><br></a><em><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">How to work better, smarter but not necessarily harder as a data scientist</a></em><a href="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27" title="https://towardsdatascience.com/productivity-tips-for-data-scientists-eb66242fde27">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2">Top Qualities Hiring Managers Look For In Data Scientist Candidates</a></strong><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2"><br></a><em><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2">Some of these are arguably more important than writing efficient code</a></em><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2">towardsdatascience.com</a></p><p><strong><a href="https://medium.com/geekculture/avoid-these-five-behaviors-that-make-you-look-like-a-data-novice-40f01158ae00" title="https://medium.com/geekculture/avoid-these-five-behaviors-that-make-you-look-like-a-data-novice-40f01158ae00">Avoid These Five Behaviors That Make You Look Like A Data Novice</a></strong><a href="https://medium.com/geekculture/avoid-these-five-behaviors-that-make-you-look-like-a-data-novice-40f01158ae00" title="https://medium.com/geekculture/avoid-these-five-behaviors-that-make-you-look-like-a-data-novice-40f01158ae00"><br></a><em><a href="https://medium.com/geekculture/avoid-these-five-behaviors-that-make-you-look-like-a-data-novice-40f01158ae00" title="https://medium.com/geekculture/avoid-these-five-behaviors-that-make-you-look-like-a-data-novice-40f01158ae00">And be a trustworthy, likable data partner</a></em><a href="https://medium.com/geekculture/avoid-these-five-behaviors-that-make-you-look-like-a-data-novice-40f01158ae00" title="https://medium.com/geekculture/avoid-these-five-behaviors-that-make-you-look-like-a-data-novice-40f01158ae00">medium.com</a></p>]]></content:encoded></item><item><title><![CDATA[Productivity Tips for Data Scientists]]></title><description><![CDATA[How to work better, smarter but not necessarily harder as a data scientist]]></description><link>https://www.divingintodata.com/p/productivity-tips-for-data-scientists-eb66242fde27</link><guid isPermaLink="false">https://www.divingintodata.com/p/productivity-tips-for-data-scientists-eb66242fde27</guid><dc:creator><![CDATA[Tessa Xie]]></dc:creator><pubDate>Wed, 26 Jan 2022 09:15:02 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/573772ca-b8e3-4f86-8fcb-9091a225fcb2_800x533.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5-Lv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5-Lv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5-Lv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5-Lv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5-Lv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5-Lv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5-Lv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5-Lv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5-Lv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5-Lv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F580d85c4-c390-4c16-9ccb-6f315aa43bf9_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@schmaendels?utm_source=medium&amp;utm_medium=referral">Andreas Klassen</a> on&nbsp;<a href="https://unsplash.com?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><h4>How to work better, smarter but not necessarily harder as a data scientist</h4><p>I have always been a firm believer of &#8220;work smarter not harder&#8221; and working in fast-paced companies in the past several years reinforced this belief. Based on my observations as a data scientist, there seems to be a permanent <strong>discrepancy</strong> between how &#8220;slow&#8221; analytics (or engineering) work goes and how fast businesses need solutions.</p><p>Even though this tension is mainly due to the fact that highly technical work has the level of <strong>difficulty</strong> and needs the level of <strong>precision</strong> that takes time, it can partially be alleviated through proper stakeholder management (which I have covered in several other articles) and by making yourself a more effective and efficient data scientist. There are several lessons I learned in the past several years that I hope can help you make your work as a data scientist &#8220;smarter&#8221;.</p><h4><strong>Help others to help&nbsp;you</strong></h4><p>If you have ever gotten questions along the lines of &#8220;how many XX (transactions, trips, etc.) did we have last month?&#8221; (what I call &#8220;data Siri questions&#8221;), you know how much time you spend on those seemingly small asks and how frustrating they are.</p><p>The best way to NOT let these small asks drag down your productivity (and mood) is to up-skill your stakeholders so they can be self-sufficient to some extent.</p><p>Most stakeholders I have worked with are more than happy to learn the basics about analytics in a weekly or monthly office hour or training session the data team hosts, because nobody loves to rely on other people all the time to get their jobs done.</p><h4><strong>Be that &#8220;annoying&#8221; data scientist and ask &#8220;where does the data&nbsp;go&#8221;</strong></h4><p>Time and time again I observe the data team scrambling to get data when people ask &#8220;can we track XXX (related to a product or process recently launched)?&#8221; and the data team realizes the data is <strong>not being tracked correctly</strong> or <strong>at all</strong>.</p><p>My solution to avoid this kind of situation is to &#8220;butt in&#8221; the conversations early. When you are in a meeting where people talk about building a new product and/or process, if you know you will eventually be responsible for analyzing the data (that&#8217;s probably why you are in the meeting as a data scientist), ask &#8220;dumb&#8221; questions as early as possible &#8220;where will the data be stored?&#8221;, &#8220;have we decided the schemas of the tables yet?&#8221;&#8230;</p><p>I used to assume someone must have already thought of those questions. Don&#8217;t make those assumptions. You will be surprised by how often people forget about the data aspect of things, even in companies that pride themselves for being data-driven.</p><p>And trust me, people will appreciate you for asking those &#8220;dumb&#8221; questions early on as it will avoid bottlenecks down the road.</p><h4><strong>Pay attention to &#8220;irrelevant conversations&#8221;</strong></h4><p>Being able to <strong>absorb information</strong> about things that don&#8217;t seems to fall directly in your immediate scope is an <strong>underrated ability</strong> in my opinion. Time and time again I realize that conversations I read in Slack threads and group chats that didn&#8217;t seem to concern me at the moment contain useful information for my future work.</p><p>It&#8217;s really not surprising if you think about it. Every piece of work that happens in the company is intertwined and connected because at the end of the day, everyone is working towards the same goal. It&#8217;s only a matter of time before things that are not relevant for you become relevant; and when they do, you have some basic context and know what&#8217;s already been done or at least where to start looking for information.</p><p>So be curious and inquisitive of things others are working on and don&#8217;t be tunnel visioned by your scope. The future you will thank the past you for catching certain pieces of &#8220;irrelevant&#8221; information and saving a huge amount of time because of it.</p><h4><strong>Be a sponge and a dot connector</strong></h4><p>This one ties back directly to the one above because you can only connect the dots if you paid attention to things that are happening around you.</p><p>Being able to <strong>absorb information</strong> and <strong>connect the dots</strong> is one of the most valuable abilities I look for in people I hire. I can&#8217;t even start to describe how much I appreciate team members who can tell me &#8220;XX mentioned that he/she did YY that I think is very similar to this new ask&#8221;.</p><p>Like I mentioned in my <a href="https://towardsdatascience.com/how-to-deal-with-frustrating-stakeholder-situations-as-a-data-scientist-92d48e2c32f7">previous post</a>, when it comes to high impact projects, it&#8217;s likely that someone has thought about and done something similar in the past; so being able to NOT reinvent the wheel but build upon previous work can save a lot of time and create a ton of synergy. And if another team is currently actively working on a similar project, you can either join forces or simply reprioritize and work on something else while they solve the problem for you (assuming that the timeline and output align with your needs).</p><h4><strong>Batch your work and utilize productivity-hacking</strong></h4><p>Unless you have an extremely micromanaging manager, you usually have some control over how you prioritize your work. The best tricks for being more productive that I have learned over the years are targeting low hanging fruits at the right time and decreasing switching cost by batching similar work.</p><p>Even if you successfully up-skill all of your stakeholders, you will inevitably get some &#8220;data Siri&#8221; questions that shouldn&#8217;t take much of your brain space, so use those to fill the hours when you feel like your brain is fried and you can&#8217;t really concentrate, or when you are in a boring meeting and you can multi-task.</p><p>Batching your work is another way to improve productivity because switching back and forth between different topics can be distracting and result in inefficiency and slow progress. So I usually group similar tasks together and knock as many of them out as possible in one sitting. Schedule all of your meetings back to back and reserve several hours afterwards for headphones-on, heads-down, productive coding time. Your brain will thank you for not pulling it in all directions in a short span of time.</p><h4><strong>Never dive into a problem without taking a step back first&#8202;&#8212;&#8202;question everything</strong></h4><p>Don&#8217;t let others&#8217; approach for a problem be the &#8220;box&#8221; for your thinking process. Think outside of the box. What I mean by this is if your stakeholder has an analytics request that they have some ideas about how you can approach it, that&#8217;s great, hear them out; but don&#8217;t let that be the Bible for how you MUST approach it.</p><p>As a data scientist, you should have the data expertise in the room and can help people decide what the <strong>most efficient</strong> way to approach a problem is from a data perspective. Don&#8217;t be afraid of challenging the approach and making suggestions.</p><p><em><strong>Key Takeaways:</strong></em></p><p>There are several ways to make you more productive and more effective as a data scientist:</p><ul><li><p><strong>Up-skill</strong> your stakeholders to lighten your ad-hoc requests load</p></li><li><p>Be more <strong>proactive</strong> in data-related conversations</p></li><li><p>Take more <strong>initiative</strong> in /<strong>pay more attention</strong> to conversations that are seemingly not in your scope</p></li><li><p>Learn to <strong>connect the dots</strong></p></li><li><p><strong>Prioritize </strong>by <strong>batching</strong> your work and <strong>breaking down</strong> big tasks</p></li><li><p><strong>Don&#8217;t take anything for granted</strong>, always look for more efficient approaches</p></li></ul><p><strong>Enjoyed the article? Subscribe to my email list so you won&#8217;t miss an article in the future! Don&#8217;t know what to read next? Here are some suggestions:</strong></p><p><strong><a href="https://towardsdatascience.com/how-to-deal-with-frustrating-stakeholder-situations-as-a-data-scientist-92d48e2c32f7" title="https://towardsdatascience.com/how-to-deal-with-frustrating-stakeholder-situations-as-a-data-scientist-92d48e2c32f7">How to Deal With Frustrating Stakeholder Situations as a Data Scientist</a></strong><a href="https://towardsdatascience.com/how-to-deal-with-frustrating-stakeholder-situations-as-a-data-scientist-92d48e2c32f7" title="https://towardsdatascience.com/how-to-deal-with-frustrating-stakeholder-situations-as-a-data-scientist-92d48e2c32f7"><br></a><em><a href="https://towardsdatascience.com/how-to-deal-with-frustrating-stakeholder-situations-as-a-data-scientist-92d48e2c32f7" title="https://towardsdatascience.com/how-to-deal-with-frustrating-stakeholder-situations-as-a-data-scientist-92d48e2c32f7">And turn them into opportunities</a></em><a href="https://towardsdatascience.com/how-to-deal-with-frustrating-stakeholder-situations-as-a-data-scientist-92d48e2c32f7" title="https://towardsdatascience.com/how-to-deal-with-frustrating-stakeholder-situations-as-a-data-scientist-92d48e2c32f7">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2">Top Qualities Hiring Managers Look For In Data Scientist Candidates</a></strong><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2"><br></a><em><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2">Some of these are arguably more important than writing efficient code</a></em><a href="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2" title="https://towardsdatascience.com/top-qualities-hiring-managers-look-for-in-data-scientist-candidates-2e2cd52444c2">towardsdatascience.com</a></p><p><strong><a href="https://towardsdatascience.com/should-you-join-a-big-corporation-or-a-small-startup-as-a-data-scientist-1b7f4d83f5c6" title="https://towardsdatascience.com/should-you-join-a-big-corporation-or-a-small-startup-as-a-data-scientist-1b7f4d83f5c6">Should You Join a Big Corporation or a Small Startup As a Data Scientist?</a></strong><a href="https://towardsdatascience.com/should-you-join-a-big-corporation-or-a-small-startup-as-a-data-scientist-1b7f4d83f5c6" title="https://towardsdatascience.com/should-you-join-a-big-corporation-or-a-small-startup-as-a-data-scientist-1b7f4d83f5c6"><br></a><em><a href="https://towardsdatascience.com/should-you-join-a-big-corporation-or-a-small-startup-as-a-data-scientist-1b7f4d83f5c6" title="https://towardsdatascience.com/should-you-join-a-big-corporation-or-a-small-startup-as-a-data-scientist-1b7f4d83f5c6">How do you know which one is for you and what to expect from each</a></em><a href="https://towardsdatascience.com/should-you-join-a-big-corporation-or-a-small-startup-as-a-data-scientist-1b7f4d83f5c6" title="https://towardsdatascience.com/should-you-join-a-big-corporation-or-a-small-startup-as-a-data-scientist-1b7f4d83f5c6">towardsdatascience.com</a></p>]]></content:encoded></item></channel></rss>