{"id":4523,"date":"2021-06-28T20:15:41","date_gmt":"2021-06-28T20:15:41","guid":{"rendered":"https:\/\/www.interventure.info\/?p=4523"},"modified":"2021-07-09T10:15:15","modified_gmt":"2021-07-09T10:15:15","slug":"meet-data-heroes-all-in-one-place","status":"publish","type":"post","link":"https:\/\/www.interventure.info\/blog\/meet-data-heroes-all-in-one-place\/","title":{"rendered":"Meet data heroes all in one place"},"content":{"rendered":"<div class=\"wrapper-text\">\n<p>There is nothing worse than asking a data engineer to build you some fancy report\/dashboard. If you are lucky, he would just ignore you, otherwise he might slap you right in the face. So, be careful.\u00a0Instead of living in fear around data heroes, make some effort to learn key differences between positions born and raised thanks to the sexiest job of the century. Yes, we are talking about Data Science.\u00a0\u00a0<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1400\" height=\"933\" src=\"https:\/\/www.interventure.info\/wp-content\/uploads\/2021\/06\/Tamara-Ciric-InterVenture.jpg\" alt=\"\" class=\"wp-image-4556\"\/><\/figure>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Nowadays, there are plenty of positions and roles that are somehow connected to data. Start with data engineers, business analysts, data specialist, researcher scientist etc it makes so hard to differentiate role\u2019s scope between those. Still, there is a high percentage of overlapping, but in the core all data positions are inherited from three major positions:&nbsp;&nbsp;<\/p>\n<\/div>\n\n\n<ul class=\"wp-block-list\"><li>Data Engineer&nbsp;<\/li><li>Data Analyst&nbsp;&nbsp;<\/li><li>Data Scientist&nbsp;<\/li><\/ul>\n\n\n<div class=\"wrapper-text\">\n<p>The purpose of this blog is to introduce these three professions to our audience and, in case you have a lot of data and don\u2019t know how to use it&nbsp;to&nbsp;make some fun or even money, read carefully till the end and we assure you, at least you will catch who is the right person around to ask for a little help.&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>But wait\u2026. We are missing something at&nbsp;beginning&nbsp;of this journey\u2026YES, here we go!&nbsp; Firstly, let\u2019s understand what Data Science is indeed!&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>According to&nbsp;<em>Wikipedia<\/em>, Data Science<em>&nbsp;<\/em>is<em>&nbsp;an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data<\/em><em><sup>&nbsp;<\/sup><\/em><em>and apply knowledge and actionable insights from data across a broad range of application domains. Data science is related to data mining, machine learning and big data<\/em>.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>(Source:&nbsp; <a href=\"https:\/\/en.wikipedia.org\/wiki\/Data_science\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/en.wikipedia.org\/wiki\/Data_science<\/a>)&nbsp;<br>&nbsp;<br>Hmm, how convenient explanation with so many unnecessary terms and words. So, data science&nbsp;is a broad field of study pertaining to data systems and processes, aimed at maintaining data sets and deriving meaning out of them.&nbsp;&nbsp;<br>Let&#8217;s&nbsp;translate&nbsp;an earlier&nbsp;definition through image:&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large is-resized\"><img decoding=\"async\" src=\"https:\/\/www.interventure.info\/wp-content\/uploads\/2021\/06\/1.png\" alt=\"\" class=\"wp-image-4525\" width=\"1208\" height=\"930\"\/><figcaption>Data Science required skill set&nbsp;<\/figcaption><\/figure><\/div>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>To become a part of the famous data science world you must master at least one of those skills and be familiar and comfortable with others.&nbsp;First, machine learning (ML) and data science (DS) are fascinating fields. Mostly because they sit at the crossroad of computer science, mathematics and business understanding.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>This means that there is way more room for personal growth.&nbsp;Another important factor is that the field is moving at lightning speed.&nbsp;Not a day goes by without hearing from the latest breakthrough, the newest shiny deep learning architecture, this great new book that every&nbsp;DS&nbsp;practitioner should read, etc.&nbsp;Then there are all the other reasons like you can make good money, you can make a big impact in your company, AI is the future&nbsp;and the rest of countless reasons why. It is important to&nbsp;emphasize&nbsp;there is&nbsp;a lot of overlapping&nbsp;skills&nbsp;for each position&nbsp;related to Data Science.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Don\u2019t get confused reading blogs where&nbsp;the terms Data Science, Artificial Intelligence (AI) and Machine learning fall in the same domain. They are connected to each other, but they have their specific applications and meaning.&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large is-resized\"><img decoding=\"async\" src=\"https:\/\/www.interventure.info\/wp-content\/uploads\/2021\/06\/2.png\" alt=\"\" class=\"wp-image-4527\" width=\"1208\" height=\"880\"\/><figcaption>Relationship between AI, ML and DS<\/figcaption><\/figure><\/div>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Describing this relationship requires whole new approach so we will save it as a topic for some of the following blogs in the future.&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Finally,&nbsp;it\u2019s time for our heroes to shine.&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>First star of our story will&nbsp;be a&nbsp;<strong>Data Engineer<\/strong>.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>He&nbsp;is supposed to have the following responsibilities:&nbsp;<\/p>\n<\/div>\n\n\n<ul class=\"wp-block-list\"><li>Development, construction, and maintenance of data architectures.&nbsp;<\/li><li>Conducting testing on large scale data platforms.&nbsp;<\/li><li>Handling error logs and building robust data pipelines.&nbsp;<\/li><li>Ability to handle raw and unstructured data.&nbsp;<\/li><li>Provide recommendations for data improvement, quality, and efficiency of data.&nbsp;<\/li><li>Ensure and support the data architecture utilised by data scientists and analysts.&nbsp;<\/li><li>Development of data processes for data modelling, mining, and data production.&nbsp;<\/li><\/ul>\n\n\n<div class=\"wrapper-text\">\n<p>Let\u2019s imagine that you have a couple of devices and each with thousands of notes, images, files&nbsp;etc.&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>And you want to make&nbsp;your&nbsp;personal scrapbook. You&nbsp;have to&nbsp;collect all your data and place it together somewhere on same device, then you&nbsp;must&nbsp;find a common tool to integrate all data on the same place (let\u2019s say you chose famous Word where you can add text, links, images all in one place) and start making your lifetime scrapbook. Congratulations, you have just&nbsp;become&nbsp;Data Engineer.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Look, is seems not so hard, just find all data sources and integrate them into same place and then let other to use it wisely. And, friendly advice,&nbsp;try to automatise it&nbsp;somehow, you&nbsp;don\u2019t&nbsp;want to spend couple of hours&nbsp;every day&nbsp;on same repetitive task.&nbsp;So,&nbsp;make some effort and&nbsp;build&nbsp;a good infrastructure for it. Make sure that you have necessary programming skills&nbsp;and&nbsp;have a good grasp of&nbsp;critical thinking. Yes, you need to think out of the box.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Following are the key skills required to become a data engineer:&nbsp;<\/p>\n<\/div>\n\n\n<ul class=\"wp-block-list\"><li>Knowledge of programming tools like Python and Java.&nbsp;<\/li><li>Solid Understanding of Operating Systems.&nbsp;<\/li><li>Ability to develop scalable ETL packages.&nbsp;<\/li><li>Should be well&nbsp;proficient&nbsp;in SQL as well as NoSQL technologies like Cassandra and MongoDB.&nbsp;<\/li><li>He should possess knowledge of data warehouse and big data technologies like Hadoop, Hive, Pig, and Spark.&nbsp;<\/li><li>Should possess creative and out of the box thinking.&nbsp;<\/li><\/ul>\n\n\n<div class=\"wrapper-text\">\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img decoding=\"async\" width=\"703\" height=\"352\" src=\"https:\/\/www.interventure.info\/wp-content\/uploads\/2021\/06\/3.png\" alt=\"\" class=\"wp-image-4529\"\/><figcaption>Data Engineer Road&nbsp;Map&nbsp;<\/figcaption><\/figure><\/div>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Great, we\u2019ve&nbsp;introduced&nbsp;Data&nbsp;Engineer,&nbsp;and he took care to make data available for further manipulation. And what&nbsp;is next?&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>So,&nbsp;the&nbsp;logical next step would be a studious data analysis. Why?&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Because it\u2019s not rare for data engineers to&nbsp;lose&nbsp;same data in process of extracting, transforming and loading data.&nbsp;Besides&nbsp;that, it is&nbsp;often&nbsp;to have low quality data and data engineer isn\u2019t skilled to explore it because&nbsp;that requires some&nbsp;statistical and analytical skills&nbsp;and&nbsp;very&nbsp;often data engineer just don\u2019t have enough time for it.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Now, it\u2019s time to&nbsp;our second star&nbsp;to shine:&nbsp;<strong>Data Analyst<\/strong>.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Okay, Data Analyst, here we go!&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Following are the main responsibilities of a&nbsp;<strong>Data Analyst<\/strong>:&nbsp;<\/p>\n<\/div>\n\n\n<ul class=\"wp-block-list\"><li>Analysing the data through descriptive statistics.&nbsp;<\/li><li>Using database query languages to retrieve and manipulate information.&nbsp;<\/li><li>Perform data filtering, cleaning and&nbsp;early-stage&nbsp;transformation.&nbsp;<\/li><li>Communicating results with the team using data visualization.&nbsp;<\/li><\/ul>\n\n\n<div class=\"wrapper-text\">\n<p>Data Analyst is needed to understand data quality and propose ways and procedures to increase data quality in the engineering process. Also, they are key players in decision making due to their skill to translate data tables into common language represented with all fancy reports and dashboards. No one understands data until it\u2019s transformed into clear visual (graphical) representation. To be efficient and effective data analyst, you must have a good grasp of business understanding and strong communication skills because you&nbsp;have to&nbsp;clearly communicate data to third parties, usually stakeholders, and make sure to obtain all necessary insights for further business decisions. You don\u2019t want to let down people who give you such freedom and their sincere trust. Be aware how many responsibilities it takes and respect that.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img decoding=\"async\" width=\"1208\" height=\"1218\" src=\"https:\/\/www.interventure.info\/wp-content\/uploads\/2021\/06\/4.png\" alt=\"\" class=\"wp-image-4531\"\/><figcaption>Data Analyst as a designer&nbsp;<\/figcaption><\/figure><\/div>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>So here comes the hero, the analyst, to filter, direct, and translate your data into actionable insight.&nbsp;To become a&nbsp;<strong>Data Analyst<\/strong>, you must possess the following skills:&nbsp;<\/p>\n<\/div>\n\n\n<ul class=\"wp-block-list\"><li>Should&nbsp;have&nbsp;the strong mathematical aptitude&nbsp;<\/li><li>Should be well&nbsp;proficient&nbsp;with Excel, SQL and at least one visualization tool such as Google Data Studio.&nbsp;<\/li><li>Possession of problem-solving attitude.&nbsp;<\/li><li>Proficient in the communication of results to the team.&nbsp;<\/li><li>Should have a strong suite of analytical skills.&nbsp;<\/li><\/ul>\n\n\n<div class=\"wrapper-text\">\n<p>Ladies and Gentlemen,&nbsp;there&nbsp;is&nbsp;still&nbsp;our last star and maybe the brightest one,&nbsp;that&#8217;s&nbsp;a&nbsp;reason why this&nbsp;person&nbsp;is&nbsp;widely known as a unicorn in the world of data science. With such a pleasure, we\u2019 re introducing you a&nbsp;<strong>Data Scientist&nbsp;<\/strong>role.&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>He&nbsp;does&nbsp;everything what data analyst&nbsp;does but&nbsp;have a little bit more to offer on his plate.&nbsp;He is, sorry&nbsp;analysts,&nbsp;don\u2019t&nbsp;be offended, but&nbsp;some kind of upgraded&nbsp;version of Data Analyst.&nbsp;<br>Data scientist is supposed to know a little of everything&nbsp;and this guy is often called, in a funny way, as Jack of all trades, Master of none.&nbsp;But&nbsp;we&nbsp;have to&nbsp;disagree on this statement. We&nbsp;truly believe&nbsp;that data scientist is someone who could predict the future&nbsp;a quite&nbsp;accurately. In a business matter, of course, and for the top management&nbsp;he is&nbsp;a&nbsp;wizard. Can you imagine&nbsp;having&nbsp;that superb title? Fancy, isn&#8217;t it?&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Long story short, this&nbsp;fellow&nbsp;is supposed to mingle some statistical, mathematical,&nbsp;programming&nbsp;and communicational skills. Also,&nbsp;it&nbsp;is strongly required to&nbsp;have ability of critical thinking and&nbsp;strong&nbsp;sense&nbsp;of business&nbsp;domain.&nbsp;Don\u2019t&nbsp;get confused,&nbsp;all of&nbsp;these skills are also&nbsp;must-have&nbsp;for&nbsp;data&nbsp;analysts&nbsp;but not so&nbsp;imperative&nbsp;as for scientists.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>A&nbsp;<strong>Data Scientist<\/strong>&nbsp;is required to perform responsibilities:&nbsp;<\/p>\n<\/div>\n\n\n<ul class=\"wp-block-list\"><li>Performing data&nbsp;pre-processing&nbsp;that involves data transformation as well as data cleaning.&nbsp;<\/li><li>Using various&nbsp;machine&nbsp;learning tools to forecast and classify patterns in the data.&nbsp;<\/li><li>Increasing the performance and accuracy of machine learning algorithms through&nbsp;adjustment&nbsp;and further performance optimization.&nbsp;<\/li><li>Understanding the requirements of the company and formulating questions that need to be addressed.&nbsp;<\/li><li>Using robust storytelling tools to communicate results with the team members.&nbsp;<\/li><\/ul>\n\n\n<div class=\"wrapper-text\">\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1208\" height=\"1248\" src=\"https:\/\/www.interventure.info\/wp-content\/uploads\/2021\/06\/5.png\" alt=\"\" class=\"wp-image-4533\"\/><figcaption>Data Scientist Skill Sets<\/figcaption><\/figure>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>To become this lovely&nbsp;fellow&nbsp;from the&nbsp;picture&nbsp;above, you only need to master those&nbsp;key skills:&nbsp;<\/p>\n<\/div>\n\n\n<ul class=\"wp-block-list\"><li>Should be proficient with Math and Statistics.&nbsp;<\/li><li>Should be able to handle structured &amp; unstructured information.&nbsp;<\/li><li>In-depth knowledge of tools like R, Python and&nbsp;SAS.&nbsp;<\/li><li>Well&nbsp;competent&nbsp;in various machine learning algorithms.&nbsp;<\/li><li>Have knowledge of SQL and NoSQL.&nbsp;<\/li><\/ul>\n\n\n\n<ul class=\"wp-block-list\"><li>Must be familiar with Big Data tools&nbsp;such as Spark, Hadoop, Kafka etc.&nbsp;<\/li><\/ul>\n\n\n<div class=\"wrapper-text\">\n<p>Yes,&nbsp;after all those&nbsp;introductions,&nbsp;we&nbsp;conclude&nbsp;and feel the same&nbsp;way, you&nbsp;could hire&nbsp;just a&nbsp;data scientist and he will do all&nbsp;job&nbsp;around data. But remember to give him at least 3 salaries instead of&nbsp;one and&nbsp;be sure&nbsp;that your scientist&nbsp;will never become a magic wizard of the company because he&nbsp;does not&nbsp;have time to shine due to all&nbsp;tasks that should be taken by some other&nbsp;data position(s).&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>Finally, let\u2019s put it all together as an overview of skill sets and responsibilities required for each described position in this blog:&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img decoding=\"async\" width=\"1068\" height=\"616\" src=\"https:\/\/www.interventure.info\/wp-content\/uploads\/2021\/06\/6-1.png\" alt=\"\" class=\"wp-image-4540\"\/><figcaption>Skill Sets&nbsp;for&nbsp;Data&nbsp;Analyst, Engineer and Scientist&nbsp;<\/figcaption><\/figure><\/div>\n<\/div>\n\n<div class=\"wrapper-text\">\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img decoding=\"async\" width=\"1064\" height=\"616\" src=\"https:\/\/www.interventure.info\/wp-content\/uploads\/2021\/06\/7.png\" alt=\"\" class=\"wp-image-4537\"\/><figcaption>The roles and responsibilities&nbsp;of Data Analyst, Engineer and Scientist<\/figcaption><\/figure><\/div>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p>We hope you\u2019ve enjoyed reading this blog and finally understand the difference in role\u2019s scope between this data geeks (heroes) of our story.&nbsp;&nbsp;<\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p><em>This blog post is written by our colleague <a href=\"https:\/\/www.linkedin.com\/in\/tamara-ciric-5575b7123\" target=\"_blank\" rel=\"noreferrer noopener\">Tamara \u0106iri\u0107<\/a>, Data Analyst at <a href=\"https:\/\/www.interventure.info\/partners\/flaschenpost\/\">Flaschenpost<\/a>.<\/em><\/p>\n<\/div>\n\n<div class=\"wrapper-text\">\n<p><em>Also, if you are looking for new job opportunities, take a look at our <a href=\"https:\/\/interventure.teamtailor.com\/jobs?utm_source=website&amp;utm_medium=blogpost&amp;utm_campaign=jobs\" target=\"_blank\" rel=\"noreferrer noopener\">open positions<\/a>. <br>One of them could be just for you.<\/em><\/p>\n<\/div>","protected":false},"excerpt":{"rendered":"<p>There is nothing worse than asking a data engineer to build you some fancy report\/dashboard. If you are lucky, he would just ignore you, otherwise he might slap you right in the face. So, be careful.\u00a0Instead of living in fear around data heroes, make some effort to learn key differences between positions born and raised [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":4556,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[12],"tags":[],"class_list":["post-4523","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-engineering"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.interventure.info\/de\/wp-json\/wp\/v2\/posts\/4523","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.interventure.info\/de\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.interventure.info\/de\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.interventure.info\/de\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.interventure.info\/de\/wp-json\/wp\/v2\/comments?post=4523"}],"version-history":[{"count":0,"href":"https:\/\/www.interventure.info\/de\/wp-json\/wp\/v2\/posts\/4523\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.interventure.info\/de\/wp-json\/wp\/v2\/media\/4556"}],"wp:attachment":[{"href":"https:\/\/www.interventure.info\/de\/wp-json\/wp\/v2\/media?parent=4523"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.interventure.info\/de\/wp-json\/wp\/v2\/categories?post=4523"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.interventure.info\/de\/wp-json\/wp\/v2\/tags?post=4523"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}