{"id":25552006,"date":"2023-01-27T15:52:22","date_gmt":"2023-01-27T10:22:22","guid":{"rendered":"https:\/\/entri.app\/blog\/?p=25552006"},"modified":"2023-05-15T11:32:29","modified_gmt":"2023-05-15T06:02:29","slug":"exploratory-data-analysis-in-machine-learning","status":"publish","type":"post","link":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/","title":{"rendered":"Exploratory Data Analysis in Machine Learning &#8211; EDA Steps, Importance"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_79_2 counter-hierarchy ez-toc-counter ez-toc-custom ez-toc-container-direction\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<label for=\"ez-toc-cssicon-toggle-item-69e0feb922212\" class=\"ez-toc-cssicon-toggle-label\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-69e0feb922212\"  aria-label=\"Toggle\" \/><nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" 
href=\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#_Why_Exploratory_Data_Analysis_in_Machine_Learning\" >\u00a0Why Exploratory Data Analysis in Machine Learning<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#Benefits_of_EDA_in_Machine_Learning\" >Benefits of EDA in Machine Learning<\/a><\/li><\/ul><\/nav><\/div>\n<p>Exploratory data analysis (EDA) is a core part of working with data. Data scientists and analysts often spend much of their time on EDA and data wrangling. Before you analyze data or pass it to a machine learning algorithm, you must clean it and make sure it is in a usable state. It is also important to be aware of any persistent trends and strong correlations in your data. Exploratory data analysis is the process of getting to know your data in depth. This article covers the importance of carrying out EDA, its advantages, the steps involved, and the role of data visualization in exploratory data analysis.<\/p>\n<p style=\"text-align: center;\"><strong><a href=\"https:\/\/entri.app\/course\/data-science-and-machine-learning-course\/\" target=\"_blank\" rel=\"noopener\">Looking for a Data Science and Machine Learning Career? Explore Here!<\/a><\/strong><\/p>\n<p>So, let\u2019s dive in\u2026<\/p>\n<h2><span class=\"ez-toc-section\" id=\"_Why_Exploratory_Data_Analysis_in_Machine_Learning\"><\/span><strong>\u00a0<\/strong><strong>Why Exploratory Data Analysis in Machine Learning <\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>In data exploration, often called exploratory data analysis (EDA), users examine and make sense of their data using statistical and graphical techniques. 
This process helps you spot trends and problems in the dataset and choose a model or method for the later steps. EDA&#8217;s primary goals are to find errors and outliers in the data and to recognize patterns. It lets analysts understand the data more thoroughly before making any assumptions. The outcomes of EDA help firms understand their customers, grow their business, and make informed decisions.<\/p>\n<p><strong><div class=\"lead-gen-block\"><a href=\"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/05\/1_merged-3_compressed.pdf\" data-url=\"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/05\/1_merged-3_compressed.pdf\" class=\"lead-pdf-download\" data-id=\"25556853\"><\/strong><\/p>\n<p style=\"text-align: center;\"><button class=\"btn btn-default\">Free SQL Tutorial for Beginners &#8211; Download PDF<\/button><\/p>\n<p><strong><\/a><\/div><\/strong><\/p>\n<p>These objectives define the role of exploratory data analysis. Once the data has been formatted, the analysis reveals patterns and trends that help in taking the measures needed to reach the business&#8217;s goals. A thorough EDA is expected to address the questions relevant to a given business decision, just as an executive is expected to carry out the responsibilities of a specific job role. Because data science involves building predictive models, the model needs to take the most informative aspects of the data into account. 
Much like following a good recipe, EDA surfaces the relevant patterns and trends so that the model can be trained to produce the desired results.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Benefits_of_EDA_in_Machine_Learning\"><\/span><strong>Benefits of EDA in Machine Learning<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<ul>\n<li>\n<h3><strong>Deep Understanding of Variables<\/strong><\/h3>\n<\/li>\n<\/ul>\n<p>Data analysts can use EDA to greatly improve their understanding of the variables in a dataset. It lets them extract summary information such as averages, minimum and maximum values, and other statistics needed to prepare the data properly.<\/p>\n<ul>\n<li>\n<h3><strong>Explores Trends and Patterns of Dataset<\/strong><\/h3>\n<\/li>\n<\/ul>\n<p>EDA can also help uncover patterns in a dataset. Looking for patterns matters because they can support estimation and prediction. This can help your business plan ahead and identify potential problems and solutions.<\/p>\n<ul>\n<li>\n<h3><strong>Better Decision Making<\/strong><\/h3>\n<\/li>\n<\/ul>\n<p>Perhaps the most important benefit of adopting EDA in a company is a better understanding of its data. With EDA, teams can use the tools at their disposal to gather key insights, draw conclusions, and base decisions on those insights.<\/p>\n<h3><strong>Important Steps involved in Exploratory Data Analysis<\/strong><\/h3>\n<ul>\n<li><strong>Dataset Observation:<\/strong> Exploratory data analysis begins with a high-level look at your dataset. Start by determining its size, including the number of rows and columns. This can help you anticipate potential problems with your data.<\/li>\n<li><strong>Missing Value Treatment:<\/strong> After this first look at your dataset, check for missing values. 
When you discover missing values, consider the potential causes of their absence. You might be able to use estimates to fill in some missing values if you can identify a trend in your data.<\/li>\n<li><strong>Value Categorization:<\/strong> Once missing values have been handled, you can classify your data to determine which statistical and visualization techniques will work best with your dataset. You can group your variables into the following types:<\/li>\n<\/ul>\n<p>&#8211;<u>Categorical variables:<\/u> take one of a fixed set of categories or labels.<\/p>\n<p>&#8211;<u>Continuous variables: <\/u>can take any numeric value within a range.<\/p>\n<p>&#8211;<u>Discrete variables: <\/u>take countable numeric values, such as whole-number counts.<\/p>\n<ul>\n<li><strong>Finding the Right Shape of the Dataset:<\/strong> This step matters because the shape of your dataset shows how the data is distributed. You can also observe characteristics such as skewness and gaps, which tell you more about the dataset, and use the shape to spot trends.<\/li>\n<li><strong>Identifying Correlations:<\/strong> As you get to know your dataset, you can start to identify relationships in it. Scatter plots make it easier to find relationships between variables. Note what you find and look for as many relationships as you can. As correlations emerge, you can start to form hypotheses about why particular variables are correlated, keeping in mind that correlation alone does not establish causation.<\/li>\n<li><strong>Finding Critical Outliers:<\/strong> Outliers are the values in your dataset that stand apart from the rest. They may be much higher or much lower than the other values. 
It&#8217;s important to spot outliers because they can distort a dataset&#8217;s mean and range, while having much less effect on the median and mode, and can change how a visual representation looks.<\/li>\n<li><strong>Visualizing the Results:<\/strong> Once the analysis is complete, examine the results carefully so that they are interpreted correctly. Trends in the distribution of the data and correlations between variables provide useful guidance for adjusting the data appropriately.<\/li>\n<\/ul>\n<h4><strong>EDA and Data Visualization<\/strong><\/h4>\n<p>Unlike statistical data exploration tools, which have clear goals and questions, data visualization does not always start from a defined question. It can be used simply to explore the data and see how it is structured. Effective exploratory data analysis combines visualization and statistical methods: once visualization has given you a feel for outliers, patterns, and other important features, statistical analysis can be used to clean and improve the data.<\/p>\n<p>Visualization represents data graphically. Tools such as graphs and charts make complicated relationships and structures in the data easier to understand. Almost any professional discipline can benefit from understanding its data better. Data visualization makes data easier to analyze and improves data exploration by taking advantage of our eyes&#8217; ability to quickly distinguish colors, shapes, and patterns.<\/p>\n<p style=\"text-align: center;\"><strong>\u00a0 <a href=\"https:\/\/entri.app\/course\/data-science-and-machine-learning-course\/\" target=\"_blank\" rel=\"noopener\">Enroll for Data Science and Machine Learning Course Now!<\/a><\/strong><\/p>\n<p><strong>End Note<\/strong><\/p>\n<p>Data experts can view and interpret data in a variety of ways. 
Data scientists and other data experts use exploratory data analysis to understand datasets before modeling them. Knowing how to conduct exploratory data analysis is helpful if your work involves data mining or analysis. In this article we discussed the importance of exploratory data analysis (EDA) in machine learning, the advantages of EDA, the important steps in EDA, and the benefits of visualizing the results.<\/p>\n<p>There is more to data exploration than this. The tools and techniques used at each step can vary with the dataset you are working on. It is the data analyst&#8217;s responsibility to choose a model that suits the dataset. To help you out, upcoming articles will explain in depth the different types of exploratory data analysis and how to perform them.<\/p>\n<h4><strong>Related Articles\u00a0<\/strong><\/h4>\n<table dir=\"ltr\" border=\"1\" cellspacing=\"0\" cellpadding=\"0\">\n<colgroup>\n<col width=\"375\" \/><\/colgroup>\n<tbody>\n<tr>\n<td data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;Best Data Science Skills for Data Science Career&quot;}\"><strong><a href=\"https:\/\/entri.app\/blog\/best-data-science-skills-for-data-science-career\/\" target=\"_blank\" rel=\"noopener\">Best Data Science Skills for Data Science Career<\/a><\/strong><\/td>\n<\/tr>\n<tr>\n<td data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;Understanding Machine Learning Basics - A Simple Guide&quot;}\"><strong><a href=\"https:\/\/entri.app\/blog\/understanding-machine-learning-basics-a-simple-guide\/\" target=\"_blank\" rel=\"noopener\">Understanding Machine Learning Basics &#8211; A Simple Guide<\/a><\/strong><\/td>\n<\/tr>\n<tr>\n<td data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;Importance of Data Preprocessing in Machine Learning &quot;}\"><strong><a href=\"https:\/\/entri.app\/blog\/importance-of-data-preprocessing-in-machine-learning\/\" target=\"_blank\" 
rel=\"noopener\">Importance of Data Preprocessing in Machine Learning<\/a><\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n","protected":false},"excerpt":{"rendered":"<p>Working with data includes exploratory data analysis in its entirety. Today&#8217;s data scientists and analysts devote the majority of their time to exploratory data analysis, or EDA, and data wrangling. 
You must clean your data and make sure it is in an appropriate state before you begin data analysis or subject it to a machine [&hellip;]<\/p>\n","protected":false},"author":119,"featured_media":25552009,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[802,1864,1841],"tags":[],"class_list":["post-25552006","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-articles","category-data-science-ml","category-entri-skilling"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Exploratory Data Analysis in Machine Learning - Entri Blog<\/title>\n<meta name=\"description\" content=\"Exploratory Data Analysis in machine learning is an approach to analyze the data using visual techniques. Used to discover trends, patterns.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Exploratory Data Analysis in Machine Learning - Entri Blog\" \/>\n<meta property=\"og:description\" content=\"Exploratory Data Analysis in machine learning is an approach to analyze the data using visual techniques. 
Used to discover trends, patterns.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/\" \/>\n<meta property=\"og:site_name\" content=\"Entri Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/entri.me\/\" \/>\n<meta property=\"article:published_time\" content=\"2023-01-27T10:22:22+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-05-15T06:02:29+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/01\/Exploratory-Data-Analysis-in-Machine-Learning-EDA.png\" \/>\n\t<meta property=\"og:image:width\" content=\"820\" \/>\n\t<meta property=\"og:image:height\" content=\"615\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Vishnu K V\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@entri_app\" \/>\n<meta name=\"twitter:site\" content=\"@entri_app\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Vishnu K V\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/\"},\"author\":{\"name\":\"Vishnu K V\",\"@id\":\"https:\/\/entri.app\/blog\/#\/schema\/person\/1b415089342c8fc9f0590bb666c212e6\"},\"headline\":\"Exploratory Data Analysis in Machine Learning &#8211; EDA Steps, Importance\",\"datePublished\":\"2023-01-27T10:22:22+00:00\",\"dateModified\":\"2023-05-15T06:02:29+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/\"},\"wordCount\":1294,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/entri.app\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/01\/Exploratory-Data-Analysis-in-Machine-Learning-EDA.png\",\"articleSection\":[\"Articles\",\"Data Science and Machine Learning\",\"Entri Skilling\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/\",\"url\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/\",\"name\":\"Exploratory Data Analysis in Machine Learning - Entri 
Blog\",\"isPartOf\":{\"@id\":\"https:\/\/entri.app\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/01\/Exploratory-Data-Analysis-in-Machine-Learning-EDA.png\",\"datePublished\":\"2023-01-27T10:22:22+00:00\",\"dateModified\":\"2023-05-15T06:02:29+00:00\",\"description\":\"Exploratory Data Analysis in machine learning is an approach to analyze the data using visual techniques. Used to discover trends, patterns.\",\"breadcrumb\":{\"@id\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#primaryimage\",\"url\":\"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/01\/Exploratory-Data-Analysis-in-Machine-Learning-EDA.png\",\"contentUrl\":\"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/01\/Exploratory-Data-Analysis-in-Machine-Learning-EDA.png\",\"width\":820,\"height\":615,\"caption\":\"Exploratory Data Analysis in Machine Learning EDA\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/entri.app\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Entri Skilling\",\"item\":\"https:\/\/entri.app\/blog\/category\/entri-skilling\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Data Science and Machine 
Learning\",\"item\":\"https:\/\/entri.app\/blog\/category\/entri-skilling\/data-science-ml\/\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"Exploratory Data Analysis in Machine Learning &#8211; EDA Steps, Importance\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/entri.app\/blog\/#website\",\"url\":\"https:\/\/entri.app\/blog\/\",\"name\":\"Entri Blog\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/entri.app\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/entri.app\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/entri.app\/blog\/#organization\",\"name\":\"Entri App\",\"url\":\"https:\/\/entri.app\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/entri.app\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/entri.app\/blog\/wp-content\/uploads\/2019\/10\/Entri-Logo-1.png\",\"contentUrl\":\"https:\/\/entri.app\/blog\/wp-content\/uploads\/2019\/10\/Entri-Logo-1.png\",\"width\":989,\"height\":446,\"caption\":\"Entri App\"},\"image\":{\"@id\":\"https:\/\/entri.app\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/entri.me\/\",\"https:\/\/x.com\/entri_app\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/entri.app\/blog\/#\/schema\/person\/1b415089342c8fc9f0590bb666c212e6\",\"name\":\"Vishnu K V\",\"description\":\"Professional Data Scientist who is passionate about writing relevant and interesting articles to inspire young data science aspirants and a continuous learner of the data science field.\",\"url\":\"https:\/\/entri.app\/blog\/author\/vishnu\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Exploratory Data Analysis in Machine Learning - Entri Blog","description":"Exploratory Data Analysis in machine learning is an approach to analyze the data using visual techniques. Used to discover trends, patterns.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/","og_locale":"en_US","og_type":"article","og_title":"Exploratory Data Analysis in Machine Learning - Entri Blog","og_description":"Exploratory Data Analysis in machine learning is an approach to analyze the data using visual techniques. Used to discover trends, patterns.","og_url":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/","og_site_name":"Entri Blog","article_publisher":"https:\/\/www.facebook.com\/entri.me\/","article_published_time":"2023-01-27T10:22:22+00:00","article_modified_time":"2023-05-15T06:02:29+00:00","og_image":[{"width":820,"height":615,"url":"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/01\/Exploratory-Data-Analysis-in-Machine-Learning-EDA.png","type":"image\/png"}],"author":"Vishnu K V","twitter_card":"summary_large_image","twitter_creator":"@entri_app","twitter_site":"@entri_app","twitter_misc":{"Written by":"Vishnu K V","Est. 
reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#article","isPartOf":{"@id":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/"},"author":{"name":"Vishnu K V","@id":"https:\/\/entri.app\/blog\/#\/schema\/person\/1b415089342c8fc9f0590bb666c212e6"},"headline":"Exploratory Data Analysis in Machine Learning &#8211; EDA Steps, Importance","datePublished":"2023-01-27T10:22:22+00:00","dateModified":"2023-05-15T06:02:29+00:00","mainEntityOfPage":{"@id":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/"},"wordCount":1294,"commentCount":0,"publisher":{"@id":"https:\/\/entri.app\/blog\/#organization"},"image":{"@id":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#primaryimage"},"thumbnailUrl":"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/01\/Exploratory-Data-Analysis-in-Machine-Learning-EDA.png","articleSection":["Articles","Data Science and Machine Learning","Entri Skilling"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/","url":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/","name":"Exploratory Data Analysis in Machine Learning - Entri 
Blog","isPartOf":{"@id":"https:\/\/entri.app\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#primaryimage"},"image":{"@id":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#primaryimage"},"thumbnailUrl":"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/01\/Exploratory-Data-Analysis-in-Machine-Learning-EDA.png","datePublished":"2023-01-27T10:22:22+00:00","dateModified":"2023-05-15T06:02:29+00:00","description":"Exploratory Data Analysis in machine learning is an approach to analyze the data using visual techniques. Used to discover trends, patterns.","breadcrumb":{"@id":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#primaryimage","url":"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/01\/Exploratory-Data-Analysis-in-Machine-Learning-EDA.png","contentUrl":"https:\/\/entri.app\/blog\/wp-content\/uploads\/2023\/01\/Exploratory-Data-Analysis-in-Machine-Learning-EDA.png","width":820,"height":615,"caption":"Exploratory Data Analysis in Machine Learning EDA"},{"@type":"BreadcrumbList","@id":"https:\/\/entri.app\/blog\/exploratory-data-analysis-in-machine-learning\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/entri.app\/blog\/"},{"@type":"ListItem","position":2,"name":"Entri Skilling","item":"https:\/\/entri.app\/blog\/category\/entri-skilling\/"},{"@type":"ListItem","position":3,"name":"Data Science and Machine Learning","item":"https:\/\/entri.app\/blog\/category\/entri-skilling\/data-science-ml\/"},{"@type":"ListItem","position":4,"name":"Exploratory Data Analysis in Machine Learning &#8211; EDA 
Steps, Importance"}]},{"@type":"WebSite","@id":"https:\/\/entri.app\/blog\/#website","url":"https:\/\/entri.app\/blog\/","name":"Entri Blog","description":"","publisher":{"@id":"https:\/\/entri.app\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/entri.app\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/entri.app\/blog\/#organization","name":"Entri App","url":"https:\/\/entri.app\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/entri.app\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/entri.app\/blog\/wp-content\/uploads\/2019\/10\/Entri-Logo-1.png","contentUrl":"https:\/\/entri.app\/blog\/wp-content\/uploads\/2019\/10\/Entri-Logo-1.png","width":989,"height":446,"caption":"Entri App"},"image":{"@id":"https:\/\/entri.app\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/entri.me\/","https:\/\/x.com\/entri_app"]},{"@type":"Person","@id":"https:\/\/entri.app\/blog\/#\/schema\/person\/1b415089342c8fc9f0590bb666c212e6","name":"Vishnu K V","description":"Professional Data Scientist who is passionate about writing relevant and interesting articles to inspire young data science aspirants and a continuous learner of the data science 
field.","url":"https:\/\/entri.app\/blog\/author\/vishnu\/"}]}},"_links":{"self":[{"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/posts\/25552006","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/users\/119"}],"replies":[{"embeddable":true,"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/comments?post=25552006"}],"version-history":[{"count":7,"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/posts\/25552006\/revisions"}],"predecessor-version":[{"id":25560149,"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/posts\/25552006\/revisions\/25560149"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/media\/25552009"}],"wp:attachment":[{"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/media?parent=25552006"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/categories?post=25552006"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entri.app\/blog\/wp-json\/wp\/v2\/tags?post=25552006"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}