{"id":78189,"date":"2025-05-21T00:13:18","date_gmt":"2025-05-21T00:13:18","guid":{"rendered":"http:\/\/bangla.sitestree.com\/?p=78189"},"modified":"2025-06-25T02:24:50","modified_gmt":"2025-06-25T02:24:50","slug":"initial-and-exploratory-analysis-for-data-analytics-projects","status":"publish","type":"post","link":"http:\/\/bangla.sitestree.com\/?p=78189","title":{"rendered":"Initial and Exploratory Analysis for Data Analytics Projects"},"content":{"rendered":"\n<p><\/p>\n\n\n\n<p>To have a thorough understanding of the data.  <\/p>\n\n\n\n<p><strong>Two Types:<\/strong><\/p>\n\n\n\n<p>\u2022 Initial Analysis<\/p>\n\n\n\n<p>\u2022 Exploratory Analysis<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Initial Analysis:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Univariate<\/li>\n\n\n\n<li>Bi-Variate<\/li>\n\n\n\n<li>Multi-Variate<\/li>\n<\/ul>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Univariate Analysis<\/strong><\/p>\n\n\n\n<p>\u2022 Deciding\/Determining the dependent (target) variable<\/p>\n\n\n\n<p>\u2022 Assigning the correct data types, appropriate column names<\/p>\n\n\n\n<p>\u2022 Address: Inconsistencies, missing values, outliers<\/p>\n\n\n\n<p>\u2022 Categorical variables with too many levels (address the issue)<\/p>\n\n\n\n<p>\u2022 (understand) Distributions of the variables (is it a right fit for the project)<\/p>\n\n\n\n<p>\u2022 Imbalance in the dependent variable<\/p>\n\n\n\n<p>\u2022 Time variables<\/p>\n\n\n\n<p>\u2022 Univariate visualizations<\/p>\n\n\n\n<p>\u2022 <a href=\"https:\/\/towardsdatascience.com\/the-basics-of-eda-with-candy-83b2e8ad9e63\">A <\/a><a href=\"https:\/\/towardsdatascience.com\/the-basics-of-eda-with-candy-83b2e8ad9e63\">detailed data dictionary<\/a><\/p>\n\n\n\n<p>\u2022 <a href=\"https:\/\/solegaonkar.github.io\/ConceptLowVarianceFilter.html\">Low <\/a><a href=\"https:\/\/solegaonkar.github.io\/ConceptLowVarianceFilter.html\">variance filter<\/a><\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Bivariate Analysis<\/strong><\/p>\n\n\n\n<p>\u2022 Pairwise relations<\/p>\n\n\n\n<p>\u2022 Pairwise visualizations<\/p>\n\n\n\n<p>\u2022 Correlation analysis<\/p>\n\n\n\n<p>Multivariate Analysis<\/p>\n\n\n\n<p>\u2022 <a href=\"https:\/\/openclassrooms.com\/en\/courses\/5869986-perform-an-exploratory-data-analysis\/6435201-discover-the-principles-of-multivariate-exploratory-data-analysis\">Multivariate relations<\/a><\/p>\n\n\n\n<p>\u2022 <a href=\"https:\/\/www.stat.cmu.edu\/~hseltman\/309\/Book\/chapter4.pdf\">Statistical tools<\/a><\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Exploratory Analysis<\/strong><\/p>\n\n\n\n<p><a href=\"https:\/\/en.wikipedia.org\/wiki\/Normalization_(statistics)\">Normalizing<\/a><\/p>\n\n\n\n<p>\u2022 Subsetting the data<\/p>\n\n\n\n<p>\u2022 Clustering<\/p>\n\n\n\n<p><strong>Others<\/strong><\/p>\n\n\n\n<p>\u2022 Decision rules, association rules, n-grams<\/p>\n\n\n\n<p>\u2022 Time series analysis<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Data Analytics, Machine Learning, Data Science<\/strong><\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>To have a thorough understanding of the data. Two Types: \u2022 Initial Analysis \u2022 Exploratory Analysis Initial Analysis: Univariate Analysis \u2022 Deciding\/Determining the dependent (target) variable \u2022 Assigning the correct data types, appropriate column names \u2022 Address: Inconsistencies, missing values, outliers \u2022 Categorical variables with too many levels (address the issue) \u2022 (understand) Distributions of &hellip; <\/p>\n<p><a class=\"more-link btn\" href=\"http:\/\/bangla.sitestree.com\/?p=78189\">Continue reading<\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1974,1903,1],"tags":[],"class_list":["post-78189","post","type-post","status-publish","format-standard","hentry","category-analytics-and-machine-learning-project-development","category-data-visualization","category-root","item-wrap"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[{"id":14811,"url":"http:\/\/bangla.sitestree.com\/?p=14811","url_meta":{"origin":78189,"position":0},"title":"Next Recession: How to Invest and Profit in the Next Recession","author":"Sayed","date":"June 17, 2019","format":false,"excerpt":"How to Invest and Profit in the Next Recession https:\/\/www.bloomberg.com\/opinion\/articles\/2019-06-17\/how-to-invest-and-profit-in-the-next-recession \" plan on deploying your cash in tranches: Buy a U.S. index fund when markets are down 20 to 25%; add a developed global index fund when markets fall by 30%. And if we are lucky enough to enjoy a\u2026","rel":"","context":"In &quot;AI ML DS RL DL NN NLP Data Mining Optimization&quot;","block_context":{"text":"AI ML DS RL DL NN NLP Data Mining Optimization","link":"http:\/\/bangla.sitestree.com\/?cat=1910"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":25039,"url":"http:\/\/bangla.sitestree.com\/?p=25039","url_meta":{"origin":78189,"position":1},"title":"Next Recession: How to Invest and Profit in the Next Recession #Root","author":"Author-Check- Article-or-Video","date":"April 15, 2021","format":false,"excerpt":"How to Invest and Profit in the Next Recession https:\/\/www.bloomberg.com\/opinion\/articles\/2019-06-17\/how-to-invest-and-profit-in-the-next-recession \" plan on deploying your cash in tranches: Buy a U.S. index fund when markets are down 20 to 25%; add a developed global index fund when markets fall by 30%. And if we are lucky enough to enjoy a\u2026","rel":"","context":"In &quot;FromSitesTree.com&quot;","block_context":{"text":"FromSitesTree.com","link":"http:\/\/bangla.sitestree.com\/?cat=1917"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":75873,"url":"http:\/\/bangla.sitestree.com\/?p=75873","url_meta":{"origin":78189,"position":2},"title":"Step 1 A DSL to Automate Initial and Exploratory Analysis for data analytics project","author":"Sayed","date":"August 5, 2023","format":false,"excerpt":"","rel":"","context":"In &quot;DSL&quot;","block_context":{"text":"DSL","link":"http:\/\/bangla.sitestree.com\/?cat=1961"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":75875,"url":"http:\/\/bangla.sitestree.com\/?p=75875","url_meta":{"origin":78189,"position":3},"title":"Step 2 Review Report a DSL Automate Initial and Exploratory Analysis for Data analytics projects","author":"Sayed","date":"August 5, 2023","format":false,"excerpt":"","rel":"","context":"In &quot;DSL&quot;","block_context":{"text":"DSL","link":"http:\/\/bangla.sitestree.com\/?cat=1961"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":76257,"url":"http:\/\/bangla.sitestree.com\/?p=76257","url_meta":{"origin":78189,"position":4},"title":"Can you answer these questions on Data Science Project Development","author":"Sayed","date":"August 24, 2024","format":false,"excerpt":"Can you answer these questions on Data Science Project Development Questions to answer 1. What does a data science project usually involve? What is the common theme across data science projects? 2. Does industry projects and research projects differ? Why and to what extent? 3. What are the some dataset\u2026","rel":"","context":"In &quot;\u09ac\u09cd\u09b2\u0997 \u0964 Blog&quot;","block_context":{"text":"\u09ac\u09cd\u09b2\u0997 \u0964 Blog","link":"http:\/\/bangla.sitestree.com\/?cat=182"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":26252,"url":"http:\/\/bangla.sitestree.com\/?p=26252","url_meta":{"origin":78189,"position":5},"title":"Can you answer these random questions on Data Science Project Development #Root","author":"Author-Check- Article-or-Video","date":"April 21, 2021","format":false,"excerpt":"Questions to answer 1. What does a data science project usually involve? What is the common theme across data science projects? 2. Does industry projects and research projects differ? Why and to what extent? 3. What are the some dataset repositories? Where can you get them? 4. Are all public\u2026","rel":"","context":"In &quot;FromSitesTree.com&quot;","block_context":{"text":"FromSitesTree.com","link":"http:\/\/bangla.sitestree.com\/?cat=1917"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/78189","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=78189"}],"version-history":[{"count":3,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/78189\/revisions"}],"predecessor-version":[{"id":78294,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/78189\/revisions\/78294"}],"wp:attachment":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=78189"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=78189"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=78189"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}