{"id":78193,"date":"2025-05-21T00:36:48","date_gmt":"2025-05-21T00:36:48","guid":{"rendered":"http:\/\/bangla.sitestree.com\/?p=78193"},"modified":"2025-06-08T21:34:01","modified_gmt":"2025-06-08T21:34:01","slug":"initial-analysis-of-text-and-image-data-data-analytics-and-ml-projects","status":"publish","type":"post","link":"http:\/\/bangla.sitestree.com\/?p=78193","title":{"rendered":"Initial Analysis of Text and Image Data (Data Analytics and ML Projects)"},"content":{"rendered":"\n<p><\/p>\n\n\n\n<p><strong>Initial Analysis of Text Data<\/strong><\/p>\n\n\n\n<p>\u2022 Stop word filter<\/p>\n\n\n\n<p>\u2022 Lemma<\/p>\n\n\n\n<p>\u2022 POS<\/p>\n\n\n\n<p>\u2022 Vocabulary Analysis<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Image Data: Initial Analysis<\/strong><\/p>\n\n\n\n<p>\u2022 Fix image size, ratios<\/p>\n\n\n\n<p>\u2022 Image Scaling<\/p>\n\n\n\n<p>\u2022 Transform to Gray<\/p>\n\n\n\n<p>\u2022 Standardize<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p>Data Analytics, Machine Learning, Data Science<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Initial Analysis of Text Data \u2022 Stop word filter \u2022 Lemma \u2022 POS \u2022 Vocabulary Analysis Image Data: Initial Analysis \u2022 Fix image size, ratios \u2022 Image Scaling \u2022 Transform to Gray \u2022 Standardize Data Analytics, Machine Learning, Data Science<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1974,1],"tags":[],"class_list":["post-78193","post","type-post","status-publish","format-standard","hentry","category-analytics-and-machine-learning-project-development","category-root","item-wrap"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[{"id":78189,"url":"http:\/\/bangla.sitestree.com\/?p=78189","url_meta":{"origin":78193,"position":0},"title":"Initial and Exploratory Analysis for Data Analytics Projects","author":"Sayed","date":"May 21, 2025","format":false,"excerpt":"To have a thorough understanding of the data. Two Types: \u2022 Initial Analysis \u2022 Exploratory Analysis Initial Analysis: Univariate Bi-Variate Multi-Variate Univariate Analysis \u2022 Deciding\/Determining the dependent (target) variable \u2022 Assigning the correct data types, appropriate column names \u2022 Address: Inconsistencies, missing values, outliers \u2022 Categorical variables with too many\u2026","rel":"","context":"In &quot;Analytics and Machine Learning Project Development&quot;","block_context":{"text":"Analytics and Machine Learning Project Development","link":"http:\/\/bangla.sitestree.com\/?cat=1974"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":78191,"url":"http:\/\/bangla.sitestree.com\/?p=78191","url_meta":{"origin":78193,"position":1},"title":"Data Requirements for Data Analytics Projects","author":"Sayed","date":"May 21, 2025","format":false,"excerpt":"Data \u2022 Dataset Characteristics \u2022Large Scale, Real, Representative, Relevant Features, balanced classes, unit relevant \u2022 Adapting data\/dataset for the project \u2022Clean, normalize\/standardize, bring more data, and bring more data of the missing type \u2022 Data Suitability for the project \u2022 Check for R Square Measure \u2022 Check for Bias, Variance,\u2026","rel":"","context":"In &quot;Analytics and Machine Learning Project Development&quot;","block_context":{"text":"Analytics and Machine Learning Project Development","link":"http:\/\/bangla.sitestree.com\/?cat=1974"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":78231,"url":"http:\/\/bangla.sitestree.com\/?p=78231","url_meta":{"origin":78193,"position":2},"title":"Make Sense of your Data: For Data Analytics Project","author":"Sayed","date":"May 22, 2025","format":false,"excerpt":"Hypothesis-based versus data-driven analysis \u201cOnly those data analysts who are given time to explore and analyze data thoughtfully and thoroughly are consistently successful.\u201d Data Identification and Prioritization Use Augmented data besides Data Pipeline Analytics Sandbox Characterizing the Data\u2014Exploring a Single Variable Data: Descriptive analysis options Find: Distribution of quantitative variables\u2026","rel":"","context":"In &quot;Analytics and Machine Learning Project Development&quot;","block_context":{"text":"Analytics and Machine Learning Project Development","link":"http:\/\/bangla.sitestree.com\/?cat=1974"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2025\/05\/image-36.png?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2025\/05\/image-36.png?resize=350%2C200 1x, https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2025\/05\/image-36.png?resize=525%2C300 1.5x, https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2025\/05\/image-36.png?resize=700%2C400 2x"},"classes":[]},{"id":75873,"url":"http:\/\/bangla.sitestree.com\/?p=75873","url_meta":{"origin":78193,"position":3},"title":"Step 1 A DSL to Automate Initial and Exploratory Analysis for data analytics project","author":"Sayed","date":"August 5, 2023","format":false,"excerpt":"","rel":"","context":"In &quot;DSL&quot;","block_context":{"text":"DSL","link":"http:\/\/bangla.sitestree.com\/?cat=1961"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":75875,"url":"http:\/\/bangla.sitestree.com\/?p=75875","url_meta":{"origin":78193,"position":4},"title":"Step 2 Review Report a DSL Automate Initial and Exploratory Analysis for Data analytics projects","author":"Sayed","date":"August 5, 2023","format":false,"excerpt":"","rel":"","context":"In &quot;DSL&quot;","block_context":{"text":"DSL","link":"http:\/\/bangla.sitestree.com\/?cat=1961"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":78264,"url":"http:\/\/bangla.sitestree.com\/?p=78264","url_meta":{"origin":78193,"position":5},"title":"Reporting (Results and Discussion) for your Data Analytics Projects","author":"Sayed","date":"May 22, 2025","format":false,"excerpt":"Evaluation, Results, Analysis, Reporting Evaluation: What and How \u2022Evaluate: the accuracy and generality of the model \u2022 (we did in model evaluation, threat to validity) \u2022Now Evaluate: if model meets the business objectives \u2022Seek if there is some business reasons \u2022why this model is deficient \u2022Evaluation: Take this model and\u2026","rel":"","context":"In &quot;Analytics and Machine Learning Project Development&quot;","block_context":{"text":"Analytics and Machine Learning Project Development","link":"http:\/\/bangla.sitestree.com\/?cat=1974"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2025\/05\/image-44.png?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2025\/05\/image-44.png?resize=350%2C200 1x, https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2025\/05\/image-44.png?resize=525%2C300 1.5x, https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2025\/05\/image-44.png?resize=700%2C400 2x, https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2025\/05\/image-44.png?resize=1050%2C600 3x, https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2025\/05\/image-44.png?resize=1400%2C800 4x"},"classes":[]}],"_links":{"self":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/78193","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=78193"}],"version-history":[{"count":2,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/78193\/revisions"}],"predecessor-version":[{"id":78283,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/78193\/revisions\/78283"}],"wp:attachment":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=78193"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=78193"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=78193"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}