{"id":16736,"date":"2020-02-02T16:16:13","date_gmt":"2020-02-02T21:16:13","guid":{"rendered":"https:\/\/bangla.salearningschool.com\/recent-posts\/?p=16736"},"modified":"2020-02-08T09:40:13","modified_gmt":"2020-02-08T14:40:13","slug":"misc-basic-statistics-for-data-science","status":"publish","type":"post","link":"http:\/\/bangla.sitestree.com\/?p=16736","title":{"rendered":"Misc Basic Statistics for Data Science"},"content":{"rendered":"<p><strong>Hypergeometric Distribution<\/strong><\/p>\n<p>&#8220;In <a title=\"Probability theory\" href=\"https:\/\/en.wikipedia.org\/wiki\/Probability_theory\">probability theory<\/a> and <a title=\"Statistics\" href=\"https:\/\/en.wikipedia.org\/wiki\/Statistics\">statistics<\/a>, the <strong>hypergeometric distribution<\/strong> is a <a title=\"Probability distribution\" href=\"https:\/\/en.wikipedia.org\/wiki\/Probability_distribution#Discrete_probability_distribution\">discrete probability distribution<\/a> that describes the probability of <img decoding=\"async\" src=\"https:\/\/wikimedia.org\/api\/rest_v1\/media\/math\/render\/svg\/c3c9a2c7b599b37105512c5d570edc034056dd40\" alt=\"k\" \/> successes (random draws for which the object drawn has a specified feature) in <img decoding=\"async\" src=\"https:\/\/wikimedia.org\/api\/rest_v1\/media\/math\/render\/svg\/a601995d55609f2d9f5e233e36fbe9ea26011b3b\" alt=\"n\" \/> draws, <em>without<\/em> replacement, from a finite <a title=\"Population\" href=\"https:\/\/en.wikipedia.org\/wiki\/Population\">population<\/a> of size <img decoding=\"async\" src=\"https:\/\/wikimedia.org\/api\/rest_v1\/media\/math\/render\/svg\/f5e3890c981ae85503089652feb48b191b57aae3\" alt=\"N\" \/> that contains exactly <img decoding=\"async\" src=\"https:\/\/wikimedia.org\/api\/rest_v1\/media\/math\/render\/svg\/2b76fce82a62ed5461908f0dc8f037de4e3686b0\" alt=\"K\" \/> objects with that feature, wherein each draw is either a success or a failure. In contrast, the <a title=\"Binomial distribution\" href=\"https:\/\/en.wikipedia.org\/wiki\/Binomial_distribution\">binomial distribution<\/a> describes the probability of <img decoding=\"async\" src=\"https:\/\/wikimedia.org\/api\/rest_v1\/media\/math\/render\/svg\/c3c9a2c7b599b37105512c5d570edc034056dd40\" alt=\"k\" \/> successes in <img decoding=\"async\" src=\"https:\/\/wikimedia.org\/api\/rest_v1\/media\/math\/render\/svg\/a601995d55609f2d9f5e233e36fbe9ea26011b3b\" alt=\"n\" \/> draws <em>with<\/em> replacement.<br \/>\nIn <a title=\"Statistics\" href=\"https:\/\/en.wikipedia.org\/wiki\/Statistics\">statistics<\/a>, the <strong>hypergeometric test<\/strong> uses the hypergeometric distribution to calculate the statistical significance of having drawn a specific <img decoding=\"async\" src=\"https:\/\/wikimedia.org\/api\/rest_v1\/media\/math\/render\/svg\/c3c9a2c7b599b37105512c5d570edc034056dd40\" alt=\"k\" \/> successes (out of <img decoding=\"async\" src=\"https:\/\/wikimedia.org\/api\/rest_v1\/media\/math\/render\/svg\/a601995d55609f2d9f5e233e36fbe9ea26011b3b\" alt=\"n\" \/> total draws) from the aforementioned population. The test is often used to identify which sub-populations are over- or under-represented in a sample. This test has a wide range of applications. For example, a marketing group could use the test to understand their customer base by testing a set of known customers for over-representation of various demographic subgroups (e.g., women, people under 30).&#8221; <a href=\"https:\/\/en.wikipedia.org\/wiki\/Hypergeometric_distribution\">https:\/\/en.wikipedia.org\/wiki\/Hypergeometric_distribution<\/a><\/p>\n<p><strong>Binomial Distribution<\/strong><br \/>\n&#8220;In <a title=\"Probability theory\" href=\"https:\/\/en.wikipedia.org\/wiki\/Probability_theory\">probability theory<\/a> and <a title=\"Statistics\" href=\"https:\/\/en.wikipedia.org\/wiki\/Statistics\">statistics<\/a>, the <strong>binomial distribution<\/strong> with parameters <em>n<\/em> and <em>p<\/em> is the <a title=\"Discrete probability distribution\" href=\"https:\/\/en.wikipedia.org\/wiki\/Discrete_probability_distribution\">discrete probability distribution<\/a> of the number of successes in a sequence of <em>n<\/em> <a title=\"Statistical independence\" href=\"https:\/\/en.wikipedia.org\/wiki\/Statistical_independence\">independent<\/a> <a title=\"Experiment (probability theory)\" href=\"https:\/\/en.wikipedia.org\/wiki\/Experiment_(probability_theory)\">experiments<\/a>, each asking a <a title=\"Yes\u2013no question\" href=\"https:\/\/en.wikipedia.org\/wiki\/Yes%E2%80%93no_question\">yes\u2013no question<\/a>, and each with its own <a title=\"Boolean-valued function\" href=\"https:\/\/en.wikipedia.org\/wiki\/Boolean-valued_function\">boolean<\/a>-valued <a title=\"Outcome (probability)\" href=\"https:\/\/en.wikipedia.org\/wiki\/Outcome_(probability)\">outcome<\/a>: <a title=\"wikt:success\" href=\"https:\/\/en.wiktionary.org\/wiki\/success\">success<\/a>\/<a title=\"Yes and no\" href=\"https:\/\/en.wikipedia.org\/wiki\/Yes_and_no\">yes<\/a>\/<a title=\"Truth value\" href=\"https:\/\/en.wikipedia.org\/wiki\/Truth_value\">true<\/a>\/<a title=\"One\" href=\"https:\/\/en.wikipedia.org\/wiki\/One\">one<\/a> (with <a title=\"Probability\" href=\"https:\/\/en.wikipedia.org\/wiki\/Probability\">probability<\/a> <em>p<\/em>) or <a title=\"Failure\" href=\"https:\/\/en.wikipedia.org\/wiki\/Failure\">failure<\/a>\/<a title=\"Yes and no\" href=\"https:\/\/en.wikipedia.org\/wiki\/Yes_and_no\">no<\/a>\/<a title=\"False (logic)\" href=\"https:\/\/en.wikipedia.org\/wiki\/False_(logic)\">false<\/a>\/<a title=\"Zero\" href=\"https:\/\/en.wikipedia.org\/wiki\/Zero\">zero<\/a> (with <a title=\"Probability\" href=\"https:\/\/en.wikipedia.org\/wiki\/Probability\">probability<\/a> <em>q<\/em> = 1 \u2212 <em>p<\/em>). A single success\/failure experiment is also called a <a title=\"Bernoulli trial\" href=\"https:\/\/en.wikipedia.org\/wiki\/Bernoulli_trial\">Bernoulli trial<\/a> or Bernoulli experiment and a sequence of outcomes is called a <a title=\"Bernoulli process\" href=\"https:\/\/en.wikipedia.org\/wiki\/Bernoulli_process\">Bernoulli process<\/a>; for a single trial, i.e., <em>n<\/em> = 1, the binomial distribution is a <a title=\"Bernoulli distribution\" href=\"https:\/\/en.wikipedia.org\/wiki\/Bernoulli_distribution\">Bernoulli distribution<\/a>. The binomial distribution is the basis for the popular <a title=\"Binomial test\" href=\"https:\/\/en.wikipedia.org\/wiki\/Binomial_test\">binomial test<\/a> of <a title=\"Statistical significance\" href=\"https:\/\/en.wikipedia.org\/wiki\/Statistical_significance\">statistical significance<\/a>.<\/p>\n<p>The binomial distribution is frequently used to model the number of successes in a sample of size <em>n<\/em> drawn <a title=\"With replacement\" href=\"https:\/\/en.wikipedia.org\/wiki\/With_replacement\">with replacement<\/a> from a population of size <em>N<\/em>. If the sampling is carried out without replacement, the draws are not independent and so the resulting distribution is a <a title=\"Hypergeometric distribution\" href=\"https:\/\/en.wikipedia.org\/wiki\/Hypergeometric_distribution\">hypergeometric distribution<\/a>, not a binomial one. However, for <em>N<\/em> much larger than <em>n<\/em>, the binomial distribution remains a good approximation, and is widely used.&#8221;<br \/>\n<a href=\"https:\/\/en.wikipedia.org\/wiki\/Binomial_distribution\">https:\/\/en.wikipedia.org\/wiki\/Binomial_distribution<\/a><\/p>\n<p><strong>Negative Binomial Distribution<\/strong><br \/>\n&#8220;In <a title=\"Probability theory\" href=\"https:\/\/en.wikipedia.org\/wiki\/Probability_theory\">probability theory<\/a> and <a title=\"Statistics\" href=\"https:\/\/en.wikipedia.org\/wiki\/Statistics\">statistics<\/a>, the <strong>negative binomial distribution<\/strong> is a <a title=\"Discrete probability distribution\" href=\"https:\/\/en.wikipedia.org\/wiki\/Discrete_probability_distribution\">discrete probability distribution<\/a> of the number of successes in a sequence of independent and identically distributed <a title=\"Bernoulli trial\" href=\"https:\/\/en.wikipedia.org\/wiki\/Bernoulli_trial\">Bernoulli trials<\/a> before a specified (non-random) number of failures (denoted <em>r<\/em>) occurs. For example, we can define that when we throw a <a title=\"Dice\" href=\"https:\/\/en.wikipedia.org\/wiki\/Dice\">dice<\/a> and get a 6 it is a failure while rolling any other number is considered a success, and also choose r to be 3. We then throw the dice repeatedly until the third time the number 6 appears. In such a case, the probability distribution of the number of non-6s that appeared will be a negative binomial distribution.<\/p>\n<p>The <strong>Pascal distribution<\/strong> (after <a title=\"Blaise Pascal\" href=\"https:\/\/en.wikipedia.org\/wiki\/Blaise_Pascal\">Blaise Pascal<\/a>) and <strong>Polya distribution<\/strong> (for <a title=\"George P\u00f3lya\" href=\"https:\/\/en.wikipedia.org\/wiki\/George_P%C3%B3lya\">George P\u00f3lya<\/a>) are special cases of the negative binomial distribution. A convention among engineers, climatologists, and others is to use &#8220;negative binomial&#8221; or &#8220;Pascal&#8221; for the case of an integer-valued stopping-time parameter <em>r<\/em>, and use &#8220;Polya&#8221; for the real-valued case.&#8221;<br \/>\n<a href=\"https:\/\/en.wikipedia.org\/wiki\/Negative_binomial_distribution\">https:\/\/en.wikipedia.org\/wiki\/Negative_binomial_distribution<\/a><\/p>\n<p>Probability and Counting<\/p>\n<p>&#8220;To decide &#8220;how likely&#8221; an event is, we need to <strong>count<\/strong> the number of times an event could occur and compare it to the total number of possible events. Such a comparison is called the <strong>probability<\/strong> of the particular event occurring. The mathematical theory of <strong>counting<\/strong> is known as combinatorial analysis&#8221;<br \/>\n<a href=\"https:\/\/www.intmath.com\/counting-probability\/counting-probability-intro.php\">https:\/\/www.intmath.com\/counting-probability\/counting-probability-intro.php<\/a><\/p>\n<p>Principle of Counting<\/p>\n<p>&#8220;The Fundamental Counting Principle (also called the counting rule) is a way to figure out the number of outcomes in a probability problem. Basically, you multiply the events together to get the total number of outcomes. The formula is:<br \/>\nIf you have an event \u201ca\u201d and another event \u201cb\u201d then all the different outcomes for the events is a * b.&#8221;<br \/>\n<a href=\"https:\/\/www.statisticshowto.datasciencecentral.com\/fundamental-counting-principle\/\">https:\/\/www.statisticshowto.datasciencecentral.com\/fundamental-counting-principle\/<\/a><\/p>\n<p><strong>Combinatorics<\/strong><br \/>\n<a href=\"https:\/\/mathigon.org\/world\/Combinatorics\">https:\/\/mathigon.org\/world\/Combinatorics<\/a><\/p>\n<p>fundamental principle of counting<\/p>\n<p>&#8220;The <strong>Fundamental Counting Principle<\/strong> states that if one event has m possible outcomes and a second independent event has n possible outcomes, then there are m x n total possible outcomes for the two events together.&#8221;<br \/>\n<a href=\"https:\/\/www.mathgoodies.com\/glossary\/term\/Fundamental%20Counting%20Principle\">https:\/\/www.mathgoodies.com\/glossary\/term\/Fundamental%20Counting%20Principle<\/a><\/p>\n<p>Factorial<br \/>\n&#8220;In <a title=\"Mathematics\" href=\"https:\/\/en.wikipedia.org\/wiki\/Mathematics\">mathematics<\/a>, the <strong>factorial<\/strong> of a positive <a title=\"Integer\" href=\"https:\/\/en.wikipedia.org\/wiki\/Integer\">integer<\/a> n, denoted by <em>n<\/em>!, is the <a title=\"Product (mathematics)\" href=\"https:\/\/en.wikipedia.org\/wiki\/Product_(mathematics)\">product<\/a> of all positive integers less than or equal to n:<br \/>\n{\\displaystyle n!=n\\times (n-1)\\times (n-2)\\times (n-3)\\times \\cdots \\times 3\\times 2\\times 1\\,.}<img decoding=\"async\" src=\"https:\/\/wikimedia.org\/api\/rest_v1\/media\/math\/render\/svg\/9b2be989313e5805c0d2a0d6a730493e04efd317\" alt=\"{\\displaystyle n!=n\\times (n-1)\\times (n-2)\\times (n-3)\\times \\cdots \\times 3\\times 2\\times 1\\,.}\" \/>&#8221;<br \/>\n<a href=\"https:\/\/en.wikipedia.org\/wiki\/Factorial\">https:\/\/en.wikipedia.org\/wiki\/Factorial<\/a><\/p>\n<p><strong>Factorial with Identical Numbers<\/strong><br \/>\n<a href=\"https:\/\/i0.wp.com\/bangla.salearningschool.com\/wp-content\/uploads\/2020\/02\/image.png\" rel=\"attachment wp-att-16737\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" class=\"alignnone  wp-image-16737\" title=\"image-png\" src=\"https:\/\/i0.wp.com\/bangla.salearningschool.com\/wp-content\/uploads\/2020\/02\/image.png?resize=526%2C81\" alt=\"\" width=\"526\" height=\"81\" srcset=\"https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2020\/02\/image.png?w=738 738w, https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2020\/02\/image.png?resize=300%2C46 300w\" sizes=\"auto, (max-width: 526px) 100vw, 526px\" \/><\/a><\/p>\n<p><em><strong>&#8220;<\/strong><\/em>Bayes&#8217; theorem<\/p>\n<h2>Description<\/h2>\n<p>In probability theory and statistics, Bayes\u2019s theorem describes the probability of an event, based on prior knowledge of conditions that might be related to the event. <a href=\"https:\/\/en.wikipedia.org\/wiki\/Bayes'_theorem\">Wikipedia<\/a><\/p>\n<p>Formula<br \/>\n<img decoding=\"async\" src=\"https:\/\/www.gstatic.com\/education\/formulas\/images_long_sheet\/bayes__theorem.svg\" alt=\"P(A\\mid B)=\\frac {P(B\\mid A) \\cdot P(A)}{P(B)}\" \/><\/p>\n<table>\n<tbody>\n<tr>\n<td><img decoding=\"async\" src=\"https:\/\/www.gstatic.com\/education\/formulas\/images_long_sheet\/bayes__theorem_AB.svg\" alt=\"A, B\" \/><\/td>\n<td>=<\/td>\n<td>events<\/td>\n<\/tr>\n<tr>\n<td><img decoding=\"async\" src=\"https:\/\/www.gstatic.com\/education\/formulas\/images_long_sheet\/bayes__theorem_PAB.svg\" alt=\"P(A|B)\" \/><\/td>\n<td>=<\/td>\n<td>probability of A given B is true<\/td>\n<\/tr>\n<tr>\n<td><img decoding=\"async\" src=\"https:\/\/www.gstatic.com\/education\/formulas\/images_long_sheet\/bayes__theorem_PBA.svg\" alt=\"P(B|A)\" \/><\/td>\n<td>=<\/td>\n<td>probability of B given A is true<\/td>\n<\/tr>\n<tr>\n<td><img decoding=\"async\" src=\"https:\/\/www.gstatic.com\/education\/formulas\/images_long_sheet\/bayes__theorem_PAPB.svg\" alt=\"P(A), P(B)\" \/><\/td>\n<td>=<\/td>\n<td>the independent probabilities of A and B<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><em><strong>&#8220;<\/strong><\/em><\/p>\n<p><em><strong>*** . *** . *** . ***<\/strong><\/em><br \/>\n<em><strong>Note: Older short-notes from this site are posted on Medium: <\/strong><\/em><a href=\"https:\/\/medium.com\/@SayedAhmedCanada\">https:\/\/medium.com\/@SayedAhmedCanada<\/a><\/p>\n<p>*** . *** *** . *** . *** . ***<\/p>\n<p><em><strong>Sayed Ahmed<\/strong><br \/>\n<\/em><br \/>\n<em><strong>BSc. Eng. in Comp. Sc. &amp; Eng. (BUET)<\/strong><\/em><br \/>\n<em><strong>MSc. in Comp. Sc. (U of Manitoba, Canada)<\/strong><\/em><br \/>\n<em><strong>MSc. in Data Science and Analytics (Ryerson University, Canada)<\/strong><\/em><br \/>\n<em><strong>Linkedin<\/strong>: <a href=\"https:\/\/ca.linkedin.com\/in\/sayedjustetc\">https:\/\/ca.linkedin.com\/in\/sayedjustetc<\/a><br \/>\n<\/em><\/p>\n<p><em><strong>Blog<\/strong>: <a href=\"http:\/\/bangla.salearningschool.com\/\">http:\/\/Bangla.SaLearningSchool.com<\/a>, <a href=\"http:\/\/sitestree.com\">http:\/\/SitesTree.com<\/a><\/em><br \/>\n<em><strong>Online and Offline Training<\/strong>: <a href=\"http:\/\/training.SitesTree.com\">http:\/\/Training.SitesTree.com<\/a> (Also, can be free and low cost sometimes)<\/em><\/p>\n<p><em>Facebook Group\/Form to discuss (Q &amp; A): <\/em><a href=\"https:\/\/www.facebook.com\/banglasalearningschool\">https:\/\/www.facebook.com\/banglasalearningschool<\/a><\/p>\n<p>Our free or paid training events: <a href=\"https:\/\/www.facebook.com\/justetcsocial\">https:\/\/www.facebook.com\/justetcsocial<\/a><\/p>\n<p><em>Get access to courses on Big Data, Data Science, AI, Cloud, Linux, System Admin, Web Development and Misc. related. Also, create your own course to sell to others. <\/em><a href=\"http:\/\/sitestree.com\/training\/\">http:\/\/sitestree.com\/training\/<\/a><\/p>\n<p><em><strong>I<\/strong>f you want to contribute to occasional free and\/or low cost online\/offline training or charitable\/non-profit work in the education\/health\/social service sector, you can financially contribute to: safoundation at <a href=\"http:\/\/salearningschool.com\">salearningschool.com<\/a> using Paypal or Credit Card (on <\/em><a href=\"http:\/\/sitestree.com\/training\/enrol\/index.php?id=114\">http:\/\/sitestree.com\/training\/enrol\/index.php?id=114<\/a> <em>).<br \/>\n<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Hypergeometric Distribution &#8220;In probability theory and statistics, the hypergeometric distribution is a discrete probability distribution that describes the probability of successes (random draws for which the object drawn has a specified feature) in draws, without replacement, from a finite population of size that contains exactly objects with that feature, wherein each draw is either a &hellip; <\/p>\n<p><a class=\"more-link btn\" href=\"http:\/\/bangla.sitestree.com\/?p=16736\">Continue reading<\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1910,182],"tags":[],"class_list":["post-16736","post","type-post","status-publish","format-standard","hentry","category-ai-ml-ds-rl-dl-nn-nlp-data-mining-optimization","category---blog","item-wrap"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[{"id":16532,"url":"http:\/\/bangla.sitestree.com\/?p=16532","url_meta":{"origin":16736,"position":0},"title":"Part 1: Some Math\/Stat Background that (true) Data Scientists will know\/use: from the internet","author":"Sayed","date":"December 28, 2019","format":false,"excerpt":"Chebyshev's inequality \"In probability theory, Chebyshev's inequality (also called the Bienaym\u00e9\u2013Chebyshev inequality) guarantees that, for a wide class of probability distributions, no more than a certain fraction of values can be more than a certain distance from the mean. Specifically, no more than 1\/k2 of the distribution's values can be\u2026","rel":"","context":"In &quot;Math and Statistics for Data Science, and Engineering&quot;","block_context":{"text":"Math and Statistics for Data Science, and Engineering","link":"http:\/\/bangla.sitestree.com\/?cat=1908"},"img":{"alt_text":"","src":"https:\/\/upload.wikimedia.org\/wikipedia\/commons\/thumb\/8\/85\/Discrete_probability_distrib.svg\/220px-Discrete_probability_distrib.svg.png","width":350,"height":200},"classes":[]},{"id":16200,"url":"http:\/\/bangla.sitestree.com\/?p=16200","url_meta":{"origin":16736,"position":1},"title":"Important Basic Concepts: Statistics for Big Data","author":"Sayed","date":"September 15, 2019","format":false,"excerpt":"Important Basic Concepts: Statistics for Big Data Graphical : Exploratory Data Analysis (EDA) methods? First of all, EDA is about exploring the data and understanding if the data will be good for the experiment and study. Graphs and plots can easily show the data patterns. The raw data can be\u2026","rel":"","context":"In &quot;Statistics for Big Data&quot;","block_context":{"text":"Statistics for Big Data","link":"http:\/\/bangla.sitestree.com\/?cat=1904"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":16665,"url":"http:\/\/bangla.sitestree.com\/?p=16665","url_meta":{"origin":16736,"position":2},"title":"Bayesian Statistics and Machine Learning","author":"Sayed","date":"January 19, 2020","format":false,"excerpt":"Bayesian Statistics and Machine Learning \"Bayesian inference is a method of statistical inference in which Bayes' theorem is used to update the probability for a hypothesis as more evidence or information becomes available. Bayesian inference is an important technique in statistics, and especially in mathematical statistics.\" en.wikipedia.org \u203a wiki \u203a\u2026","rel":"","context":"In &quot;AI ML DS RL DL NN NLP Data Mining Optimization&quot;","block_context":{"text":"AI ML DS RL DL NN NLP Data Mining Optimization","link":"http:\/\/bangla.sitestree.com\/?cat=1910"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2020\/01\/image-1.png?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":16550,"url":"http:\/\/bangla.sitestree.com\/?p=16550","url_meta":{"origin":16736,"position":3},"title":"Part 3: Some Basic Math\/Stat Concepts for the wanna be Data Scientists","author":"Sayed","date":"December 30, 2019","format":false,"excerpt":"Conditional Probability and PDF \"The conditional probability of an event B is the probability that the event will occur given the knowledge that an event A has already occurred. This probability is written P(B|A), notation for the probability of B given A. \" \"In the case where events A and\u2026","rel":"","context":"In &quot;AI ML DS RL DL NN NLP Data Mining Optimization&quot;","block_context":{"text":"AI ML DS RL DL NN NLP Data Mining Optimization","link":"http:\/\/bangla.sitestree.com\/?cat=1910"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2019\/12\/image-8.png?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2019\/12\/image-8.png?resize=350%2C200 1x, https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2019\/12\/image-8.png?resize=525%2C300 1.5x"},"classes":[]},{"id":26205,"url":"http:\/\/bangla.sitestree.com\/?p=26205","url_meta":{"origin":16736,"position":4},"title":"Important Basic Concepts: Statistics for Big Data #Root","author":"Author-Check- Article-or-Video","date":"April 19, 2021","format":false,"excerpt":"Important Basic Concepts: Statistics for Big Data Graphical : Exploratory Data Analysis (EDA) methods? First of all, EDA is about exploring the data and understanding if the data will be good for the experiment and study. Graphs and plots can easily show the data patterns. The raw data can be\u2026","rel":"","context":"In &quot;FromSitesTree.com&quot;","block_context":{"text":"FromSitesTree.com","link":"http:\/\/bangla.sitestree.com\/?cat=1917"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":16536,"url":"http:\/\/bangla.sitestree.com\/?p=16536","url_meta":{"origin":16736,"position":5},"title":"Part 2: Some basic Math\/Statistics concepts that Data Scientists (the true ones) will usually know\/use","author":"Sayed","date":"December 29, 2019","format":false,"excerpt":"Part 2: Some basic Math\/Statistics concepts that Data Scientists (the true ones) will usually know\/use (came across, studied, learned, used) Covariance and Correlation \"Covariance is a measure of how two variables change together, but its magnitude is unbounded, so it is difficult to interpret. By dividing covariance by the product\u2026","rel":"","context":"In &quot;Math and Statistics for Data Science, and Engineering&quot;","block_context":{"text":"Math and Statistics for Data Science, and Engineering","link":"http:\/\/bangla.sitestree.com\/?cat=1908"},"img":{"alt_text":"[eq5]","src":"https:\/\/i0.wp.com\/www.statlect.com\/images\/covariance-formula__12.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]}],"_links":{"self":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/16736","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=16736"}],"version-history":[{"count":2,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/16736\/revisions"}],"predecessor-version":[{"id":16740,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/16736\/revisions\/16740"}],"wp:attachment":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=16736"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=16736"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=16736"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}