{"id":76257,"date":"2024-08-24T13:32:23","date_gmt":"2024-08-24T17:32:23","guid":{"rendered":"http:\/\/bangla.sitestree.com\/?p=76257"},"modified":"2024-08-24T13:32:31","modified_gmt":"2024-08-24T17:32:31","slug":"can-you-answer-these-questions-on-data-science-project-development","status":"publish","type":"post","link":"http:\/\/bangla.sitestree.com\/?p=76257","title":{"rendered":"Can you answer these questions on Data Science Project Development"},"content":{"rendered":"\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Can you answer these questions on Data Science Project Development<\/h2>\n\n\n\n<p><strong>Questions to answer<\/strong><\/p>\n\n\n\n<p>1. What does a data science project usually involve? What is the common theme across data science projects?<\/p>\n\n\n\n<p>2. Does industry projects and research projects differ? Why and to what extent?<\/p>\n\n\n\n<p>3. What are the some dataset repositories? Where can you get them?<\/p>\n\n\n\n<p>4. Are all public datasets perfect for doing a project?<\/p>\n\n\n\n<p>5. Can you get one dataset with all required data?<\/p>\n\n\n\n<p>6. if you have different datasets with part of the data that you need &#8212; what do you do? do you just abandon the project?<\/p>\n\n\n\n<p>7. True or false. you get a dataset and then directly apply your algorithm to get the answer or to predict?<\/p>\n\n\n\n<p>8. Is there any benefit to check the data details such as see the distribution of the data? mean and median and mode of the data?<\/p>\n\n\n\n<p>9. What are the usual steps in a data science research project?<\/p>\n\n\n\n<p>10. what is univariate analysis?<\/p>\n\n\n\n<p>11. what is bivariate analysis?<\/p>\n\n\n\n<p>12. is multivariate analysis?<\/p>\n\n\n\n<p>13. Why do you do univariate, bivariate or multivariate analysis?<\/p>\n\n\n\n<p>14. What is exploratory data analysis?<\/p>\n\n\n\n<p>15. What are the two types of exploratory data analysis.<\/p>\n\n\n\n<p>16. What are the pros and cons of visualizing the raw data?<\/p>\n\n\n\n<p>17. Is there any need to use clustering while doing data exploration? or this can be more of a methodology of the research process?<\/p>\n\n\n\n<p>18. what is data synthesis?<\/p>\n\n\n\n<p>19. What is study selection?<\/p>\n\n\n\n<p>20. What do the materials and methods section in a data science research project contain?<\/p>\n\n\n\n<p>21. Difference between Methodology and experiment?<\/p>\n\n\n\n<p>22. Difference between Exploratory analysis step and methodology step?<\/p>\n\n\n\n<p>23. What makes a good experiment?<\/p>\n\n\n\n<p>24. what is a regression analysis? Can it be an exploratory analysis? can it be as part of experimental analysis.<\/p>\n\n\n\n<p>25. what is PCA?<\/p>\n\n\n\n<p>26. What is Principal Component Analysis?<\/p>\n\n\n\n<p>27. What is factor analysis?<\/p>\n\n\n\n<p>28. True or false =&gt; PCA and Factor analysis do the same thing i,e. they are the same? why, why not explain<\/p>\n\n\n\n<p>29. What is the measure name that indicates whether factor analysis is required or not<\/p>\n\n\n\n<p>30. With PCA, you can always ignore the similar behaving\/contributing features?<\/p>\n\n\n\n<p>31. can you find out the purpose of decision trees, SVM, deep learning. is there an opportunity to apply those for the project in concern? to what extent and how? is ther any more work to do for the dataset preparation?<\/p>\n\n\n\n<p>32. Can you write Python code to visualize a feature with a box plot? If not, can you google\/startpage to find out the code and implement? What does the Boxplot for this feature saying?<\/p>\n\n\n\n<p>33. what are the prediction methodology used in the project? Which one performed the best? Does the result make sense? Does the perfect result make sense? Do you think there can be bias and the code will not be able to give similar output for other datasets?<\/p>\n\n\n\n<p>34. What are the measures used for prediction performance?<\/p>\n\n\n\n<p>35. What was used for the project: Linear\/Polynomial Regression or Logistics Regression. What is a better measure between these two for this project or in general?<\/p>\n\n\n\n<p>36. True or false, Logistic regression is for Regression<\/p>\n\n\n\n<p>37. True or false, Linear regression is for Classification<\/p>\n\n\n\n<p>38. What is the significance of R square? Low or high values of R square is desirable to say that the Regression outcome is important?<\/p>\n\n\n\n<p>39. What is f-score?<\/p>\n\n\n\n<p>40. What is the train-test-split method in Python?<\/p>\n\n\n\n<p>41. Did I use SQL in the project? Was that a must? What else could you do?<\/p>\n\n\n\n<p>42. Can you join multiple dataframe (i.e. table) in Python? Do you have left, right, inner or similar joins in Python.<\/p>\n\n\n\n<p>(When I originally wrote) Last modified: Saturday, 9 November 2019, 4:10 PM<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Can you answer these questions on Data Science Project Development Questions to answer 1. What does a data science project usually involve? What is the common theme across data science projects? 2. Does industry projects and research projects differ? Why and to what extent? 3. What are the some dataset repositories? Where can you get &hellip; <\/p>\n<p><a class=\"more-link btn\" href=\"http:\/\/bangla.sitestree.com\/?p=76257\">Continue reading<\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[182],"tags":[],"class_list":["post-76257","post","type-post","status-publish","format-standard","hentry","category---blog","item-wrap"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[{"id":26252,"url":"http:\/\/bangla.sitestree.com\/?p=26252","url_meta":{"origin":76257,"position":0},"title":"Can you answer these random questions on Data Science Project Development #Root","author":"Author-Check- Article-or-Video","date":"April 21, 2021","format":false,"excerpt":"Questions to answer 1. What does a data science project usually involve? What is the common theme across data science projects? 2. Does industry projects and research projects differ? Why and to what extent? 3. What are the some dataset repositories? Where can you get them? 4. Are all public\u2026","rel":"","context":"In &quot;FromSitesTree.com&quot;","block_context":{"text":"FromSitesTree.com","link":"http:\/\/bangla.sitestree.com\/?cat=1917"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":16371,"url":"http:\/\/bangla.sitestree.com\/?p=16371","url_meta":{"origin":76257,"position":1},"title":"Can you answer these random questions on Data Science Project Development","author":"Sayed","date":"November 9, 2019","format":false,"excerpt":"Questions to answer 1. What does a data science project usually involve? What is the common theme across data science projects? 2. Does industry projects and research projects differ? Why and to what extent? 3. What are the some dataset repositories? Where can you get them? 4. Are all public\u2026","rel":"","context":"In &quot;AI ML DS RL DL NN NLP Data Mining Optimization&quot;","block_context":{"text":"AI ML DS RL DL NN NLP Data Mining Optimization","link":"http:\/\/bangla.sitestree.com\/?cat=1910"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":78191,"url":"http:\/\/bangla.sitestree.com\/?p=78191","url_meta":{"origin":76257,"position":2},"title":"Data Requirements for Data Analytics Projects","author":"Sayed","date":"May 21, 2025","format":false,"excerpt":"Data \u2022 Dataset Characteristics \u2022Large Scale, Real, Representative, Relevant Features, balanced classes, unit relevant \u2022 Adapting data\/dataset for the project \u2022Clean, normalize\/standardize, bring more data, and bring more data of the missing type \u2022 Data Suitability for the project \u2022 Check for R Square Measure \u2022 Check for Bias, Variance,\u2026","rel":"","context":"In &quot;Analytics and Machine Learning Project Development&quot;","block_context":{"text":"Analytics and Machine Learning Project Development","link":"http:\/\/bangla.sitestree.com\/?cat=1974"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":22891,"url":"http:\/\/bangla.sitestree.com\/?p=22891","url_meta":{"origin":76257,"position":3},"title":"Graph Mining: Misc. Topics to Learn: Misc. Resources to Learn From","author":"Sayed","date":"March 21, 2021","format":false,"excerpt":"Graph Mining: Misc. Topics to Learn: Misc. Resources to Learn From Influence\/Virus\/Label Propagation Resources to learn fromPage A presentation on Influence\/Virus PropagationURL Big Data Graph Databases Resources to Learn FromPage Graph DatabaseURL Neo4jPage Big Data Graph Processing Resources to Learn FromPage ToolURL Techniques, Tools and Applications of Graph AnalyticURL Graph\u2026","rel":"","context":"In &quot;Graph Mining&quot;","block_context":{"text":"Graph Mining","link":"http:\/\/bangla.sitestree.com\/?cat=1905"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":76294,"url":"http:\/\/bangla.sitestree.com\/?p=76294","url_meta":{"origin":76257,"position":4},"title":"Misc. Plots for Data Science Projects","author":"Sayed","date":"August 25, 2024","format":false,"excerpt":"https:\/\/www.researchgate.net\/figure\/Doubledecker-plot-for-the-OvaryCancer-data-showing-the-conditional-distribution-of-X-ray_fig13_5142958 Titanic Dataset: Double Decker Plot https:\/\/www.researchgate.net\/figure\/Titanic-data-Class-Gender-Age-and-Survival-a-joint-independence-b-main-effects_fig3_2508823 Berkeley Admission Data: https:\/\/www.thoughtco.com\/uc-berkeley-admissions-787148 Geyser Data: With Contours https:\/\/www.r-bloggers.com\/2016\/10\/assessing-clustering-tendency-a-vital-issue-unsupervised-machine-learning\/","rel":"","context":"In &quot;\u09ac\u09cd\u09b2\u0997 \u0964 Blog&quot;","block_context":{"text":"\u09ac\u09cd\u09b2\u0997 \u0964 Blog","link":"http:\/\/bangla.sitestree.com\/?cat=182"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2024\/08\/image-27.png?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2024\/08\/image-27.png?resize=350%2C200 1x, https:\/\/i0.wp.com\/bangla.sitestree.com\/wp-content\/uploads\/2024\/08\/image-27.png?resize=525%2C300 1.5x"},"classes":[]},{"id":65652,"url":"http:\/\/bangla.sitestree.com\/?p=65652","url_meta":{"origin":76257,"position":5},"title":"Database Programming in .Net: ADO.Net Overview #Misc .Net","author":"Author-Check- Article-or-Video","date":"July 10, 2021","format":false,"excerpt":"Overview of ADO.net Purpose Components of ADO.net and their functions ADO.net provides disconnected database access to make minimum resource (RAM,Page Table,Heap) use. Database connections are open as long as the connection is required. Afterwards, the connection is just closed. If connections are kept active and if new connections are used\u2026","rel":"","context":"In &quot;FromSitesTree.com&quot;","block_context":{"text":"FromSitesTree.com","link":"http:\/\/bangla.sitestree.com\/?cat=1917"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/76257","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=76257"}],"version-history":[{"count":1,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/76257\/revisions"}],"predecessor-version":[{"id":76258,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/76257\/revisions\/76258"}],"wp:attachment":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=76257"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=76257"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=76257"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}