{"id":20653,"date":"2021-02-25T01:01:07","date_gmt":"2021-02-25T06:01:07","guid":{"rendered":"http:\/\/bangla.salearningschool.com\/recent-posts\/hadoop-random-notes\/"},"modified":"2021-02-25T01:01:07","modified_gmt":"2021-02-25T06:01:07","slug":"hadoop-random-notes","status":"publish","type":"post","link":"http:\/\/bangla.sitestree.com\/?p=20653","title":{"rendered":"Hadoop Random Notes"},"content":{"rendered":"<p>Some Random Notes on Hadoop&#8230;<\/p>\n<p>Why Hadoop? I randomly came across it again, so here is just some random stuff. I may try to relate my own exposure to it, or to anything that even remotely relates to Hadoop.<\/p>\n<p><strong>What is Hadoop?<\/strong><\/p>\n<p><b>Apache Hadoop<\/b> is an <a title=\"Open source\" href=\"http:\/\/en.wikipedia.org\/wiki\/Open_source\">open-source<\/a> <a title=\"Software framework\" href=\"http:\/\/en.wikipedia.org\/wiki\/Software_framework\">software framework<\/a> for storage and large scale processing of data-sets on clusters of <a title=\"Commodity hardware\" href=\"http:\/\/en.wikipedia.org\/wiki\/Commodity_hardware\">commodity hardware<\/a>. Hadoop is an <a title=\"Apache Software Foundation\" href=\"http:\/\/en.wikipedia.org\/wiki\/Apache_Software_Foundation#Projects\">Apache top-level project<\/a> being built and used by a global community of contributors and users.<sup id=\"cite_ref-poweredby_2-0\"><a href=\"http:\/\/en.wikipedia.org\/wiki\/Hadoop#cite_note-poweredby-2\">[2]<\/a><\/sup> It is licensed under the <a title=\"Apache License\" href=\"http:\/\/en.wikipedia.org\/wiki\/Apache_License\">Apache License<\/a> 2.0. 
(Wikipedia)<\/p>\n<p>&#8220;Hadoop is one way of using an enormous cluster of computers to store an enormous amount of data and then to operate on that data in parallel.&#8221; Reference: <a title=\"Hadoop Parallel Computing\" href=\"http:\/\/escience.washington.edu\">http:\/\/escience.washington.edu<\/a><\/p>\n<p><strong>Hadoop Salary in San Jose, California, USA<\/strong><\/p>\n<p><a title=\"Hadoop salary in USA\" href=\"http:\/\/www.indeed.com\/salary?q1=hadoop&amp;l1=san+jose+ca\">http:\/\/www.indeed.com\/salary?q1=hadoop&amp;l1=san+jose+ca<\/a><\/p>\n<p>&#8212;<\/p>\n<p><strong>Hadoop Salary in Toronto, Ontario, Canada<\/strong><br \/>\n<a href=\"http:\/\/www.indeed.com\/salary?q1=hadoop&amp;l1=toronto%2C+ontario%2C+canada\" target=\"_blank\" rel=\"nofollow noopener\">http:\/\/www.indeed.com\/salary?q1=hadoop&amp;l1=toronto%2C+ontario%2C+canada<\/a><\/p>\n<p>Hadoop, Parallel Computing, and Distributed Databases:<\/p>\n<p><a href=\"http:\/\/escience.washington.edu\/get-help-now\/what-hadoop\" target=\"_blank\" rel=\"nofollow noopener\">http:\/\/escience.washington.edu\/get-help-now\/what-hadoop<\/a><\/p>\n<p>A course on Parallel Computing (yes, I took it): <a href=\"http:\/\/www.geocities.ws\/sayedbuet94\/parallel\/classnotes\/\" target=\"_blank\" rel=\"nofollow noopener\">http:\/\/www.geocities.ws\/sayedbuet94\/parallel\/classnotes\/<\/a><\/p>\n<p>Introduction to distributed DBMS: <a href=\"https:\/\/cs.uwaterloo.ca\/%7Etozsu\/courses\/cs856\/F02\/lecture-1-ho.pdf\" target=\"_blank\" rel=\"nofollow noopener\">https:\/\/cs.uwaterloo.ca\/&#8230;\/cs856\/F02\/lecture-1-ho.pdf<\/a><\/p>\n<p>Book on distributed DBMS: Yes, it&#8217;s theory; for jobs, you need to study practical stuff. 
<a href=\"http:\/\/www.stanford.edu\/class\/cs347\/reading\/textbook.pdf\" target=\"_blank\" rel=\"nofollow noopener\">http:\/\/www.stanford.edu\/class\/cs347\/reading\/textbook.pdf<\/a><\/p>\n<p>Yes, I took a theory course on distributed DBMS.<\/p>\n<p>Applications of Hadoop<\/p>\n<p>Keith Wiley: Astronomical Image Processing with Hadoop <a href=\"http:\/\/escience.washington.edu\/get-help-now\/astronomical-image-processing-hadoop\" target=\"_blank\" rel=\"nofollow noopener\">http:\/\/escience.washington.edu\/&#8230;<\/a><\/p>\n<p><a title=\"Database System Architectures\" href=\"http:\/\/bangla.salearningschool.com\/database-system-architectures-%E0%A6%A1%E0%A6%BE%E0%A6%9F%E0%A6%BE%E0%A6%AC%E0%A7%87%E0%A6%9C-%E0%A6%B8%E0%A6%BF%E0%A6%B8%E0%A7%8D%E0%A6%9F%E0%A7%87%E0%A6%AE-%E0%A6%8F%E0%A6%B0-%E0%A6%86%E0%A6%B0\/#!\">A presentation of mine on Database System Architectures<\/a><\/p>\n<p><a title=\"Apache Hadoop Tutorial: MapReduce Tutorial\" href=\"http:\/\/hadoop.apache.org\/docs\/r1.2.1\/mapred_tutorial.html\">An Apache Hadoop Tutorial: MapReduce with Apache Hadoop<\/a><\/p>\n<p><a href=\"http:\/\/lintool.github.io\/MapReduceAlgorithms\/\">Hadoop Application: Data-Intensive Text Processing with MapReduce<\/a><\/p>\n<p>I have some research publications on Parallel Computing, which can be seen at: <a href=\"http:\/\/sayed.justetc.net\/publications.php\">http:\/\/sayed.justetc.net\/publications.php<\/a><\/p>\n<p>They were implemented in MPI and OpenMP, and the code was run on parallel computers [clusters of computers].<\/p>\n<p>I also worked part-time as a Research Assistant, running parallel programs to find some code and then reporting the findings after they had run for a while [I do not remember whether it was a few days or a week]. From: http:\/\/sitestree.com\/?p=524<br \/> Categories:Web Development, Root, By Sayed Ahmed, Data Warehouse Misc<br \/>Tags:<br \/> Post Data:2013-11-08 23:39:53<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Some Random Notes on Hadoop&#8230; Why Hadoop? 
&#8230;randomly came across again&#8230;.so, Just some random stuff. I may try to relate my exposure to it (or something), even that remotely relates to Hadoop What is Hadoop? Apache Hadoop is an open-source software framework for storage and large scale processing of data-sets on clusters of commodity hardware. &hellip; <\/p>\n<p><a class=\"more-link btn\" href=\"http:\/\/bangla.sitestree.com\/?p=20653\">Continue reading<\/a><\/p>\n","protected":false},"author":8,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1917],"tags":[],"class_list":["post-20653","post","type-post","status-publish","format-standard","hentry","category-fromsitestree-com","item-wrap"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[{"id":20739,"url":"http:\/\/bangla.sitestree.com\/?p=20739","url_meta":{"origin":20653,"position":0},"title":"Hadoop, NoSQL, PHP","author":"Author-Check- Article-or-Video","date":"February 27, 2021","format":false,"excerpt":"MongoDB (from \"humongous\") is an open-source document database, and the leading NoSQL database. Written in C++. http:\/\/www.mongodb.org\/ If you are in USA, Hadoop\/NoSQL can pay you a lot... 
http:\/\/www.indeed.com\/salary\/q-Hadoop-l-San-Jose,-CA.html PHP Programming in the area: http:\/\/developer.marklogic.com\/labs\/mlphp Apache Hadoop http:\/\/en.wikipedia.org\/wiki\/Apache_Hadoop Facebook and Hadoop http:\/\/gigaom.com\/2013\/11\/06\/facebook-open-sources-its-sql-on-hadoop-engine-and-the-web-rejoices\/ From: http:\/\/sitestree.com\/?p=519 Categories:Web Development, Root, By Sayed AhmedTags:\u2026","rel":"","context":"In &quot;FromSitesTree.com&quot;","block_context":{"text":"FromSitesTree.com","link":"http:\/\/bangla.sitestree.com\/?cat=1917"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":68329,"url":"http:\/\/bangla.sitestree.com\/?p=68329","url_meta":{"origin":20653,"position":1},"title":"Some Big Data Terms #Big Data","author":"Author-Check- Article-or-Video","date":"August 1, 2021","format":false,"excerpt":"MapReduce \" MapReduce is a programming model and an associated implementation for processing and generating big data sets with a parallel, distributed algorithm on a cluster.[1][2] A MapReduce program is composed of a Map() procedure (method) that performs filtering and sorting (such as sorting students by first name into queues,\u2026","rel":"","context":"In &quot;FromSitesTree.com&quot;","block_context":{"text":"FromSitesTree.com","link":"http:\/\/bangla.sitestree.com\/?cat=1917"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":22533,"url":"http:\/\/bangla.sitestree.com\/?p=22533","url_meta":{"origin":20653,"position":2},"title":"Big Data Solutions #Root #By Sayed Ahmed #Misc Hadoop Stuff #Data Warehouse Misc #Big Data","author":"Author-Check- Article-or-Video","date":"March 15, 2021","format":false,"excerpt":"Traditional Approach of Data Analysis: Data is stored in RDBMS in one or couple of computers. And software are written for analysis. However, if the data is too big this approach is not sufficient. Also, all data may not fit\/well-suited for RDBMS. 
\u00a0 Google's Approach: Google invented MAP Reduce to\u2026","rel":"","context":"In &quot;FromSitesTree.com&quot;","block_context":{"text":"FromSitesTree.com","link":"http:\/\/bangla.sitestree.com\/?cat=1917"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/xiaochongzhang.me\/blog\/wp-content\/uploads\/2013\/05\/MapReduce_Work_Structure.png?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/xiaochongzhang.me\/blog\/wp-content\/uploads\/2013\/05\/MapReduce_Work_Structure.png?resize=350%2C200 1x, https:\/\/i0.wp.com\/xiaochongzhang.me\/blog\/wp-content\/uploads\/2013\/05\/MapReduce_Work_Structure.png?resize=525%2C300 1.5x"},"classes":[]},{"id":74235,"url":"http:\/\/bangla.sitestree.com\/?p=74235","url_meta":{"origin":20653,"position":3},"title":"Demo: Hadoop Job Execution. Big Data and Hadoop Concepts","author":"Sayed","date":"April 23, 2022","format":false,"excerpt":"https:\/\/youtu.be\/y3It9aoaoT8","rel":"","context":"In &quot;From Youtube Channel&quot;","block_context":{"text":"From Youtube Channel","link":"http:\/\/bangla.sitestree.com\/?cat=1952"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/img.youtube.com\/vi\/y3It9aoaoT8\/0.jpg?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":74674,"url":"http:\/\/bangla.sitestree.com\/?p=74674","url_meta":{"origin":20653,"position":4},"title":"Execute Hadoop Scripts. 
Hadoop \u09b8\u09cd\u0995\u09cd\u09b0\u09bf\u09aa\u09cd\u099f \u099a\u09be\u09b2\u09be\u09a8.","author":"Sayed","date":"May 19, 2022","format":false,"excerpt":"https:\/\/youtu.be\/FR7Xrek01RE","rel":"","context":"In &quot;From Youtube Channel&quot;","block_context":{"text":"From Youtube Channel","link":"http:\/\/bangla.sitestree.com\/?cat=1952"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/img.youtube.com\/vi\/FR7Xrek01RE\/0.jpg?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":74668,"url":"http:\/\/bangla.sitestree.com\/?p=74668","url_meta":{"origin":20653,"position":5},"title":"Execute a Hadoop Job","author":"Sayed","date":"May 18, 2022","format":false,"excerpt":"https:\/\/youtu.be\/lKfhByEhWxk","rel":"","context":"In &quot;From Youtube Channel&quot;","block_context":{"text":"From Youtube Channel","link":"http:\/\/bangla.sitestree.com\/?cat=1952"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/img.youtube.com\/vi\/lKfhByEhWxk\/0.jpg?resize=350%2C200","width":350,"height":200},"classes":[]}],"_links":{"self":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/20653","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=20653"}],"version-history":[{"count":0,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/20653\/revisions"}],"wp:attachment":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=20653"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=20653"},{"taxonomy":"post_ta
g","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=20653"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}