Graph Mining: What is Graph Mining? Learn by finding answers to the following questions. Can you answer the following questions?

Graph Mining: What is Graph Mining? Learn by finding answers to the following questions. Can you answer the following questions?

What is Graph Mining?

What is a Graph anyway?

Is Graph mining just a kind of Machine Learning? i.e. is Machine Learning the only primary component of Graph Mining?

Does Graph Mining involve Only Statistics? Or Statistics can help?

Does Graph Mining involve Only Graph Theory? Or Graph Theory can help?

Is Graph Theory primarily relates to Computer Science or Math/Statistics or Machine Learning?

Does Graph Mining involve Only Machine Learning? Or Machine Learning is just one component?

To be great at Graph Mining – what are the areas that you need to master?

What are the applications of Graph Mining?

How the hackers can use/utilize Graph Mining?

What is the Origin of Graph Theory?

What is the first paper in Graph Theory? What does it try to solve?

What (where) are the applications of the Graph Algorithm named: Seven Bridges of König

Why is Graph and Graph Mining Important? Why do we care?

What are some application areas of Graph Mining?

How can you model Social Network to be a Graph?

Give an example of Graphs in Biology?

What is a Semantic Graph? Give examples.

Give an example of Graphs from Non-Graph Data.

Network vs. Graph?

Some Answers:
What is Graph Mining?
Ans: Discovering and Analyzing Graph Data

What are the applications of Graph Mining?
Ans: Fraud Detection, Community/Cluster detection, Recommending friends, Finding Influential Nodes [Virus Spread – not a good application]

What (where) are the applications of the Graph Algorithm named: Seven Bridges of König
Ans: Transportation, Biology, Chip Designing, Chemistry

What are some application areas of Graph Mining?
Ans: Social Networks, Semantic Web, World Wide Web, Drug Design, Computer Networks, Sensor Networks, Chemical Components

How can you model Social Network to be a Graph?
Ans: Nodes = Users, Edges = Friends/Followers

Give an example of Graphs in Biology?
Ans: Protein-Protein Interaction
Nodes: Proteins
Edges: Physical Interactions

What is a Semantic Graph? Give examples.
Ans: Wikipedia, Nodes = Concepts, Edges = Property/Type

What is a Semantic Graph? Give examples.
Ans: Concepts and relations

Give an example of Graphs from Non-Graph Data.
Ans: Network of Thrones (See resources), Student Enrollment

Network vs. Graph?
Ans: Network: Real Systems – Web, Social, Biology Terms: Network, Node, Link, Relationship

Graph: Mathematical representation Terms: Graph, Vertex/node, Edge
We can use them interchangeably

Resources:
Graph mining – lesson 1: Introduction to graphs and networks
http://www.nathalievialaneix.eu/teaching/m2se/M2SE-network_1.pdf

Graph theory
https://en.wikipedia.org/wiki/Graph_theory

Seven Bridges of König
https://en.wikipedia.org/wiki/Graph_theory#History
http://www.cs.kent.edu/~dragan/ST-Spring2016/The%20Seven%20Bridges%20of%20Konigsberg-Euler’s%20solution.pdf

Semantic Network
https://en.wikipedia.org/wiki/Semantic_network

Network of Thrones
https://www.macalester.edu/~abeverid/thrones.html

Graph from Non-Graph data
Manage Data in Excel With Databases, Tables, Records, and Fields
https://www.lifewire.com/manage-data-with-databases-tables-records-and-fields-in-excel-4178649

Graph Mining: Introduction
https://hpi.de/fileadmin/user_upload/fachgebiete/mueller/courses/graphmining/2017/01-Introduction.pdf

Software for graph visualization and mining: Gephi (https://gephi.org/), Tulip (https://tulip.labri.fr/TulipDrupal/) and Cytoscape (https://cytoscape.org/)

Packages dedicated to graphs:
for Python: igraph (https://igraph.org/), NetworkX (https://networkx.github.io/) and graph-tool (https://graph-tool.skewed.de/); Snappy (https://snap.stanford.edu/snappy/)

¤ for R: igraph (https://igraph.org/), statnet (http://statnet.org/), bipartite (https://cran.r-project.org/web/packages/bipartite/) and tnet (https://toreopsahl.com/tnet/);

¨Datasets
¤ Mark Neuman’s page: http://www-personal.umich.edu/~mejn/netdata
¤ Stanford Dataset: https://snap.stanford.edu/data/
¤ KONECT: http://konect.uni-koblenz.de/networks/
¤ ICON: https://icon.colorado.edu/#!/

¨Animation of Algorithms
¤ https://www.cs.usfca.edu/~galles/visualization/Algorithms.html

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz



Not that this site is doing any great work; However, if you want to contribute to the operation of this site (or other charitable/non-profit work in the education sector), you can financially contribute to: safoundation at salearningschool.com using Paypal (Credit Card Accepted).

Robots for Biopsy and Surgery. The intersection of Medical and Engineering.

Robots for Biopsy and Surgery. The intersection of Medical and Engineering.
From Google search. I cannot say if the information on the URLs are reliable.

FDA Clears Intuitive Surgical’s Lung Cancer Biopsy Robot
https://www.mpo-mag.com/contents/view_breaking-news/2019-02-20/fda-clears-intuitive-surgicals-lung-cancer-biopsy-robot/

World’s smallest and most accurate 3-D-printed biopsy robot
https://phys.org/news/2017-07-world-smallest-accurate-d-printed-biopsy.html

Intuitive Surgical Releases Ion Robotic Lung Biopsy System
https://www.medgadget.com/2019/02/intuitive-surgical-releases-ion-robotic-lung-biopsy-system.html

Robot-assisted stereotactic brain biopsy: systematic review and bibliometric analysis
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5996011/

A student-designed device may allow faster, more accurate lung biopsies.
https://www.technologyreview.com/s/404729/robotic-biopsy/

More precise.1 More flexible.1 For more answers.
https://www.intuitive.com/en-us/products-and-services/ion

Robots from Space Lead to One-stop Breast Cancer Diagnosis Treatment
https://www.nasa.gov/mission_pages/station/research/news/b4h-3rd/hh-space-robots-breast-cancer-diagnosis/

Evolution of Robot-assisted ultrasound-guided breast biopsy systems
https://www.sciencedirect.com/science/article/pii/S1687850717301632

Ion lung biopsy system from Intuitive Surgical wins FDA approval
https://www.therobotreport.com/ion-lung-biopsy-intuitive-surgical-fda/

Robotic Partial Nephrectomy: Robotic Surgery for Kidney Cancer and Benign Kidney Tumors
https://med.nyu.edu/robotic-surgery/physicians/procedures/z-procedures-guide/robotic-partial-nephrectomy

Robotic kidney transplantation allows safe access for transplant renal biopsy and percutaneous procedures.
https://www.ncbi.nlm.nih.gov/pubmed/31483897

Robotic assisted kidney transplantation
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4120216/

Robotic & Image Guided Surgery
https://my.clevelandclinic.org/departments/urology-kidney/depts/robotic-laparoscopic-surgery

Laparoscopic and Robotic Partial Nephrectomy
https://urology.ufl.edu/patient-care/robotic-laparoscopic-urologic-surgery/procedures/laparoscopic-and-robotic-partial-nephrectomy/

Should I have surgery to remove part or all of my kidney for my kidney tumor?
A decision aid to discuss treatment options with your urologist
https://decisionaid.ohri.ca/docs/das/Surgery_for_Kidney_Cancer.pdf

Robotic kidney transplantation allows safe access for transplant renal biopsy and percutaneous procedures
https://onlinelibrary.wiley.com/doi/abs/10.1111/tri.13517

Ultrasound Imaging for Robotic Surgery
https://www.bkmedical.com/applications/robotic-assisted-surgery-ultrasound/

An MRI-Compatible Robotic System for Breast Biopsy
https://www.youtube.com/watch?v=TlLN9drqidU

Robotic biopsy tool could revolutionise cancer screening
https://www.reuters.com/video/2018/07/23/robotic-biopsy-tool-could-revolutionise?videoId=448136910

3D Printed MRI Biopsy Robot
https://www.youtube.com/watch?v=dF66T63jvhQ

—-

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz



Not that this site is doing any great work; However, if you want to contribute to the operation of this site (or charitable/non-profit work in the education sector), you can financially contribute to: safoundation at salearningschool.com using Paypal.

Graph Mining: K-Spanning Trees: Learn by finding answers to the following questions. Can you answer the questions?

Graph Mining: K-Spanning Trees: Learn by finding answers to the following questions. Can you answer the questions?

What is a Spanning Tree?

What is a Minimum Spanning Tree?

For the undirected Graph below, what is a Spanning Tree? First, draw the Graph, then Try to find the answer, and then check the answer below.
Edges with weights: {1, 2, 7} {1, 3, 2} {1, 4, 4} {2, 3, 3} {2, 4, 5} {3, 4, 2} {3, 5, 6} {4, 5, 4}

What is a minimum spanning tree?

What is the minimum spanning tree for the graph as provided above?

What is Prim’s Algorithm for Spanning Trees?

Describe the steps as used in Prim’s Algorithm?

Give the Algorithm i.e. Prim’s Algorithm?

Write pseudo-code for Prim’s Algorithm?

Implement Prim’s Algorithm in Python, R, Matlab, or in any other language of your choice?

What is the initialization step in Prim’s Algorithm?

Can you choose any node to start with Prim’s Algorithm?

True or False, Prim’s algorithm does not include all nodes in the output?

How an edge/path is selected in Prim’s Algorithm?

True or False, to proceed and when to select an edge, the edge does not need to belong to the already formed Minimum Spanning Tree (MST)?

True or False, to proceed and to select a new edge, the edge will need to have the highest weight outgoing from the MST or from the vertex under consideration)?

True or False, to proceed and to select a new edge, the edge will need to have the minimum weight outgoing from the MST (or from the vertex under consideration)?

True or False? The new edge selection steps are as follows.
Step 1. For the new edge, exactly one of its vertices/edge-points will need to be in the MST already
Step 2. The new edge has the minimum weight that satisfies step 1.

How long will you continue to add new edges to the MST? i.e. terminating condition for your algorithm.

Apply Prim’s algorithm to find the MST in Graph given above/below: See the answer after you give it a try
Edges with weights: {1, 2, 7} {1, 3, 2} {1, 4, 4} {2, 3, 3} {2, 4, 5} {3, 4, 2} {3, 5, 6} {4, 5, 4}

What is K-Spanning tree?

What are the steps in K-Spanning Tree?

Give the Algorithm for K-Spanning?

Write pseudo-code for K-Spanning?

Implement the K-Spanning tree algorithm in Python, R, Matlab, or in any other language of your choice?

What is K in the K-Spanning Tree algorithms?

How many edges do you remove to create k Clusters?

What is the first step in the K-Spanning tree?

When you remove edges from an MST – which edges do you remove? Edges represent distance. The ones with the highest weights or the lowest weights?

Some Answers:

What is a Spanning Tree?
Ans: A connected subgraph with all vertices and no cycles

For the undirected Graph below, what is a Spanning Tree?
Edges with weights: {1, 2, 7} {1, 3, 2} {3, 4, 2} {3, 5, 6}
Weight: 17

What is a minimum spanning tree?
Ans: Spanning trees with the minimum sum of edge weights where weights indicate distances.

What is the minimum spanning tree for the graph as provided above?
Ans: the same spanning tree with weight sum = 17
i.e. {1, 2, 7} {1, 3, 2} {3, 4, 2} {3, 5, 6}

What is Prim’s Algorithm for Spanning Trees?
Ans: Finds the minimum spanning tree

Apply Prim’s algorithm to find the MST in Graph given above/below: See the answer after you give it a try
Edges with weights: {1, 2, 7} {1, 3, 2} {1, 4, 4} {2, 3, 3} {2, 4, 5} {3, 4, 2} {3, 5, 6} {4, 5, 4}
Draw the graph that will make things easier

Ans: Select node 5
Select (5, 4, 4) from {3, 5, 6} {4, 5, 4}
Add (5, 4) to the MST
select edge {3, 4, 2} from {3, 5, 6} {2, 4, 5} {3, 4, 2} {1, 4, 4}
MST: {5, 4, 4} {3, 4, 2}
Similarly: select (3, 2, 3) {1, 3, 2}
Final Output: {5, 4, 4} {3, 4, 2} (3, 2, 3) {1, 3, 2}

What is the K-Spanning tree?
Ans: Community/Clustering Algorithm

When you remove edges from an MST – which edges do you remove? Edges represent distance. The ones with the highest weights or the lowest weights?
Ans: highest

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz



Not that this site is doing any great work; However, if you want to contribute to the operation of this site (or charitable/non-profit work in the education sector), you can financially contribute to: safoundation at salearningschool.com using Paypal.

Graph Mining: Betweenness Based Clustering: Learn by finding answers to the following questions. Can you answer the following?

Graph Mining: Betweenness Based Clustering: Learn by finding answers to the following questions. Can you answer the following?

What are the types of Betweenness?

What is Vertex Betweenness? i.e. the concept

What is Edge Betweenness? i.e. the concept

Describe Vertex Betweenness? maybe just giving an example

What does Vertex Betweenness clustering achieve? i.e. the output?

Give an Algorithm for Vertex Betweenness Clustering?

What is mu in the Algorithm for Vertex Betweenness Clustering?

Which vertex do you select at each step? i.e. the one with the highest betweenness or the lowest betweenness?

When you select a vertex at each step? Then what do you do (i.e. to create clusters)

Does the selected vertex go to all the clusters created around it?

What is the other name for Edge Betweenness?

Describe Edge Betweenness? maybe just giving an example

What does Edge Betweenness clustering achieve? i.e. the output?

Give an Algorithm for Edge Betweenness Clustering?

What is mu in the Algorithm for Edge Betweenness Clustering?

Which Edge do you select at each step? i.e. the one with the highest betweenness or the lowest betweenness?

When you select an Edge at each step? Then what do you do (i.e. to create clusters)

Does the selected Edge go to all the clusters created around it? or you just cut the graph at that edge and create clusters.

Answers:

Describe Vertex Betweenness? maybe just giving an example
Ans: The total number of shortest paths that pass through the vertex = the Vertex Betweenness for that Vertex. i.e. Among All vertices to all vertices shortest paths.

What is the other name for Edge Betweenness Clustering?
Ans: Girvan & Newman Clustering

What is Edge Betweenness? i.e. the concept
Ans: For a given edge, the number of shortest paths that pass through the edge. (All pairs/nodes to all pairs/nodes shortest paths)

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz





Not that this site is doing any great work; However, if you want to contribute to the operation of this site (or charitable/non-profit work in the education sector), you can financially contribute to: safoundation at salearningschool.com using Paypal.




Graph Mining: Shared Nearest Neighbors : Clustering : Community Detection

Graph Mining: Shared Nearest Neighbors : Clustering : Community Detection

Graph Mining: Shared Nearest Neighbors (SNN): Clustering : Community Detection: Learn by Finding Answers to the Following Questions.

Will use SNN sometimes.

What is one another name of the algorithm: Shared Nearest Neighbors?

What is the purpose of the Algorithm: Shared Nearest Neighbors?

Can you name other Algorithms that serve the same or similar purpose?

What is the Criteria that SNN uses?

Is SNN Hierarchical? What does Hierarchical mean in the context of Clustering/Community detection?

How does SNN work? i.e. what is the algorithm? i.e. how does SNN create clusters/communities?

Give and explain the steps in SNN with a small example.

What is threshold τ in the SNN algorithm? Is it numeric? What can be the maximum and the minimum values?

For the following Undirected and unweighted Graphs, show the steps and the final clusters.
Edges: {0, 1} {0, 2} {0, 3} {1, 2}, {1, 3} {2, 3} {2, 4} {3, 4}
You can draw the graph first. You can use τ = 2.

What is Node Similarity or Node Proximity between two nodes in SNN?

What is the output of SNN? i.e. on an undirected and unweighted Graph?

Will the output will have weights on the edges?

Give the pseudocode for SNN algorithm

Implement the SNN algorithm in Python or R or Matlab – whichever you prefer.

Can you apply SNN on weighted Graphs?

If you can apply SNN on weighted graphs, then how will you apply the Threshold, say theta?

What is the first step in applying theta – if that is doable?

What is the difference between tau (τ) and theta.

What is k in SNN algorithms (if Weighted)?

How do we know which tau(τ) or ”k” to choose?

What are some evaluation metrics for tau or k?

what is Conductance?

Conductance whose property is this? The input graph, the output graph or the possible communities (from where you select the final communities)

Is low value or high value of Conductance – that is used for the communities?

When comparing two k or tau values – which k or tau that you accept? Think in terms of Conductance.

So, what was the purpose behind choosing these K or Tau values? What do we want to achieve ultimately? Is it to achieve Stronger communities by checking all possible communities and then measuring the community strength using Conductance? If it is – do all the questions on Weighted Graphs make sense? Why, Why not? How?

Write a Pseudo-code for SNN on Undirected Weighted Graphs. Find the optimal value for K and tau.

Implement SNN on Undirected Weighted Graphs. Print the optimal value for K and tau.

Why are we detecting communities and clusters anyway? What are the practical applications of such algorithms?
Hint: Check notes on Introduction to Clustering/Community detection

Some Answers:
What is one another name of the algorithm: Shared Nearest Neighbors?
Ans: Jarvis-Patrick algorithm

Can you name other Algorithms that serve the same or similar purpose?
Ans: Maximal-Clique Enumeration, K- Spanning Tree, Shared Nearest Neighbor, Highly Connected Components, MinCut, Betweenness Based Algorithms, Louvain Modularity, CNM Algorithm

What is the Criteria that SNN uses?
Ans: Two nodes are similar if they share a lot of neighbors

Is SNN Hierarchical?
Ans: No

What is Node Similarity or Node Proximity between two nodes in SNN?
Ans: Count of the shared number of nodes/neighbors

What is k in SNN algorithms (if Weighted)?
retain its k neighbors

Resources:

Jarvis-Patrick Clustering
https://btluke.com/jpclust.html

Empirical Comparison of Algorithms for Network Community Detection
https://cs.stanford.edu/~jure/pubs/communities-www10.pdf

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Not that this site is doing anything great work; However, if you want to contribute to the operation of this site (or charitable/non-profit work in education sector), you can donate to: safoundation using Paypal.

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz

Has your DNS got hijacked?

Tool: https://www.f-secure.com/en_GB/web/home_gb/router-checkerKnowledgebase: https://securitytrails.com/blog/dns-hijacking

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz

Graph Mining: Community Detection: Learn by finding answers to the following questions. Can you answer the following questions on Community Detection?

Graph Mining: Community Detection: Learn by finding answers to the following questions. Can you answer the following questions on Community Detection?

What is a community anyway? Describe from your real-world/social understanding of it?

Can you relate it to the Graphs concept in Computer Science?

In your daily life do you interact with a community in a real world? or in your communication with others using technologies?

What is Graph as defined in Computer Science Books/Practice? Can you formally represent a Graph? What are the components of a graph?

Is a whole Graph a Community? or Graphs can be subsets (Subgraph) as well?

What is a Sub-graph anyway?

Can you identify communities in real-world networks? i.e. real world networks that can be represented as graphs?

Give examples of real-world Graphs as well.

Can you detect a community in the tissues or the organs in the human body? Can you represent these as Graphs?

Can you detect a community in Friends on social media with similar interests? i.e. is it a community? Can you represent these/this as Graphs?

Can you detect/define a community for the Neighborhoods where a hurricane will likely pass? i.e. is it a community?

Is it a community: People that are likely to be affected by a contagious disease?

Will the properties of a community (subgraph) be the same like the whole graph (i.e. average properties of the whole graph)? or will it differ? When? How? Why? Examples?

How does the formation of communities affect? i.e. How does the existence or not of communities affect? You can provide generic answers or just explain with examples as well.

Do you think that you are a part/node of communities in online social networks such as Facebook, twitter? If so, how and when you are affected? If so, how and when you can affect others?

Can communities affect rumor spreading or disease spreading or epidemic spreading? Can you also identify how a rumor/disease will spread i.e. from the source i.e. which path will it take, how will it flow, who will get affected?

Is it that the spreading will only affect the nodes/parts of the communities? Do you see any limitations of this thought?

Who else can get affected? Can you predict that?

What is Link prediction? What are Link Prediction Algorithms?

Is it a possibility that current communities can be fine tuned? i.e. remove links/nodes and add new links/nodes? Why, why not? How? Is it the right thing to do? why, why not?

What is the difference between clustering and community detection?

What is thought to be of more real world entitities/geared? Community Detection or Clustering?

What is thought to be more geared towards structural Properties such as Min-Cut?

What is Min-Cut anyway?

For modeling, understanding and analysis purpose – can we consider them the same/similar? Community Detection or Clustering?
Ans: Yes

What is Cluster Analysis i.e. Community Detection process? i.e. what will be your process to identify clusters/communities?

Provide the names of some Community Detection Algorithms. Can you explain them as well? Can you formally define/represent and/or show pictorially what they do and how they work (steps as well)? Check the answer section, after you give it a try. You must have studied them in Computer Networks or related courses/concepts (Data Communication, CCNA, CCNP, Vehicle Routing)

How can you evaluate that the detected communities/clusters are good/great communities/clusters?

Answers:
What is a community?
a subset of nodes within the graph such that connections between the nodes (in the subset) are denser than connections with the rest of the network (i.e. from the subset to the outer nodes)

What is Cluster Analysis i.e. Community Detection process? i.e. what will be your process to identify clusters/communities?
"The process of dividing nodes of a graph into possibly overlapping, subsets, where nodes in each subset are considered related by some similarity measure"

Provide the names of some Community Detection Algorithms
Maximal-Clique Enumeration, K- Spanning Tree, Shared Nearest Neighbor, Highly Connected Components, MinCut, Betweenness Based Algorithms, Louvain Modularity, CNM Algorithm

Resources:
Defining and identifying communities in networks
https://www.pnas.org/content/101/9/2658

Community Structure:
https://en.wikipedia.org/wiki/Community_structure

Graph Clustering:
https://www.csc2.ncsu.edu/faculty/nfsamato/practical-graph-mining-with-R/slides/pdf/Graph_Cluster_Analysis.pdf

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz

Graph Mining: SMALL WORLD GRAPHS & RANDOM GRAPH GENERATORS: Learn by finding answers to the following questions.

Graph Mining – 002: SMALL WORLD GRAPHS & RANDOM GRAPH GENERATORS

Graph Mining: SMALL WORLD GRAPHS & RANDOM GRAPH GENERATORS: Learn by finding answers to the following questions.

Can you answer the following?

What is a Graph?

Can you define graph formally?

What are the components in a Graph?

What are edges and nodes or vertices in a Graph?

Can you provide examples of Graphs? in Areas such as Science, Engineering, Medical, Biology, Social Sciences, Internet Applications, Social Media?

Can you draw a simple graph?

What is a Random Graph?

What do you use to answer questions about the properties of typical graphs?

What is the purpose of a Random Graph?

What types of graphs you use to answer questions about the properties of typical graphs?

How can you generate a Random Graph?

What is known as Small World Properties for Graphs?

Give some examples of Real Graphs

What is the connection between Real Graphs and Small World Properties?

What are some properties of Small World Networks?

Can you give examples of Non Small World Networks?

For small World Graphs – what will be the average path lengths (count of vertices) – a large number of a small number?

Are small world graphs sparse or dense?

What is power law distribution? How does that related to Small World Graphs?

How can you relate Power Law Distribution to Github network?

What is power law in statistics?

Can you name some Random Graph Generators? What is the purpose? What will they do actually in general? Why will you use these generators.

When edges are created between vertices with probability p : what is this mode of generation called? Is it a Random Graph? Why?

What happens when p = probability is closer to 0? what when p is closer to 1?

What is weight in a random graph Generator for example in the above case?

When a Graph is generated using the probability approach – what is the order of the maximum size graph?

Can you represent the graph generated above as G(n, p)? n = number of vertices, p = probability

For the above case, what happens when np < 1 ?

For the above case, what happens when np = 1 ? Can np be > 1?

When a Graph G(n, p) is generated using the probability approach as mentioned above – and when np < 1, what is the order of the size of a connected component? is it O(n), or o(n), O(log(n)), O(Olog(n)). Check answer section

What is Erdos Renyi? Is it the name of a Random Graph Generator or the name of a small world property or a statistical law?

For Erdos Renyi, is the average path length short?

For Erdos Renyi, do they exhibit local clustering?

Is local clustering a property of small world graphs?

For Erdos Renyi, does it show Skewed degree distribution i.e. skewed power law distribution?

For Erdos Renyi, does it show poission distribution?

What is poission distribution? anyway?

Can you relate poission distribution to Banks or a Diagnostic Center?

What is Watts–Strogatz model for Random Graph Generation? How does it work?

Does Watts–Strogatz model create a directed graph or an undirected graph?

For Watts–Strogatz model, if the desired Node count is N, then what will be the output edge count? You can introduce and define a new parameter if required.

For Watts–Strogatz, is the average path length short?

For Watts–Strogatz, do they exhibit local clustering?

Is local clustering a propery of small world graphs?

For Watts–Strogatz, does it show Skewed degree distribution i.e. skeweed power law distribution?

For Watts–Strogatz, does it show poission distribution?

how does Watts–Strogatz relate to Dirac Delta Function?

what is Barabási–Albert (BA) model? Is it the name of a Random Graph Generator or the name of a small world property or a statistical law?

How does Barabási–Albert (BA) model work? i.e. the steps i.e. the methods

For Barabási–Albert (BA) model, how is a new node added?

For Barabási–Albert (BA) model, does it start with an empty graph or with an already connected graph?

For Barabási–Albert (BA) model, how a new node is connected to an existing node? What does it depend upon? How does a probability play a role?

For Barabási–Albert (BA) model, which nodes will accumulate more links quickly?

For Barabási–Albert (BA) model, which nodes will starve for new links?

For Barabási–Albert (BA) model, is the average path length short?

For Barabási–Albert (BA) model, do they exhibit local clustering?

For Barabási–Albert (BA) model, does it show Skewed degree distribution i.e. skeweed power law distribution?

Answers:

How can you generate a Random Graph?
Start with n isolated vertices, Add successive edges between them at random

Give some examples of Real Graphs
Food webs, Electric power grids, Metabolite processing networks, Networks of brain neurons, Voter networks, Social influence networks, Co-occurrence networks, Networks of connected proteins

What are some properties of Small World Networks?
"Properties of small-world networks
Small-world networks tend to contain cliques, and near-cliques, meaning sub-networks which have connections between almost any two nodes within them. This follows from the defining property of a high clustering coefficient. Secondly, most pairs of nodes will be connected by at least one short path. This follows from the defining property that the mean-shortest path length be small. Several other properties are often associated with small-world networks. Typically there is an over-abundance of hubs – nodes in the network with a high number of connections (known as high degree nodes). These hubs serve as the common connections mediating the short path lengths between other edges. By analogy, the small-world network of airline flights has a small mean-path length"
Reference: https://en.wikipedia.org/wiki/Small-world_network#Examples_of_small-world_networks

Can you give examples of Non Small World Networks?
Ans: Check: Reference: https://en.wikipedia.org/wiki/Small-world_network#Examples_of_small-world_networks

What is power law in statistics?
"In statistics, a power law is a functional relationship between two quantities, where a relative change in one quantity results in a proportional relative change in the other quantity, independent of the initial size of those quantities: "
https://en.wikipedia.org/wiki/Power_law#Power-law_probability_distributions

Can you name some Random Graph Generators?
Erdős–Rényi model, Watts–Strogatz model, Barabási–Albert (BA) model

When a Graph is generated using the probability approach – what is the order of the maximum size graph? Check answer
n to the power 2/3

For Watts–Strogatz model, if the desired Node count is N, then what will be the output edge count? Ans: NK/2

Watts–Strogatz model
K = mean degree = an even number
B = a special parameter
N >> K >> ln(K) >> 1

Check degree distribution for Watts–Strogatz model at: https://en.wikipedia.org/wiki/Watts%E2%80%93Strogatz_model
"The degree distribution in the case of the ring lattice is just a Dirac delta function centered at {\displaystyle K}K. The degree distribution for {\displaystyle 0<\beta <1}0<\beta <1 can be written as:

Resources:
Graph mining – lesson 1: Introduction to graphs and networks
http://www.nathalievialaneix.eu/teaching/m2se/M2SE-network_1.pdf

Erdős–Rényi model
https://en.wikipedia.org/wiki/Erd%C5%91s%E2%80%93R%C3%A9nyi_model

Random Graphs
https://en.wikipedia.org/wiki/Random_graph

Examples of small-world networks
https://en.wikipedia.org/wiki/Small-world_network#Examples_of_small-world_networks

Case Study: Small World Phenomenon
https://introcs.cs.princeton.edu/java/45graph/

Random Graphs: Model of Social Networks
http://www.pnas.org/content/99/suppl_1/2566

Power Law Distribution
https://en.wikipedia.org/wiki/Power_law#Power-law_probability_distributions

Poisson distribution
https://en.wikipedia.org/wiki/Poisson_distribution

Watts–Strogatz model
https://en.wikipedia.org/wiki/Watts%E2%80%93Strogatz_model

Dirac delta function
Everywhere it is zero except at zero
https://en.wikipedia.org/wiki/Dirac_delta_function

Barabási–Albert (BA):
https://en.wikipedia.org/wiki/Barab%C3%A1si%E2%80%93Albert_model#Degree_distribution

Random Graph Generators in Python Libraries
https://networkx.github.io/documentation/networkx-1.10/reference/generators.html

Random Graph Generators in R Libraries
https://rpubs.com/lgadar/generate-graphs

By
Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz

Graph Mining: Link Prediction. Learn by finding answers to the following questions.

Graph Mining: Link Prediction. Learn by finding answers to the following questions.

What is Link Prediction in Graph Mining?

How can Link Prediction help? and where? can it help in social network, research co-authorship, or spread of disease

Give uses of Link Prediction in epidemiology or any medical/biology application?

Give uses of Link Prediction in Social Network?

Give uses of Link Prediction in co-authorship?

What are the steps in Link Prediction? i.e. define link prediction problem.

Define Link Prediction Problem formally i.e. with Notations i.e. with Graph notations i.e. Graph Representations.

How is node similarity related in Link Prediction?

What are possible Node Similarity algorithms that can be used for the purpose?

Which nodes are the most possible new links? think in terms of Node Similarity Index (Highest, Lowest)

What are the types of Node Similarity? Node proximity?

What is Node Similarity based upon? i.e. Network Topology

What is Local Structure for Node Similarity?

What is Global Structure for Node Similarity?

What are examples of Local Structure for Node Similarity?

What are examples of Global Structure for Node Similarity?

Define Node Neighborhoods. Is it local or global structure?

What is Preferential Attachment Index? Is it LocaL or global structure

What are the types of Node Neighborhoods?

If two nodes do not have any common node — what is the probable similarity (Node Neigh. aspect)?

What is Common Neighbors ? Is it local or global structure? is it Node Neighbor or Preferential Attachment? and why?

What is Jaccard Coefficient ? Is it local or global structure? is it Node Neighbor or Preferential Attachment? and why?

What is Adamic-Adar? Is it local or global structure? is it Node Neighbor or Preferential Attachment? and why?

What is the name of the Node Similarity approach where Sum of the inverse logarithmic degree centrality of the neighbors shared by the two nodes are counted?

What is the name of the Node Similarity approach where the ratio of seize of set intersection to the set union is counted?

What is the name of the Node Similarity approach where the count of common nodes are used?

If node similarity score is counted as multiplication of the number/count of outgoing edges for a node pair. And then the higher results are assumed to create new links. What is this approach called?

When similarity scores are calculated based on global link structure of graph – what is this called local structure or global structure?

What is an example of Global Structure?

What are the examples of Global Structure?

What is Kartz Index for Global Structure?

What is Simrank for Global Structure?

When Node Similarity is calculated as: Sum of count of all paths between node pairs – what is this approach called? Then how is the link prediction made?

When Node Similarity is calculated as: two nodes are similar if they are referred by similar nodes. What is the name?

How can you measure if your implemented link prediction algorithm is great or not?

Can you use train and test concept for the measurement? Can you define the steps/problems formally? with Graph Notations?

What are some measures to calculate in the train/test approach?

Answers:
What is the name of the Node Similarity approach where Sum of the inverse logarithmic degree centrality of the neighbors shared by the two nodes are counted?
Ans: Adamic-Adar

What is the name of the Node Similarity approach where the ratio of seize of set intersection to the set union is counted?
Ans: Jaccard Coefficient

What is the name of the Node Similarity approach where the count of common nodes are used?
Ans: Common Neighbors

If node similarity score is counted as multiplication of the number/count of outgoing edges for a node pair. And then the higher results are assumed to create new links. What is this approach called?
Ans: Preferential Attachment Index

What is an example of Global Structure?
Ans: Path Length > 2

What are the examples of Global Structure?
Shortest Paths – use inverse of distance as similarity, Kartz Index, SimRank

What are some measures to calculate in the train/test approach?
Accuracy, F1-score, Sensitivity, Most metrics that would work for classification

Resources
Link Prediction
https://paperswithcode.com/task/link-prediction

Similarity Index based Link. Prediction Algorithms in Social Networks: A Survey
https://pdfs.semanticscholar.org/8e72/fa77f3d788f3c67da1e1c6347c3aaf280723.pdf

Proximity-based Methods for Link Prediction
https://cran.r-project.org/web/packages/linkprediction/vignettes/proxfun.html

Evaluating Link Prediction Methods
https://arxiv.org/pdf/1505.04094.pdf

Link Prediction Algorithm
http://be.amazd.com/link-prediction/

Evaluating link prediction methods
https://www3.nd.edu/~dial/publications/yang2015evaluating.pdf

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz

Deep Learning (DL): can you answer these introductory questions on DL? Target: Starters in DL

Deep Learning – 001: Introduction to Deep Learning.

Deep Learning (DL): can you answer these introductory questions on DL? Target: Starters in DL

Can you define AI, ML, DL?
Can you draw a diagram to show the relations of AI, ML, DL?

What is Symbolic AI?

Is Symbolic AI good for Image Classification, Speech recognition, and Language Translation? and Why?

How are AI, ML, DL for Image Classification, Speech recognition, and Language Translation? and Why? Can you compare these among them?

What is the motivation behind inventing Machine Learning?

How does programming differ for ML than classical programming?

Which one uses data intensively? Classical Programming or ML?

Which one uses Rules intensively? Classical Programming or ML? and why?

When was ML invented i.e. the idea? When it flourished? and why?

Why did not ML flourish earlier?

How recent is Deep Learning (DL)? When did it start to flourish?

How prominent is DL in Kaggle contests?

What is more extensively use in DL? Math or Engineering? and why? Is it for good or for bad? Can you think of any limitations?

Among ML, and DL where math is more extensively used? and why? where in AI, ML< DL such Math focused study might not be the greatest option? and why?

How ML differ from Math and Statistics esp. Statistics and why?

Can you name some Prediction approaches in ML?

How are data represented in ML and DL?

What is important in ML? Deep understanding or Representation of the layers of the process and data?

What is important in Dl? Deep understanding or Representation of the layers of the process and data?

What is the link among Brain, DL, and NN (Neural Network?)

What are other possible names of Deep Learning?

Does deep learning identify digits (picture recognition, image classification) in one step or multiple steps?

What is Deep NN?

What are some parameters for NN and DL? or just name what you use when writing (representing) a DL solution.

What are Weights, Input, Layers, Predictions, loss function, objective function in DL?

Can you define/explain (not memorized) and give examples of Weights, Input, Layers, Predictions, loss function, objective function in DL?

What is the optimizer in DL?

What is a Back-propagation Algorithm in DL? Explain, Example, Draw, write code and show

What applications have used DL successfully?

Will you apply DL for all Learning, classification, prediction problems? Why or why not? When DL can do the best?

What are other alternate approaches than DL?

Why did not DL florished/used much in the past?

Is handwritten digit classification be an application where DL can be used? What companies might get benefit with such applications?


Misc.

What are Kernel methods? Give examples. How are they related to DL?

Is SVM a kernel method? What does SVM stand for?

How does SVM work?

Is SVM great for image classification?

What is Gini Index? What does it mean?

What is Information gain in decision trees? what is the name?

What is random forest?

What is Gradient Boosting?

What DL approach worked best for image classification?

What is CNN? What is RNN? What is RNN-LSTM? What is LSTM? What is gated RNN?

How does CNN work? Can CNN be used for Image Classification? What level of accuracy you can achieve with CNN? If used, how will CNN work to classify images?

Do you know what Gradient Boosting is?

What are XGBoost Library?

What is Keras library?

What are the libraries available for DL implementation? What are the languages that are best for DL? and why?

CNN got invented on 1989; LSTM on 1997. Why they got popularity or in use today? Any similarities why DL flourished now?

what kind of CPU is geared towards DL?

What is CUDA?

What are the contributions of NVIDIA for DL?

What is common in NN, and DL? Addition, multiplication, matrix, matrix-multiplication?

What are the potential future applications of DL?

What are the potential improvements potential or important for DL?

Can you do DL in Python Library, Theano, TensorFlow, Keras?

What is the most used for DL? Python Library, Theano, TensorFlow, Keras

What is Deep Mind? What does the project do?

What is RMSProp?

What is Adam?

What are some optimization approaches in DL?

What is the role of Weight in DL? and in DL code?

What is TPU? Who invented it?

Give examples of some Loss functions used in DL

Some Answers

What is Symbolic AI?

"Symbolic artificial intelligence is the term for the collection of all methods in artificial intelligence research that are based on high-level "symbolic" (human-readable) representations of problems, logic and search. Symbolic AI was the dominant paradigm of AI research from the mid-1950s until the late 1980s."
From: https://en.wikipedia.org/wiki/Symbolic_artificial_intelligence

Example: Prolog
I did teach a very introductory practical course on Prolog on around 2002 – 2003. For sure, I do not remember the content that I taught.

"Symbolic Reasoning. A reasoning is an operation of cognition that allows – following implicit links (rules, definitions, axioms, etc.) – to produce new knowledge from already existing knowledge. The reasoning is said to be automated when done by an algorithm.
Symbolic Reasoning – Sem Spirit"

From: www.semspirit.com › artificial-intelligence › symbolic-reasoning

What are other possible names of Deep Learning?
Ans: Layered representations learning?
Hierarchical representations learning?

What are some parameters for NN and DL?
Ans: Weights, Input, Layer, Predictions, loss function, objective function

What applications have used DL successfully?
Ans: Human level image classification, Speech recognition, Hand-writing, machine translation, text to speech, Digital Assistant Google Now, Amazon Alexa, Autonomous Cars, Ads Targeting, Natural Language Questions, superhuman go Playing

What are other alternate approaches than DL?
Ans: Statistics for data analysis, Naive Bayes Algorithm, logistic Regression, hello world algorithm.

Why did not DL flourished/used much in the past?
Ans: Missing efficient way of training large neural networks

What is XGBoost?
"XGBoost is an implementation of gradient boosted decision trees designed for speed and performance."
https://machinelearningmastery.com/gentle-introduction-xgboost-applied-machine-learning/

Give examples of some Loss functions used in DL
"This tutorial is divided into three parts; they are:
Regression Loss Functions. Mean Squared Error Loss. Mean Squared Logarithmic Error Loss. …
Binary Classification Loss Functions. Binary Cross-Entropy. Hinge Loss. …
Multi-Class Classification Loss Functions. Multi-Class Cross-Entropy Loss. Sparse Multiclass Cross-Entropy Loss."
https://machinelearningmastery.com/how-to-choose-loss-functions-when-training-deep-learning-neural-networks/

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz