Category Archives: Graph Mining

Graph Mining

Graph Mining: Misc. Topics to Learn: Misc. Resources to Learn From

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: Graph Mining Use Cases

Graphs are everywhere. https://pdfs.semanticscholar.org/144b/130323cb94d618c2c5e66982a56f31d36396.pdf
Click https://pdfs.semanticscholar.org/144b/130323cb94d618c2c5e66982a56f31d36396.pdf link to open resource.

Learn From: Anomaly Detection and Graph Mining

https://hpi.de/fileadmin/user_upload/fachgebiete/mueller/courses/graphmining/GraphMining-13a-AnomalyDetection.pdf
Click https://hpi.de/fileadmin/user_upload/fachgebiete/mueller/courses/graphmining/GraphMining-13a-AnomalyDetection.pdf link to open resource.

Graph based Anomaly Detection and Description: A Survey

Graph based Anomaly Detection and Description: A Survey
Click https://www.andrew.cmu.edu/user/lakoglu/pubs/14-dami-graphanomalysurvey.pdf link to open resource.

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: Time Evolving Graphs

To Learn From: Time Evolving GraphsURL

Not completed: To Learn From: Time Evolving Graphs. Select to mark as complete.

https://link.springer.com/article/10.1007/s41019-019-00105-0

Learn by finding the answers to the following questions from the given/external resources?

Time Dependent Graphs

What is a time dependent graph?

Can you name a time dependent graph that you interact with everyday? Or that a good number of people interact everday/very often?

True or False? Time Dependent Graphs change over time?

True or False? Edge weights of Time Dependent Graphs change over time?

Give examples of real-life scenarios that can be modeled very effectively using Time Dependent Graphs?

What is time-dependent relationship?

Give some examples of Allen's time-dependent relationships in Graphs.

Visualize Allen's time-dependent relationships in Graphs. i.e. Show in Graphs.

What are the approaches to model the data provided through a time dependent graphs?

Can you give examples on how you might be able to utilize time dependent graphs in your domain?

How do you represnt time dependent graphs in Neo4j? Show/visualize an example?

More questions will be added later....

Some Answers:
Give examples of real-life scenarios that can be modeled very effectively using Time Dependent Graphs?

Ans: bioinformatics networks, transportation networks, and social networks

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: Shared Nearest Neighbors – Community Detection

Resources to Learn From

Resources:

Jarvis-Patrick Clustering
https://btluke.com/jpclust.html

Empirical Comparison of Algorithms for Network Community Detection
https://cs.stanford.edu/~jure/pubs/communities-www10.pdf

Read the resources above to find answers. Shared Nearest Neighbors : Clustering : Community Detection

Graph Mining: Shared Nearest Neighbors : Clustering : Community Detection

Graph Mining: Shared Nearest Neighbors (SNN): Clustering : Community Detection: Learn by Finding Answers to the Following Questions.

Will use SNN sometimes.

What is one another name of the algorithm: Shared Nearest Neighbors?

What is the purpose of the Algorithm: Shared Nearest Neighbors?

Can you name other Algorithms that serve the same or similar purpose?

What is the Criteria that SNN uses?

Is SNN Hierarchical? What does Hierarchical mean in the context of Clustering/Community detection?

How does SNN work? i.e. what is the algorithm? i.e. how does SNN create clusters/communities?

Give and explain the steps in SNN with a small example.

What is threshold τ in the SNN algorithm? Is it numeric? What can be the maximum and the minimum values?

For the following Undirected and unweighted Graphs, show the steps and the final clusters.
Edges: {0, 1} {0, 2} {0, 3} {1, 2}, {1, 3} {2, 3} {2, 4} {3, 4}
You can draw the graph first. You can use τ = 2.

What is Node Similarity or Node Proximity between two nodes in SNN?

What is the output of SNN? i.e. on an undirected and unweighted Graph?

Will the output will have weights on the edges?

Give the pseudocode for SNN algorithm

Implement the SNN algorithm in Python or R or Matlab - whichever you prefer.

Can you apply SNN on weighted Graphs?

If you can apply SNN on weighted graphs, then how will you apply the Threshold, say theta?

What is the first step in applying theta - if that is doable?

What is the difference between tau (τ) and theta.

What is k in SNN algorithms (if Weighted)?

How do we know which tau(τ) or ”k” to choose?

What are some evaluation metrics for tau or k?

what is Conductance?

Conductance whose property is this? The input graph, the output graph or the possible communities (from where you select the final communities)

Is low value or high value of Conductance - that is used for the communities?

When comparing two k or tau values - which k or tau that you accept? Think in terms of Conductance.

So, what was the purpose behind choosing these K or Tau values? What do we want to achieve ultimately? Is it to achieve Stronger communities by checking all possible communities and then measuring the community strength using Conductance? If it is - do all the questions on Weighted Graphs make sense? Why, Why not? How?

Write a Pseudo-code for SNN on Undirected Weighted Graphs. Find the optimal value for K and tau.

Implement SNN on Undirected Weighted Graphs. Print the optimal value for K and tau.

Why are we detecting communities and clusters anyway? What are the practical applications of such algorithms?
Hint: Check notes on Introduction to Clustering/Community detection

Some Answers:
What is one another name of the algorithm: Shared Nearest Neighbors?
Ans: Jarvis-Patrick algorithm

Can you name other Algorithms that serve the same or similar purpose?
Ans: Maximal-Clique Enumeration, K- Spanning Tree, Shared Nearest Neighbor, Highly Connected Components, MinCut, Betweenness Based Algorithms, Louvain Modularity, CNM Algorithm

What is the Criteria that SNN uses?
Ans: Two nodes are similar if they share a lot of neighbors

Is SNN Hierarchical?
Ans: No

What is Node Similarity or Node Proximity between two nodes in SNN?
Ans: Count of the shared number of nodes/neighbors

What is k in SNN algorithms (if Weighted)?
retain its k neighbors

Resources:

Jarvis-Patrick Clustering
https://btluke.com/jpclust.html

Empirical Comparison of Algorithms for Network Community Detection
https://cs.stanford.edu/~jure/pubs/communities-www10.pdf

By

Sayed Ahmed

Linkedinhttps://ca.linkedin.com/in/sayedjustetc

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com
Online and Offline Traininghttp://Training.SitesTree.com

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: Community and Cluster Detection

Resources to Learn From

Resources:
Defining and identifying communities in networks
https://www.pnas.org/content/101/9/2658


Community Structure:
https://en.wikipedia.org/wiki/Community_structure

Graph Clustering:
https://www.csc2.ncsu.edu/faculty/nfsamato/practical-graph-mining-with-R/slides/pdf/Graph_Cluster_Analysis.pdf

Read the resources above to find answers. Community Detection: Learn by finding answers to the following questions. Can you answer the following questions on Community Detection?

Graph Mining: Community Detection: Learn by finding answers to the following questions. Can you answer the following questions on Community Detection?

What is a community anyway? Describe from your real-world/social understanding of it?

Can you relate it to the Graphs concept in Computer Science?

In your daily life do you interact with a community in a real world? or in your communication with others using technologies?

What is Graph as defined in Computer Science Books/Practice? Can you formally represent a Graph? What are the components of a graph?

Is a whole Graph a Community? or Graphs can be subsets (Subgraph) as well?

What is a Sub-graph anyway?

Can you identify communities in real-world networks? i.e. real world networks that can be represented as graphs?

Give examples of real-world Graphs as well.

Can you detect a community in the tissues or the organs in the human body? Can you represent these as Graphs?

Can you detect a community in Friends on social media with similar interests? i.e. is it a community? Can you represent these/this as Graphs?

Can you detect/define a community for the Neighborhoods where a hurricane will likely pass? i.e. is it a community?

Is it a community: People that are likely to be affected by a contagious disease?

Will the properties of a community (subgraph) be the same like the whole graph (i.e. average properties of the whole graph)? or will it differ? When? How? Why? Examples?

How does the formation of communities affect? i.e. How does the existence or not of communities affect? You can provide generic answers or just explain with examples as well.

Do you think that you are a part/node of communities in online social networks such as Facebook, twitter? If so, how and when you are affected? If so, how and when you can affect others?

Can communities affect rumor spreading or disease spreading or epidemic spreading? Can you also identify how a rumor/disease will spread i.e. from the source i.e. which path will it take, how will it flow, who will get affected?

Is it that the spreading will only affect the nodes/parts of the communities? Do you see any limitations of this thought?

Who else can get affected? Can you predict that?

What is Link prediction? What are Link Prediction Algorithms?

Is it a possibility that current communities can be fine tuned? i.e. remove links/nodes and add new links/nodes? Why, why not? How? Is it the right thing to do? why, why not?

What is the difference between clustering and community detection?

What is thought to be of more real world entitities/geared? Community Detection or Clustering?

What is thought to be more geared towards structural Properties such as Min-Cut?

What is Min-Cut anyway?

For modeling, understanding and analysis purpose - can we consider them the same/similar? Community Detection or Clustering?
Ans: Yes

What is Cluster Analysis i.e. Community Detection process? i.e. what will be your process to identify clusters/communities?

Provide the names of some Community Detection Algorithms. Can you explain them as well? Can you formally define/represent and/or show pictorially what they do and how they work (steps as well)? Check the answer section, after you give it a try. You must have studied them in Computer Networks or related courses/concepts (Data Communication, CCNA, CCNP, Vehicle Routing)

How can you evaluate that the detected communities/clusters are good/great communities/clusters?

Answers:
What is a community?
a subset of nodes within the graph such that connections between the nodes (in the subset) are denser than connections with the rest of the network (i.e. from the subset to the outer nodes)

What is Cluster Analysis i.e. Community Detection process? i.e. what will be your process to identify clusters/communities?
"The process of dividing nodes of a graph into possibly overlapping, subsets, where nodes in each subset are considered related by some similarity measure"

Provide the names of some Community Detection Algorithms
Maximal-Clique Enumeration, K- Spanning Tree, Shared Nearest Neighbor, Highly Connected Components, MinCut, Betweenness Based Algorithms, Louvain Modularity, CNM Algorithm

Resources:
Defining and identifying communities in networks
https://www.pnas.org/content/101/9/2658

Community Structure:
https://en.wikipedia.org/wiki/Community_structure

Graph Clustering:
https://www.csc2.ncsu.edu/faculty/nfsamato/practical-graph-mining-with-R/slides/pdf/Graph_Cluster_Analysis.pdf

By

Sayed Ahmed

Linkedinhttps://ca.linkedin.com/in/sayedjustetc

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com
Online and Offline Traininghttp://Training.SitesTree.com

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: Node Importance

Resources to Learn From

A Book.

Nagiza F. Samatova, William Hendrix, John Jenkins, Kanchana Padmanabhan, and Arpan Chakraborty. 2013. Practical Graph Mining with R. Chapman & Hall/CRC.

Read the resources above to find answers. Betweenness Based Clustering: Learn by finding answers to the following questions. Can you answer the following?

Graph Mining: Betweenness Based Clustering: Learn by finding answers to the following questions. Can you answer the following?

What are the types of Betweenness?

What is Vertex Betweenness? i.e. the concept

What is Edge Betweenness? i.e. the concept

Describe Vertex Betweenness? maybe just giving an example

What does Vertex Betweenness clustering achieve? i.e. the output?

Give an Algorithm for Vertex Betweenness Clustering?

What is mu in the Algorithm for Vertex Betweenness Clustering?

Which vertex do you select at each step? i.e. the one with the highest betweenness or the lowest betweenness?

When you select a vertex at each step? Then what do you do (i.e. to create clusters)

Does the selected vertex go to all the clusters created around it?

What is the other name for Edge Betweenness?

Describe Edge Betweenness? maybe just giving an example

What does Edge Betweenness clustering achieve? i.e. the output?

Give an Algorithm for Edge Betweenness Clustering?

What is mu in the Algorithm for Edge Betweenness Clustering?

Which Edge do you select at each step? i.e. the one with the highest betweenness or the lowest betweenness?

When you select an Edge at each step? Then what do you do (i.e. to create clusters)

Does the selected Edge go to all the clusters created around it? or you just cut the graph at that edge and create clusters.

Answers:

Describe Vertex Betweenness? maybe just giving an example
Ans: The total number of shortest paths that pass through the vertex = the Vertex Betweenness for that Vertex. i.e. Among All vertices to all vertices shortest paths.

What is the other name for Edge Betweenness Clustering?
Ans: Girvan & Newman Clustering

What is Edge Betweenness? i.e. the concept
Ans: For a given edge, the number of shortest paths that pass through the edge. (All pairs/nodes to all pairs/nodes shortest paths)

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: HITS Pagerank HUBS Anchors and Graph Theories

Resources to Learn From

Resources:

HITS Algorithm
https://en.wikipedia.org/wiki/HITS_algorithm

http://pi.math.cornell.edu/~mec/Winter2009/RalucaRemus/Lecture4/lecture4.html
Last modified: Thursday, 24 October 2019, 9:32 PM

Read the resources above and Learn by finding Answers to the Following Questions. Can you answer the following questions?

Graph Mining: Hits. Learn by finding Answers to the Following Questions.

What is HITS algorithm in Graph Mining esp. for Web?

What does HITS stand for?

What is another similar Algorithm?

What are different between the PageRank algorithm and HITS?

What is the PageRank algorithm? What is the purpose?

Where and how PageRank is used?

Where and how HITS is used?

Describe HITS?

Describe the PageRank algorithm.

What are the two core concepts in HITS algorithm?

Web-pages that serves as large directories of other pages with useful information - what are they called in HITS algorithm?

Web-pages that serves to provide information on Specific topics - what are they called in the HITS algorithm?

What are HUBs in HITS algorithm? What purpose do they serve?

What are Authorities in HITS algorithm? What purpose do they serve?

Compare HITS and PageRank.

A web-page containing links to Top 1000 Universities in the world - is this a HUB or Authority in HITS?

What is the Govt. of Canada website that provides information on all Govt. Services? Is this a Hub or an Authority?

How do you measure a Good Hub?

How do you measure a Good Authority?

Can a web-page be both Hub and Authority? Can you assign both measures to a page irrespective of how good or bad?

What are the two scores that HITS assign to a web-page?

What is authority score for a web-page?

What is the hub score for a web-page?

What does hub score measure?

What does authority score measure?

What are the three matrices that are used for HITS algorithm? i.e. when you want to implement HITS algorithm.

What is a Transition Matrix in HITS?

What is a HUB vector? What does it contain initially?

What is an Authority vector? What does it contain initially?

For the graph below, provide the Initial Transition, Hub, Authority Vector/Matrices.

Directed Graph {Source, Destination}
Node: Yahoo, Amazon, Microsoft
Edges: {Yahoo, Yahoo} {Yahoo, Amazon} {Yahoo, Microsoft} {Amazon, Yahoo} {Amazon, Microsoft} {Microsoft, Amazon}

For the same graph above, explain your transition matrix?

If Transition Matrix is A, Hub Vector = h0, Authority Vector = a0.
How is Hub Score for a page is updated?, How is Authority Score for a page is updated? How long does this update happen?

The algorithm/steps as mentioned above: will the update converge to a state where Hub and Authority values will no longer change? Why, why not? if convergence does not happen what to do?

What is HITS normalization? i.e. after each iteration. Why it might be important.

Give steps/equations used for HITS normalization.

What are the two ways, you can make the HITS algorithm stop? i.e. stopping points.

True or False, Destiny of PageRank and HITS were different. What does it mean?

Can you stop the HITS algorithm after a certain number of iterations?

In real life, are HITS and PageRank used/applied for the whole graph i.e. whole Internet for example? Or majority of the times, they are applied on contextual graphs?

What are contextual graphs, anyway?

Give example use cases for HITS and PageRank?

Can you think of the value aspect of these algorithms? i.e. how they affect people, communities, societies?

What are the programming languages where you will find libraries that implement the HITS and PageRank algorithm? Give the name of the libraries.

Implement the algorithms from scratch in Python or in R without using the libraries. What did you use to debug your implementation and how?
What are the challenges that you faced to implement, how did you resolve? How did you represent the Graphs (i.e. on the graph you applied for testing).

Some Answers:
For the graph below, provide the Initial Transition, Hub, Authority Vector/Matrices.

Ans: Transition Matrix

A =[

1  1  1
1  0  1
0  1  0

]

Initial Hub Vector = h0
[
Yahoo
Amazon
Microsoft
]
=[
1
1
1]

Authority Vector: a0
[
Yahoo
Amazon
Microsoft
]

=
[
1
1
1
]

For the same graph above, explain your transition matrix?

First Row: Yahoo
2nd Row: Amazon
3rd Row: Microsoft

Columns: Yahoo ---- Amazon --- Microsoft

Transition Matrix

[Yahoo-> Yahoo, Yahoo-> Amazon, Yahoo-> Microsoft]
[Amazon->Yahoo, Amazon->Amazon, Amazon->Microsoft]
[Microsoft->yahoo, Microsoft->Amazon, Microsoft->microsoft]

using: 1 if page i links to page j, otherwise 0
[1,  1, 1 ]
[1,  0, 1]
[0,  1, 0]

If Transition Matrix is A, Hub Vector = h0, Authority Vector = a0.
How is Hub Score for a page is updated?, How is Authority Score for a page is updated?
A * a0 = h1 : Hub Score Update: based on authority score of outgoing links
transition(A) h1 = a1 : Authority Score Update : based on hub score of incoming links
Last modified: Wednesday, 30 October 2019, 5:40 PM

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: Resources to Learn From

Resources to Learn From

Resources to Learn From

Resources:
Graph mining - lesson 1: Introduction to graphs and networks
Nathalie Vialaneix
http://www.nathalievialaneix.eu/teaching/m2se/M2SE-network_1.pdf

phpFox Features: Start your own Social Network
https://www.phpfox.com/features/

Bipartite graph
https://en.wikipedia.org/wiki/Bipartite_graph

Tree (data structure)
https://en.wikipedia.org/wiki/Tree_(data_structure)

Tree structure
https://en.wikipedia.org/wiki/Tree_structure

Signed graph
https://en.wikipedia.org/wiki/Signed_graph

Gene regulatory network
https://en.wikipedia.org/wiki/Gene_regulatory_network

Dynamic Influence Analysis in Evolving Networks (VLDB'16)
https://www.slideshare.net/todo314/dynamic-influence-analysis-in-evolving-networks-vldb16

Introduction to Graph Theory
https://www.csc2.ncsu.edu/faculty/nfsamato/practical-graph-mining-with-R/slides/ppt/Introduction_to_Graph_Theory.ppt

Graph and its representations
https://www.geeksforgeeks.org/graph-and-its-representations/

Representing graphs
https://www.khanacademy.org/computing/computer-science/algorithms/graph-representation/a/representing-graphs

https://www.geeksforgeeks.org/graph-and-its-representations/

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Potential Learning Outcome from a Graph Mining Course

Learning Outcomes

Knowledge Based Outcome:

  • Students will be able to Identify, Describe, and Apply Graph Mining Concepts and Algorithms to Real World challenges and applications
  • Describe big data challenges when it comes to Graphs and Graph mining. Relate and explain, and discover solutions for mining big data graphs
  • Define and explain fundamental Graph Concepts and Algorithms along with fundamental Graph Mining Concepts and Algorithms

Skills Based Outcome:

  • Implement core Graph Mining algorithms in Python and/or R
  • Practice tools and libraries such as NetworkX, Gephi for Graph Mining
  • Familiarize and practice Jupiter Notebook, Eclipse Python IDE, R Studio for implementation

Value Based Outcome:

  • Describe ethical and legal considerations when applying to real world changes i.e how that affect the people and society 
  • Collaborative Group Work with respect to others and without authoritative interactions and attitudes; however, with caring and respectful attitude
  • Take responsibility for ones' actions and give proper credit to others where applicable

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: Learning Resources

Resources: Public URLs

Graph Mining: Introducing Graphs: Learn by finding answers to the following questions? http://bangla.salearningschool.com/recent-posts/graph-mining-introducing-graphs-learn-by-finding-answers-to-the-following-questions/

Graph Mining: What is Graph Mining? Learn by finding answers to the following questions. Can you answer the following questions? http://bangla.salearningschool.com/recent-posts/graph-mining-what-is-graph-mining-learn-by-finding-answers-to-the-following-questions-can-you-answer-the-following-questions/

Graph Mining: Shared Nearest Neighbors : Clustering : Community Detection. http://bangla.salearningschool.com/recent-posts/graph-mining-shared-nearest-neighbors-clustering-community-detection/

Graph Mining: Betweenness Based Clustering: Learn by finding answers to the following questions. Can you answer the following? http://bangla.salearningschool.com/recent-posts/graph-mining-betweenness-based-clustering-learn-by-finding-answers-to-the-following-questions-can-you-answer-the-following/

Graph Mining: Community Detection: Learn by finding answers to the following questions. Can you answer the following questions on Community Detection? http://bangla.salearningschool.com/recent-posts/graph-mining-community-detection-learn-by-finding-answers-to-the-following-questions-can-you-answer-the-following-questions-on-community-detection/

Graph Mining: SMALL WORLD GRAPHS & RANDOM GRAPH GENERATORS: Learn by finding answers to the following questions. http://bangla.salearningschool.com/recent-posts/graph-mining-small-world-graphs-random-graph-generators-learn-by-finding-answers-to-the-following-questions/

Graph Mining: K-Spanning Trees: Learn by finding answers to the following questions. Can you answer the questions? http://bangla.salearningschool.com/recent-posts/graph-mining-k-spanning-trees-learn-by-finding-answers-to-the-following-questions-can-you-answer-the-questions/

Graph Mining: Hits. Learn by finding Answers to the Following Questions. Can you answer the following questions?http://bangla.salearningschool.com/recent-posts/graph-mining-hits-learn-by-finding-answers-to-the-following-questions-can-you-answer-the-following-questions/

Graph Mining: Louvian Modularity: Learn by finding answers to the following questions. Can you answer the following questions? http://bangla.salearningschool.com/recent-posts/graph-mining-louvian-modularity-learn-by-finding-answers-to-the-following-questions-can-you-answer-the-following-questions/

Graph Mining: Highly Connected Subgraph Clustering: Learn by finding answers to the following questions. http://bangla.salearningschool.com/recent-posts/graph-mining-highly-connected-subgraph-clustering-learn-by-finding-answers-to-the-following-questions/

Graph Mining: Link Prediction. Learn by finding answers to the following questions. http://bangla.salearningschool.com/wp-admin/post.php?post=16297&action=edit
Last modified: Wednesday, 23 October 2019, 12:16 PM

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: Possible Course Topics

Course Topics

Graph Theory Introduction, Graph Mining Introduction, Network Properties, Random Graphs, Small World Graphs, Node Importance, Node Similarity, Clustering & Community Detection, Link Prediction, Anomaly detection, Time Evolving Graphs, Influence/Virus Propagation, Graph Mining Use Cases, Big Data Graph Databases, Big Data Graph Processing
Last modified: Wednesday, 23 October 2019, 12:44 PM

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: Job Prospect

Job Prospect for Graph Mining

Industry Job Prospect for Graph Mining

Sample Jobs

https://www.careerbuilder.com/jobs-graph-mining

https://www.indeed.com/q-Graph-Mining-jobs.html

For example, Google works in the following areas of Graph Mining. Google has jobs for such. Also, Facebook and any other social networking site will have jobs in relation to Graph Mining. Computational Biology, Medicine Research, Drug Discovery, Disease Diagnosis, Transportation, Scheduling, Shipping Scheduling will have applications and jobs for Graph Mining. 

Job Areas:

The general Mining (data based) jobs and Machine/Deep/Reinforcement Learning jobs will require Graph Mining expertise sometimes such as positions (real) : Research Intern - Deep Learning for Graphs, ML Engineer - Siri Knowledge Graph

Computer Networks, Network/Cyber Security application development (also R & D) positions might ask for Graph Mining expertise.

Graph Mining will have applications and jobs in Biological, Chemistry, Drug Design areas also in Transportation

Social Network Mining will always involve Graph Mining. Applications: Friend Recommendation

Trajectory Data Mining Jobs at Microsoft

https://www.microsoft.com/en-us/research/publication/trajectory-data-mining-an-overview/
Graph Mining Jobs (areas) at Google:
https://ai.google/research/teams/algorithms-optimization/graph-mining/

"Large-Scale Balanced Partitioning: Example Google Maps Driving Directions, Large-Scale Clustering:clustering graphs at Google scale, Large-Scale Connected Components, Large-Scale Link Modeling: similarity ranking and centrality metrics:  link prediction and anomalous link discovery., Large-Scale Similarity Ranking: Personalized PageRank, Egonet similarity, Adamic Adar, and others, Public-private Graph Computation, Streaming and Dynamic Graph Algorithms, ASYMP: Async Message Passing Graph Mining, Large-Scale Centrality Ranking, Large-Scale Graph Building"

More Related Jobs:

Tools in Jobs/Jobs--https://www.researchgate.net/post/Can_you_suggest_a_graph_mining_tool

--https://www.linkedin.com/jobs/gephi-jobs/

--https://bit.ly/2Nwlnrp Data Scientist 2

-- https://bit.ly/2r04wFP Graph Jobs

-- https://indeedhi.re/2oHkOmo
Last modified: Sunday, 19 January 2020, 10:06 PM

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Course Introduction

Misc AnnouncementsForum Course: This course relates to Big Data, Data Science, and Graph Mining. This can be thought of as a 4th year course or an MSc level course in Computer Science, Data Science or related departments.  The course is under  development, and will also be developed continuously.  We had an online session to introduce the course on Nov 1st, 2019.


Inform Others, Help Others: You might want to inform your friends, your university teachers, friends in higher education, and everyone who might get benefit from such courses and platforms. If anyone subscribes at http://bangla.salearningschool.com, we might give them access to various of our courses.

Create Course: Bengali or English: If you want to create a course in our platform to make available for free, or make available to be purchased by others, please fill this form: https://forms.gle/LJAQtfUoXkVXSRbZ8 . Also, if you want to provide online Tutoring to any level of students, please fill the form above as well


Contribute: Finally, if you are being given free access to our courses, or if you just want to support our platform, and want to contribute to help others : you can do financial contribution to: safoundation@salearningschool.com using Paypal. We can give a non-profit bank account in Canada as well.

Shop Online: https://www.ShopForSoul.com/

8112223 Canada Inc./JustEtc: http://JustEtc.net

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Courses: http://Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning)

Bloghttp://Bangla.SaLearningSchool.comhttp://SitesTree.com

Mediumhttps://medium.com/@SayedAhmedCanada

Graph Mining: Highly Connected Subgraph Clustering: Learn by finding answers to the following questions.

Graph Mining: Highly Connected Subgraph Clustering: Learn by finding answers to the following questions.

How do you define: Highly Connected?

What is a Cluster?

What is Subgraph?

Does connectivity indicate the number of edges between clusters?

True or false: Highly connected clustering means: within a cluster, connectivity will be low. Across clusters, connectivity will be high.

True or false: Highly connected clustering means: within clusters, connectivity will be high. Across clusters, connectivity will be low.

True or false: For Highly connected clustering, it is usually assumed that there is no prior assumptions on the number of clusters in the Graph.

What are some of the core concepts/terms that are related to Highly connected clustering?

What is a Cut when it comes to Highly connected clustering?

What is a Minimum Edge Cut when it comes to Highly connected clustering?

Describe Edge Connectivity for Highly connected clustering?

Contrast Cut and Minimum Edge Cut for Highly connected clustering?

Identify is this a Cut or a Minimum Edge Cut: The edges whose removal will disconnect the Graph?

Identify is this a Cut or a Minimum Edge Cut: The minimum edges whose removal will disconnect the Graph?

Edge removal and disconnecting the Graph: Does it mean we will have two clusters?

Assuming this is True: Highly connected clustering means: within a cluster, connectivity will be high. Across clusters, connectivity will be low.
---True or false: The more edges are there in a group of vertices - the more connected they are?
---True or false: The more edges are there in a group of vertices - the more similar are the vertices?
---True or false: The more edges are there in a group of vertices - it is highly likely that they will form a highly connected cluster?
---True or false: For a group of vertices, the more edges we need to remove to create a disconnected subgraph -- the more similar are the vertices

Is it a highly connected subgraph? For V vertices, the minimum edge cut contains less than V/2 edges? Why? or Why not?

Is it a highly connected subgraph? For V vertices, the minimum edge cut contains more than V/2 edges? Why? or Why not?

Is it a highly connected subgraph? Undirected and Unweighted. why or why not?
Nodes: 0 to 8
Edges: {0-1} {0-2} {0-3} {1-2} {2-3} {2-4} {3-5} {4-5} {4-6} {4-7} {4-8}
{5-6} {5-7} {5-8} {6-7} {6-8} {7-8}

What is the minimum edge cut for the above graph?

What is V/2 for the above Graph?

If EC(G) is the min-cut and V = 9, is EC(G) > V/2? If it is not, then can we call the subgraph to be highly connected?

Why we use V/2 to check for a Highly connected subgraph?

What is the maximum diameter of every single highly connected graph? Can you please explain.

What is the number of edges in a highly connected subgraph?

True or false: For highly connected subgraphs, the number of edges will be at least V^2/4?

True or false? explain: the number of edges in a highly connected subgraph is quadratic?

True or false: the number of edges in a highly connected subgraph is at most two?

Give and Describe the steps for creating/finding highly connected subgraphs.

Give the algorithm and pseudo-code Highly connected subgraphs?

What are the input and output for Highly connected subgraphs?

As we saw the definition of Min Edge Cut, can it be used in an algorithm for finding highly connected subgraphs?

As we saw that for highly connected subgraphs: EC(G) > V/2 i.e. Min Edge Cut > V/2. Can we just find min edge cuts for graphs/subgraphs that satisfy EC(G) > V/2 and divide the graph to get clusters? Also, we can do the same to the clusters to find smaller clusters if any. Can these be sufficient (core) to find highly connected subgraphs?

When will the Algorithm stop? i.e. Terminating Condition?

What are the algorithms to find the minimum edge cut for graphs?

Can you apply Karger's algorithm for Minimum Cut to Weighted Graphs? if so, How? What was its intended purpose?

Can you apply Stoer-Wagner's algorithm for Minimum Cut to Unweighted Graphs? if so, How? What was its intended purpose?

What is the core concept of Karger's algorithm?

What is the “contraction of an edge” in Karger's algorithm?

What is the number of edge reduction caused by “contraction of an edge”?

In Karger's algorithm when you merge two nodes u and v, what happens to the edges that were previously connected to those nodes?

What is the complexity of Karger's algorithm?

What is the complexity of Karger's algorithm? is it O(E) or O(V) or O (Log(E))

Describe Karger's algorithm?

Give the steps in Karger's algorithm?

Describe Pseudo-code and Python/C++/R/Java code for Karger's algorithm?

What is the input for Karger's algorithm? What is the output?

What is the terminating condition for Karger's algorithm?

Can you continue with Karger's algorithm when there are <= 2 vertices left?

Give the minimum cut for the following Graph using Karger's algorithm?
STEPS: while there are more than 2 vertices, pick a random edge (u, v), contract u and v into one vertex, reattach the edges of u and v to this new vertex, remove self-loops. Keep doing.
Graph, Edges: {0, 1} {0, 2} {0, 3} {1, 3} {2, 3}
Also, show all the steps in picture or text

Will Karger's algorithm? always create Min Cut?

What is the probability that Karger's algorithm will create Min Cut?

For a 10 node graph, is the probability = 1/100?

If Karger's algorithm is not supposed to generate the min-cut always, how can you increase the probability of creating i.e. implement in such a way so that you increase your chances to find the min-cut before applying for Highly Connected Subgraph Clustering algorithm?

Can running the Karger's algorithm multiple/sufficient times and then taking the least cut found so far will produce better min-cut and better highly connected clusters (than just running once)?

Some Answers:
What is the minimum edge cut for the above graph?
Ans: 2

What is V/2 for the above Graph? 4.5

If EC(G) is the min-cut and V = 9, is EC(G) > V/2? If it is not, then can we call the subgraph to be highly connected?
Ans: No, False, G is not a highly connected subgraph.
Resources

HCS clustering algorithm
https://en.wikipedia.org/wiki/HCS_clustering_algorithm

Karger’s algorithm for Minimum Cut | Set 1 (Introduction and Implementation)
https://www.geeksforgeeks.org/kargers-algorithm-for-minimum-cut-set-1-introduction-and-implementation/

By
Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz



Not that this site is doing any great work; However, if you want to contribute to the operation of this site (or charitable/non-profit work in the education sector), you can financially contribute to: safoundation at salearningschool.com using Paypal.

Graph Mining: Louvian Modularity: Learn by finding answers to the following questions. Can you answer the following questions?

Graph Mining: Louvian/Louvain Modularity: Learn by finding answers to the following questions. Can you answer the following questions?

I have seen both: Louvian Modularity and Louvain Modularity. I will use Louvain Modularity

What is the primary problem that Louvain Modularity solved?

Is Louvain good at detecting small communities?

What is modularity in Graphs?

What is the range of values used to show the modularity of a Graph?

What is the equation for Modularity? You can just only explain conceptually - what are the parameters/parts - how do the parameters work?

Give the algorithm for Louvain Modularity? You can just describe or you can show the steps.

Give the Pseudo-code for Louvain Modularity and implement in Python or R or C/C++?

Is Louvain Modularity easily parallelize-able or it is more sequential?

Investigate if any other algorithm is there that works like Louvain (i.e. can be an extension/modification of Louvain) that are better suited for Parallelization.

What is the first step in Louvain Algorithm?

True or False, at first step, all nodes are assigned to one community?

True or false, at first each node is assigned to it's own community? i.e. no of vertices = no of communities

What are the two steps in Louvain Algorithm? What is the name of the steps?

What can be the terminating condition for the Louvain Algorithm?

How a node i is merged to a neighbor node/community to create a community? What is the measure used? Can you use the modularity equation as the measure to see how and to what node/community node i will merge to? Can it happen that node i will not be merged with anything though initially, you wanted to merge/assign i to another node/community?

When you will leave a node i alone without merging with a neighbor node/community? In that case, what will be the value of the modularity i.e. modularity change (i alone or i merged with a node/community)

when you try to merge node i with a neighbor node/community, do you try to assign node i to all possible neighbor node/community and calculate modularity change? If so, to which node/community i will get assigned to (describe in terms of modularity change)?

What is Modularity Optimization in Louvain modularity? Describe the steps/process for this Optimization.

What is the second step/phase for the Louvain Algorithm? What it is called? What happens here? what are the steps/process?

Is the Louvain Algorithm supervised?

Is the Louvain Algorithm fast or slow?

How many passes do the Louvain Algorithm require to create the communities?

What is the resolution limit for the Louvain Algorithm?

When the resolution limit is low, do you get more/smaller communities?

When the resolution limit is high, do you get less/larger output communities?

What is the numerical range used for the resolution limit?

Resources:
Louvain: Fast and Memory efficient. A Parallel Alternative
https://phys.org/news/2015-01-fast-problems.html

Modularity_(networks)
https://en.wikipedia.org/wiki/Modularity_(networks)

Louvain Modularity
https://en.wikipedia.org/wiki/Louvain_Modularity

Fast unfolding of communities in large networks
https://arxiv.org/pdf/0803.0476.pdf

Resolution Limit
https://en.wikipedia.org/wiki/Modularity_%28networks%29#Resolution_limit

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Not that this site is doing any great work; However, if you want to contribute to the operation of this site (or charitable/non-profit work in the education sector), you can financially contribute to: safoundation at salearningschool.com (Justetc Social Services) using Paypal.

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz


Graph Mining: Hits. Learn by finding Answers to the Following Questions. Can you answer the following questions?

Graph Mining: Hits. Learn by finding Answers to the Following Questions.

What is HITS algorithm in Graph Mining esp. for Web?

What does HITS stand for?

What is another similar Algorithm?

What are different between the PageRank algorithm and HITS?

What is the PageRank algorithm? What is the purpose?

Where and how PageRank is used?

Where and how HITS is used?

Describe HITS?

Describe the PageRank algorithm.

What are the two core concepts in HITS algorithm?

Web-pages that serves as large directories of other pages with useful information - what are they called in HITS algorithm?

Web-pages that serves to provide information on Specific topics - what are they called in the HITS algorithm?

What are HUBs in HITS algorithm? What purpose do they serve?

What are Authorities in HITS algorithm? What purpose do they serve?

Compare HITS and PageRank.

A web-page containing links to Top 1000 Universities in the world - is this a HUB or Authority in HITS?

What is the Govt. of Canada website that provides information on all Govt. Services? Is this a Hub or an Authority?

How do you measure a Good Hub?

How do you measure a Good Authority?

Can a web-page be both Hub and Authority? Can you assign both measures to a page irrespective of how good or bad?

What are the two scores that HITS assign to a web-page?

What is authority score for a web-page?

What is the hub score for a web-page?

What does hub score measure?

What does authority score measure?

What are the three matrices that are used for HITS algorithm? i.e. when you want to implement HITS algorithm.

What is a Transition Matrix in HITS?

What is a HUB vector? What does it contain initially?

What is an Authority vector? What does it contain initially?

For the graph below, provide the Initial Transition, Hub, Authority Vector/Matrices.

Directed Graph {Source, Destination}
Node: Yahoo, Amazon, Microsoft
Edges: {Yahoo, Yahoo} {Yahoo, Amazon} {Yahoo, Microsoft} {Amazon, Yahoo} {Amazon, Microsoft} {Microsoft, Amazon}

For the same graph above, explain your transition matrix?

If Transition Matrix is A, Hub Vector = h0, Authority Vector = a0.
How is Hub Score for a page is updated?, How is Authority Score for a page is updated? How long does this update happen?

The algorithm/steps as mentioned above: will the update converge to a state where Hub and Authority values will no longer change? Why, why not? if convergence does not happen what to do?

What is HITS normalization? i.e. after each iteration. Why it might be important.

Give steps/equations used for HITS normalization.

What are the two ways, you can make the HITS algorithm stop? i.e. stopping points.

True or False, Destiny of PageRank and HITS were different. What does it mean?

Can you stop the HITS algorithm after a certain number of iterations?

In real life, are HITS and PageRank used/applied for the whole graph i.e. whole Internet for example? Or majority of the times, they are applied on contextual graphs?

What are contextual graphs, anyway?

Give example use cases for HITS and PageRank?

Can you think of the value aspect of these algorithms? i.e. how they affect people, communities, societies?

What are the programming languages where you will find libraries that implement the HITS and PageRank algorithm? Give the name of the libraries.

Implement the algorithms from scratch in Python or in R without using the libraries. What did you use to debug your implementation and how?
What are the challenges that you faced to implement, how did you resolve? How did you represent the Graphs (i.e. on the graph you applied for testing).

Some Answers:
For the graph below, provide the Initial Transition, Hub, Authority Vector/Matrices.

Ans: Transition Matrix

A =[

1  1  1
1  0  1
0  1  0

]

Initial Hub Vector = h0
[
Yahoo
Amazon
Microsoft
]
=[
1
1
1]

Authority Vector: a0
[
Yahoo
Amazon
Microsoft
]

=
[
1
1
1
]

For the same graph above, explain your transition matrix?

First Row: Yahoo
2nd Row: Amazon
3rd Row: Microsoft

Columns: Yahoo ---- Amazon --- Microsoft

Transition Matrix

[Yahoo-> Yahoo, Yahoo-> Amazon, Yahoo-> Microsoft]
[Amazon->Yahoo, Amazon->Amazon, Amazon->Microsoft]
[Microsoft->yahoo, Microsoft->Amazon, Microsoft->microsoft]

using: 1 if page i links to page j, otherwise 0
[1,  1, 1 ]
[1,  0, 1]
[0,  1, 0]

If Transition Matrix is A, Hub Vector = h0, Authority Vector = a0.
How is Hub Score for a page is updated?, How is Authority Score for a page is updated?
A * a0 = h1 : Hub Score Update: based on authority score of outgoing links
transition(A) h1 = a1 : Authority Score Update : based on hub score of incoming links

Graph Mining: Introducing Graphs: Learn by finding answers to the following questions?

Graph Mining: Introducing Graphs: Learn by finding answers to the following questions?

Questions

What is a Graph?

Give an example of a Graph?

What are the different Types of Graphs? i.e. Try to give some examples of different ways how Graphs are classified?

What is a directed Graph?

What is an undirected Graph?

What are Labeled Graphs?

What are Weighted Graphs?

What are the unweighted Graphs?

What are the Bipartite Graphs?

What is a Tree?

What is a Signed Graph?

What is a Looped Graph? i.e. What is a loop?

Compare Static and Time-evolving Graphs.

What is an Isomorphic Graph?

What does Isomorphism mean when it comes to Graphs?

What are different ways to represent Graphs? i.e. when implementing Graph Algorithms?

How to store a Graph data in programming languages?

Describe the Adjacency Matrix approach of Graph Representation.

Give a small example graph then represent it using the Adjacency Matrix approach of Graph Representation.

Represent the following Graph using Adjacency Matrix approach of Graph Representation
Edges: {0, 1} {0, 4} {1, 2} {1, 3} {1, 4} {2, 3} {3, 4}

What are the advantages/pros of the Adjacency Matrix Representation?

What are the disadvantages/cons of the Adjacency Matrix Representation?

Describe the Adjacency List approach of Graph Representation.

Give a small example graph then represent it using the Adjacency List approach of Graph Representation.

Represent the following Graph using Adjacency List approach of Graph Representation
Edges: {0, 1} {0, 4} {1, 2} {1, 3} {1, 4} {2, 3} {3, 4}

What are the advantages/pros of the Adjacency List Representation?

What are the disadvantages/cons of the Adjacency List Representation?

Describe the Adjacency Tensor approach of Graph Representation.

Give a small example graph then represent it using the Adjacency Tensor approach of Graph Representation.

Represent the following Graph using Adjacency Tensor approach of Graph Representation
Edges: {0, 1} {0, 4} {1, 2} {1, 3} {1, 4} {2, 3} {3, 4}

What are the advantages/pros of Adjacency Tensor Representation?

What are the disadvantages/cons of Adjacency Tensor Representation?

True or False? Adjacency Matrix is a 2D array of size V x V where V is the number of vertices in a graph.

Can the Adjacency Matrix represent weighted graphs?

How easy it is you think to represent graphs using Adjacency Matrix?

What is the complexity of removing an edge from Adjacency Matrix representation?

What is the complexity of queries such as whether there is an edge between vertex a and b for Adjacency Matrix representation?

What is the Space complexity for Adjacency Matrix representation? Use vertex count to represent.

What is the complexity of adding an edge from Adjacency Matrix representation?

For the Adjacency List approach, how many total lists are there? How many vertices each list can have at best?

For the Adjacency List approach, how many lists are there for one vertex?

How much Space does the Adjacency List approach take?

How many elements does an Adjacency list contain for an undirected graph? Represent using E = edge count. For the complete representation of the Graph.

Why the complete representation of an undirected Graph using Adjacency List will have 2|E| number of elements?

How many times does an edge appear in the complete representation of the Undirected Graph using the Adjacency List?

Why the complete representation of the Directed Graph using Adjacency List will have |E| number of elements?

How many times does an edge appear in the complete representation of the Directed Graph using the Adjacency List?

How many dimensions will an Adjacency Tensor have for Tensor? What is this for Dynamic Graphs?

Some Answers
What are the different Types of Graphs? i.e. Try to give some examples of different ways how Graphs are classified?

Ans: Undirected vs. Directed, Attributed/Labeled (e.g., vertex, edge) vs. Unlabeled, Weighted vs. Unweighted, General vs. Bipartite (Multipartite), Signed Graphs, Trees (no cycles), Simple vs. w/ loops vs. w/ multi-edges, Static vs. Time Evolving Graphs, Isomorphic Graphs

What are the different ways to represent Graphs? i.e. when implementing Graph Algorithms?
Ans: Static Graphs, Adjacency Matrix, Adjacency List, Dynamic Graphs, Adjacency 3D Tensor

Resources:
Graph mining - lesson 1 Introduction to graphs and networks Nathalie Vialaneix
http://www.nathalievialaneix.eu/teaching/m2se/M2SE-network_1.pdf

phpFox Features: Start your own Social Network
https://www.phpfox.com/features/

Bipartite graph
https://en.wikipedia.org/wiki/Bipartite_graph

Tree (data structure)
https://en.wikipedia.org/wiki/Tree_(data_structure)

Tree structure
https://en.wikipedia.org/wiki/Tree_structure

Signed graph
https://en.wikipedia.org/wiki/Signed_graph

Gene regulatory network
https://en.wikipedia.org/wiki/Gene_regulatory_network

Dynamic Influence Analysis in Evolving Networks (VLDB'16)
https://www.slideshare.net/todo314/dynamic-influence-analysis-in-evolving-networks-vldb16

Introduction to Graph Theory
https://www.csc2.ncsu.edu/faculty/nfsamato/practical-graph-mining-with-R/slides/ppt/Introduction_to_Graph_Theory.ppt

Graph and its representations
https://www.geeksforgeeks.org/graph-and-its-representations/

Representing graphs
https://www.khanacademy.org/computing/computer-science/algorithms/graph-representation/a/representing-graphs
https://www.geeksforgeeks.org/graph-and-its-representations/

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz



We do not claim that this site is doing any great work; However, if you want to contribute to the operation of this site (or charitable/non-profit work in the education sector), you can financially contribute to: safoundation at salearningschool.com using Paypal. Monthly our articles/short-notes sites usually have visitors in the range 10k to 20k.

Graph Mining: What is Graph Mining? Learn by finding answers to the following questions. Can you answer the following questions?

Graph Mining: What is Graph Mining? Learn by finding answers to the following questions. Can you answer the following questions?

What is Graph Mining?

What is a Graph anyway?

Is Graph mining just a kind of Machine Learning? i.e. is Machine Learning the only primary component of Graph Mining?

Does Graph Mining involve Only Statistics? Or Statistics can help?

Does Graph Mining involve Only Graph Theory? Or Graph Theory can help?

Is Graph Theory primarily relates to Computer Science or Math/Statistics or Machine Learning?

Does Graph Mining involve Only Machine Learning? Or Machine Learning is just one component?

To be great at Graph Mining - what are the areas that you need to master?

What are the applications of Graph Mining?

How the hackers can use/utilize Graph Mining?

What is the Origin of Graph Theory?

What is the first paper in Graph Theory? What does it try to solve?

What (where) are the applications of the Graph Algorithm named: Seven Bridges of König

Why is Graph and Graph Mining Important? Why do we care?

What are some application areas of Graph Mining?

How can you model Social Network to be a Graph?

Give an example of Graphs in Biology?

What is a Semantic Graph? Give examples.

Give an example of Graphs from Non-Graph Data.

Network vs. Graph?

Some Answers:
What is Graph Mining?
Ans: Discovering and Analyzing Graph Data

What are the applications of Graph Mining?
Ans: Fraud Detection, Community/Cluster detection, Recommending friends, Finding Influential Nodes [Virus Spread - not a good application]

What (where) are the applications of the Graph Algorithm named: Seven Bridges of König
Ans: Transportation, Biology, Chip Designing, Chemistry

What are some application areas of Graph Mining?
Ans: Social Networks, Semantic Web, World Wide Web, Drug Design, Computer Networks, Sensor Networks, Chemical Components

How can you model Social Network to be a Graph?
Ans: Nodes = Users, Edges = Friends/Followers

Give an example of Graphs in Biology?
Ans: Protein-Protein Interaction
Nodes: Proteins
Edges: Physical Interactions

What is a Semantic Graph? Give examples.
Ans: Wikipedia, Nodes = Concepts, Edges = Property/Type

What is a Semantic Graph? Give examples.
Ans: Concepts and relations

Give an example of Graphs from Non-Graph Data.
Ans: Network of Thrones (See resources), Student Enrollment

Network vs. Graph?
Ans: Network: Real Systems - Web, Social, Biology Terms: Network, Node, Link, Relationship

Graph: Mathematical representation Terms: Graph, Vertex/node, Edge
We can use them interchangeably

Resources:
Graph mining - lesson 1: Introduction to graphs and networks
http://www.nathalievialaneix.eu/teaching/m2se/M2SE-network_1.pdf

Graph theory
https://en.wikipedia.org/wiki/Graph_theory

Seven Bridges of König
https://en.wikipedia.org/wiki/Graph_theory#History
http://www.cs.kent.edu/~dragan/ST-Spring2016/The%20Seven%20Bridges%20of%20Konigsberg-Euler's%20solution.pdf

Semantic Network
https://en.wikipedia.org/wiki/Semantic_network

Network of Thrones
https://www.macalester.edu/~abeverid/thrones.html

Graph from Non-Graph data
Manage Data in Excel With Databases, Tables, Records, and Fields
https://www.lifewire.com/manage-data-with-databases-tables-records-and-fields-in-excel-4178649

Graph Mining: Introduction
https://hpi.de/fileadmin/user_upload/fachgebiete/mueller/courses/graphmining/2017/01-Introduction.pdf

Software for graph visualization and mining: Gephi (https://gephi.org/), Tulip (https://tulip.labri.fr/TulipDrupal/) and Cytoscape (https://cytoscape.org/)

Packages dedicated to graphs:
for Python: igraph (https://igraph.org/), NetworkX (https://networkx.github.io/) and graph-tool (https://graph-tool.skewed.de/); Snappy (https://snap.stanford.edu/snappy/)

¤ for R: igraph (https://igraph.org/), statnet (http://statnet.org/), bipartite (https://cran.r-project.org/web/packages/bipartite/) and tnet (https://toreopsahl.com/tnet/);

¨Datasets
¤ Mark Neuman’s page: http://www-personal.umich.edu/~mejn/netdata
¤ Stanford Dataset: https://snap.stanford.edu/data/
¤ KONECT: http://konect.uni-koblenz.de/networks/
¤ ICON: https://icon.colorado.edu/#!/

¨Animation of Algorithms
¤ https://www.cs.usfca.edu/~galles/visualization/Algorithms.html

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz



Not that this site is doing any great work; However, if you want to contribute to the operation of this site (or other charitable/non-profit work in the education sector), you can financially contribute to: safoundation at salearningschool.com using Paypal (Credit Card Accepted).

Graph Mining: K-Spanning Trees: Learn by finding answers to the following questions. Can you answer the questions?

Graph Mining: K-Spanning Trees: Learn by finding answers to the following questions. Can you answer the questions?

What is a Spanning Tree?

What is a Minimum Spanning Tree?

For the undirected Graph below, what is a Spanning Tree? First, draw the Graph, then Try to find the answer, and then check the answer below.
Edges with weights: {1, 2, 7} {1, 3, 2} {1, 4, 4} {2, 3, 3} {2, 4, 5} {3, 4, 2} {3, 5, 6} {4, 5, 4}

What is a minimum spanning tree?

What is the minimum spanning tree for the graph as provided above?

What is Prim's Algorithm for Spanning Trees?

Describe the steps as used in Prim's Algorithm?

Give the Algorithm i.e. Prim's Algorithm?

Write pseudo-code for Prim's Algorithm?

Implement Prim's Algorithm in Python, R, Matlab, or in any other language of your choice?

What is the initialization step in Prim's Algorithm?

Can you choose any node to start with Prim's Algorithm?

True or False, Prim's algorithm does not include all nodes in the output?

How an edge/path is selected in Prim's Algorithm?

True or False, to proceed and when to select an edge, the edge does not need to belong to the already formed Minimum Spanning Tree (MST)?

True or False, to proceed and to select a new edge, the edge will need to have the highest weight outgoing from the MST or from the vertex under consideration)?

True or False, to proceed and to select a new edge, the edge will need to have the minimum weight outgoing from the MST (or from the vertex under consideration)?

True or False? The new edge selection steps are as follows.
Step 1. For the new edge, exactly one of its vertices/edge-points will need to be in the MST already
Step 2. The new edge has the minimum weight that satisfies step 1.

How long will you continue to add new edges to the MST? i.e. terminating condition for your algorithm.

Apply Prim's algorithm to find the MST in Graph given above/below: See the answer after you give it a try
Edges with weights: {1, 2, 7} {1, 3, 2} {1, 4, 4} {2, 3, 3} {2, 4, 5} {3, 4, 2} {3, 5, 6} {4, 5, 4}

What is K-Spanning tree?

What are the steps in K-Spanning Tree?

Give the Algorithm for K-Spanning?

Write pseudo-code for K-Spanning?

Implement the K-Spanning tree algorithm in Python, R, Matlab, or in any other language of your choice?

What is K in the K-Spanning Tree algorithms?

How many edges do you remove to create k Clusters?

What is the first step in the K-Spanning tree?

When you remove edges from an MST - which edges do you remove? Edges represent distance. The ones with the highest weights or the lowest weights?

Some Answers:

What is a Spanning Tree?
Ans: A connected subgraph with all vertices and no cycles

For the undirected Graph below, what is a Spanning Tree?
Edges with weights: {1, 2, 7} {1, 3, 2} {3, 4, 2} {3, 5, 6}
Weight: 17

What is a minimum spanning tree?
Ans: Spanning trees with the minimum sum of edge weights where weights indicate distances.

What is the minimum spanning tree for the graph as provided above?
Ans: the same spanning tree with weight sum = 17
i.e. {1, 2, 7} {1, 3, 2} {3, 4, 2} {3, 5, 6}

What is Prim's Algorithm for Spanning Trees?
Ans: Finds the minimum spanning tree

Apply Prim's algorithm to find the MST in Graph given above/below: See the answer after you give it a try
Edges with weights: {1, 2, 7} {1, 3, 2} {1, 4, 4} {2, 3, 3} {2, 4, 5} {3, 4, 2} {3, 5, 6} {4, 5, 4}
Draw the graph that will make things easier

Ans: Select node 5
Select (5, 4, 4) from {3, 5, 6} {4, 5, 4}
Add (5, 4) to the MST
select edge {3, 4, 2} from {3, 5, 6} {2, 4, 5} {3, 4, 2} {1, 4, 4}
MST: {5, 4, 4} {3, 4, 2}
Similarly: select (3, 2, 3) {1, 3, 2}
Final Output: {5, 4, 4} {3, 4, 2} (3, 2, 3) {1, 3, 2}

What is the K-Spanning tree?
Ans: Community/Clustering Algorithm

When you remove edges from an MST - which edges do you remove? Edges represent distance. The ones with the highest weights or the lowest weights?
Ans: highest

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz



Not that this site is doing any great work; However, if you want to contribute to the operation of this site (or charitable/non-profit work in the education sector), you can financially contribute to: safoundation at salearningschool.com using Paypal.

Graph Mining: Betweenness Based Clustering: Learn by finding answers to the following questions. Can you answer the following?

Graph Mining: Betweenness Based Clustering: Learn by finding answers to the following questions. Can you answer the following?

What are the types of Betweenness?

What is Vertex Betweenness? i.e. the concept

What is Edge Betweenness? i.e. the concept

Describe Vertex Betweenness? maybe just giving an example

What does Vertex Betweenness clustering achieve? i.e. the output?

Give an Algorithm for Vertex Betweenness Clustering?

What is mu in the Algorithm for Vertex Betweenness Clustering?

Which vertex do you select at each step? i.e. the one with the highest betweenness or the lowest betweenness?

When you select a vertex at each step? Then what do you do (i.e. to create clusters)

Does the selected vertex go to all the clusters created around it?

What is the other name for Edge Betweenness?

Describe Edge Betweenness? maybe just giving an example

What does Edge Betweenness clustering achieve? i.e. the output?

Give an Algorithm for Edge Betweenness Clustering?

What is mu in the Algorithm for Edge Betweenness Clustering?

Which Edge do you select at each step? i.e. the one with the highest betweenness or the lowest betweenness?

When you select an Edge at each step? Then what do you do (i.e. to create clusters)

Does the selected Edge go to all the clusters created around it? or you just cut the graph at that edge and create clusters.

Answers:

Describe Vertex Betweenness? maybe just giving an example
Ans: The total number of shortest paths that pass through the vertex = the Vertex Betweenness for that Vertex. i.e. Among All vertices to all vertices shortest paths.

What is the other name for Edge Betweenness Clustering?
Ans: Girvan & Newman Clustering

What is Edge Betweenness? i.e. the concept
Ans: For a given edge, the number of shortest paths that pass through the edge. (All pairs/nodes to all pairs/nodes shortest paths)

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz





Not that this site is doing any great work; However, if you want to contribute to the operation of this site (or charitable/non-profit work in the education sector), you can financially contribute to: safoundation at salearningschool.com using Paypal.




Graph Mining: Shared Nearest Neighbors : Clustering : Community Detection

Graph Mining: Shared Nearest Neighbors : Clustering : Community Detection

Graph Mining: Shared Nearest Neighbors (SNN): Clustering : Community Detection: Learn by Finding Answers to the Following Questions.

Will use SNN sometimes.

What is one another name of the algorithm: Shared Nearest Neighbors?

What is the purpose of the Algorithm: Shared Nearest Neighbors?

Can you name other Algorithms that serve the same or similar purpose?

What is the Criteria that SNN uses?

Is SNN Hierarchical? What does Hierarchical mean in the context of Clustering/Community detection?

How does SNN work? i.e. what is the algorithm? i.e. how does SNN create clusters/communities?

Give and explain the steps in SNN with a small example.

What is threshold τ in the SNN algorithm? Is it numeric? What can be the maximum and the minimum values?

For the following Undirected and unweighted Graphs, show the steps and the final clusters.
Edges: {0, 1} {0, 2} {0, 3} {1, 2}, {1, 3} {2, 3} {2, 4} {3, 4}
You can draw the graph first. You can use τ = 2.

What is Node Similarity or Node Proximity between two nodes in SNN?

What is the output of SNN? i.e. on an undirected and unweighted Graph?

Will the output will have weights on the edges?

Give the pseudocode for SNN algorithm

Implement the SNN algorithm in Python or R or Matlab - whichever you prefer.

Can you apply SNN on weighted Graphs?

If you can apply SNN on weighted graphs, then how will you apply the Threshold, say theta?

What is the first step in applying theta - if that is doable?

What is the difference between tau (τ) and theta.

What is k in SNN algorithms (if Weighted)?

How do we know which tau(τ) or ”k” to choose?

What are some evaluation metrics for tau or k?

what is Conductance?

Conductance whose property is this? The input graph, the output graph or the possible communities (from where you select the final communities)

Is low value or high value of Conductance - that is used for the communities?

When comparing two k or tau values - which k or tau that you accept? Think in terms of Conductance.

So, what was the purpose behind choosing these K or Tau values? What do we want to achieve ultimately? Is it to achieve Stronger communities by checking all possible communities and then measuring the community strength using Conductance? If it is - do all the questions on Weighted Graphs make sense? Why, Why not? How?

Write a Pseudo-code for SNN on Undirected Weighted Graphs. Find the optimal value for K and tau.

Implement SNN on Undirected Weighted Graphs. Print the optimal value for K and tau.

Why are we detecting communities and clusters anyway? What are the practical applications of such algorithms?
Hint: Check notes on Introduction to Clustering/Community detection

Some Answers:
What is one another name of the algorithm: Shared Nearest Neighbors?
Ans: Jarvis-Patrick algorithm

Can you name other Algorithms that serve the same or similar purpose?
Ans: Maximal-Clique Enumeration, K- Spanning Tree, Shared Nearest Neighbor, Highly Connected Components, MinCut, Betweenness Based Algorithms, Louvain Modularity, CNM Algorithm

What is the Criteria that SNN uses?
Ans: Two nodes are similar if they share a lot of neighbors

Is SNN Hierarchical?
Ans: No

What is Node Similarity or Node Proximity between two nodes in SNN?
Ans: Count of the shared number of nodes/neighbors

What is k in SNN algorithms (if Weighted)?
retain its k neighbors

Resources:

Jarvis-Patrick Clustering
https://btluke.com/jpclust.html

Empirical Comparison of Algorithms for Network Community Detection
https://cs.stanford.edu/~jure/pubs/communities-www10.pdf

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Not that this site is doing anything great work; However, if you want to contribute to the operation of this site (or charitable/non-profit work in education sector), you can donate to: safoundation using Paypal.

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz

Graph Mining: Community Detection: Learn by finding answers to the following questions. Can you answer the following questions on Community Detection?

Graph Mining: Community Detection: Learn by finding answers to the following questions. Can you answer the following questions on Community Detection?

What is a community anyway? Describe from your real-world/social understanding of it?

Can you relate it to the Graphs concept in Computer Science?

In your daily life do you interact with a community in a real world? or in your communication with others using technologies?

What is Graph as defined in Computer Science Books/Practice? Can you formally represent a Graph? What are the components of a graph?

Is a whole Graph a Community? or Graphs can be subsets (Subgraph) as well?

What is a Sub-graph anyway?

Can you identify communities in real-world networks? i.e. real world networks that can be represented as graphs?

Give examples of real-world Graphs as well.

Can you detect a community in the tissues or the organs in the human body? Can you represent these as Graphs?

Can you detect a community in Friends on social media with similar interests? i.e. is it a community? Can you represent these/this as Graphs?

Can you detect/define a community for the Neighborhoods where a hurricane will likely pass? i.e. is it a community?

Is it a community: People that are likely to be affected by a contagious disease?

Will the properties of a community (subgraph) be the same like the whole graph (i.e. average properties of the whole graph)? or will it differ? When? How? Why? Examples?

How does the formation of communities affect? i.e. How does the existence or not of communities affect? You can provide generic answers or just explain with examples as well.

Do you think that you are a part/node of communities in online social networks such as Facebook, twitter? If so, how and when you are affected? If so, how and when you can affect others?

Can communities affect rumor spreading or disease spreading or epidemic spreading? Can you also identify how a rumor/disease will spread i.e. from the source i.e. which path will it take, how will it flow, who will get affected?

Is it that the spreading will only affect the nodes/parts of the communities? Do you see any limitations of this thought?

Who else can get affected? Can you predict that?

What is Link prediction? What are Link Prediction Algorithms?

Is it a possibility that current communities can be fine tuned? i.e. remove links/nodes and add new links/nodes? Why, why not? How? Is it the right thing to do? why, why not?

What is the difference between clustering and community detection?

What is thought to be of more real world entitities/geared? Community Detection or Clustering?

What is thought to be more geared towards structural Properties such as Min-Cut?

What is Min-Cut anyway?

For modeling, understanding and analysis purpose - can we consider them the same/similar? Community Detection or Clustering?
Ans: Yes

What is Cluster Analysis i.e. Community Detection process? i.e. what will be your process to identify clusters/communities?

Provide the names of some Community Detection Algorithms. Can you explain them as well? Can you formally define/represent and/or show pictorially what they do and how they work (steps as well)? Check the answer section, after you give it a try. You must have studied them in Computer Networks or related courses/concepts (Data Communication, CCNA, CCNP, Vehicle Routing)

How can you evaluate that the detected communities/clusters are good/great communities/clusters?

Answers:
What is a community?
a subset of nodes within the graph such that connections between the nodes (in the subset) are denser than connections with the rest of the network (i.e. from the subset to the outer nodes)

What is Cluster Analysis i.e. Community Detection process? i.e. what will be your process to identify clusters/communities?
"The process of dividing nodes of a graph into possibly overlapping, subsets, where nodes in each subset are considered related by some similarity measure"

Provide the names of some Community Detection Algorithms
Maximal-Clique Enumeration, K- Spanning Tree, Shared Nearest Neighbor, Highly Connected Components, MinCut, Betweenness Based Algorithms, Louvain Modularity, CNM Algorithm

Resources:
Defining and identifying communities in networks
https://www.pnas.org/content/101/9/2658

Community Structure:
https://en.wikipedia.org/wiki/Community_structure

Graph Clustering:
https://www.csc2.ncsu.edu/faculty/nfsamato/practical-graph-mining-with-R/slides/pdf/Graph_Cluster_Analysis.pdf

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz

Graph Mining: SMALL WORLD GRAPHS & RANDOM GRAPH GENERATORS: Learn by finding answers to the following questions.

Graph Mining - 002: SMALL WORLD GRAPHS & RANDOM GRAPH GENERATORS

Graph Mining: SMALL WORLD GRAPHS & RANDOM GRAPH GENERATORS: Learn by finding answers to the following questions.

Can you answer the following?

What is a Graph?

Can you define graph formally?

What are the components in a Graph?

What are edges and nodes or vertices in a Graph?

Can you provide examples of Graphs? in Areas such as Science, Engineering, Medical, Biology, Social Sciences, Internet Applications, Social Media?

Can you draw a simple graph?

What is a Random Graph?

What do you use to answer questions about the properties of typical graphs?

What is the purpose of a Random Graph?

What types of graphs you use to answer questions about the properties of typical graphs?

How can you generate a Random Graph?

What is known as Small World Properties for Graphs?

Give some examples of Real Graphs

What is the connection between Real Graphs and Small World Properties?

What are some properties of Small World Networks?

Can you give examples of Non Small World Networks?

For small World Graphs - what will be the average path lengths (count of vertices) - a large number of a small number?

Are small world graphs sparse or dense?

What is power law distribution? How does that related to Small World Graphs?

How can you relate Power Law Distribution to Github network?

What is power law in statistics?

Can you name some Random Graph Generators? What is the purpose? What will they do actually in general? Why will you use these generators.

When edges are created between vertices with probability p : what is this mode of generation called? Is it a Random Graph? Why?

What happens when p = probability is closer to 0? what when p is closer to 1?

What is weight in a random graph Generator for example in the above case?

When a Graph is generated using the probability approach - what is the order of the maximum size graph?

Can you represent the graph generated above as G(n, p)? n = number of vertices, p = probability

For the above case, what happens when np < 1 ?

For the above case, what happens when np = 1 ? Can np be > 1?

When a Graph G(n, p) is generated using the probability approach as mentioned above - and when np < 1, what is the order of the size of a connected component? is it O(n), or o(n), O(log(n)), O(Olog(n)). Check answer section

What is Erdos Renyi? Is it the name of a Random Graph Generator or the name of a small world property or a statistical law?

For Erdos Renyi, is the average path length short?

For Erdos Renyi, do they exhibit local clustering?

Is local clustering a property of small world graphs?

For Erdos Renyi, does it show Skewed degree distribution i.e. skewed power law distribution?

For Erdos Renyi, does it show poission distribution?

What is poission distribution? anyway?

Can you relate poission distribution to Banks or a Diagnostic Center?

What is Watts–Strogatz model for Random Graph Generation? How does it work?

Does Watts–Strogatz model create a directed graph or an undirected graph?

For Watts–Strogatz model, if the desired Node count is N, then what will be the output edge count? You can introduce and define a new parameter if required.

For Watts–Strogatz, is the average path length short?

For Watts–Strogatz, do they exhibit local clustering?

Is local clustering a propery of small world graphs?

For Watts–Strogatz, does it show Skewed degree distribution i.e. skeweed power law distribution?

For Watts–Strogatz, does it show poission distribution?

how does Watts–Strogatz relate to Dirac Delta Function?

what is Barabási–Albert (BA) model? Is it the name of a Random Graph Generator or the name of a small world property or a statistical law?

How does Barabási–Albert (BA) model work? i.e. the steps i.e. the methods

For Barabási–Albert (BA) model, how is a new node added?

For Barabási–Albert (BA) model, does it start with an empty graph or with an already connected graph?

For Barabási–Albert (BA) model, how a new node is connected to an existing node? What does it depend upon? How does a probability play a role?

For Barabási–Albert (BA) model, which nodes will accumulate more links quickly?

For Barabási–Albert (BA) model, which nodes will starve for new links?

For Barabási–Albert (BA) model, is the average path length short?

For Barabási–Albert (BA) model, do they exhibit local clustering?

For Barabási–Albert (BA) model, does it show Skewed degree distribution i.e. skeweed power law distribution?

Answers:

How can you generate a Random Graph?
Start with n isolated vertices, Add successive edges between them at random

Give some examples of Real Graphs
Food webs, Electric power grids, Metabolite processing networks, Networks of brain neurons, Voter networks, Social influence networks, Co-occurrence networks, Networks of connected proteins

What are some properties of Small World Networks?
"Properties of small-world networks
Small-world networks tend to contain cliques, and near-cliques, meaning sub-networks which have connections between almost any two nodes within them. This follows from the defining property of a high clustering coefficient. Secondly, most pairs of nodes will be connected by at least one short path. This follows from the defining property that the mean-shortest path length be small. Several other properties are often associated with small-world networks. Typically there is an over-abundance of hubs – nodes in the network with a high number of connections (known as high degree nodes). These hubs serve as the common connections mediating the short path lengths between other edges. By analogy, the small-world network of airline flights has a small mean-path length"
Reference: https://en.wikipedia.org/wiki/Small-world_network#Examples_of_small-world_networks

Can you give examples of Non Small World Networks?
Ans: Check: Reference: https://en.wikipedia.org/wiki/Small-world_network#Examples_of_small-world_networks

What is power law in statistics?
"In statistics, a power law is a functional relationship between two quantities, where a relative change in one quantity results in a proportional relative change in the other quantity, independent of the initial size of those quantities: "
https://en.wikipedia.org/wiki/Power_law#Power-law_probability_distributions

Can you name some Random Graph Generators?
Erdős–Rényi model, Watts–Strogatz model, Barabási–Albert (BA) model

When a Graph is generated using the probability approach - what is the order of the maximum size graph? Check answer
n to the power 2/3

For Watts–Strogatz model, if the desired Node count is N, then what will be the output edge count? Ans: NK/2

Watts–Strogatz model
K = mean degree = an even number
B = a special parameter
N >> K >> ln(K) >> 1

Check degree distribution for Watts–Strogatz model at: https://en.wikipedia.org/wiki/Watts%E2%80%93Strogatz_model
"The degree distribution in the case of the ring lattice is just a Dirac delta function centered at {\displaystyle K}K. The degree distribution for {\displaystyle 0<\beta <1}0<\beta <1 can be written as:

Resources:
Graph mining - lesson 1: Introduction to graphs and networks
http://www.nathalievialaneix.eu/teaching/m2se/M2SE-network_1.pdf

Erdős–Rényi model
https://en.wikipedia.org/wiki/Erd%C5%91s%E2%80%93R%C3%A9nyi_model

Random Graphs
https://en.wikipedia.org/wiki/Random_graph

Examples of small-world networks
https://en.wikipedia.org/wiki/Small-world_network#Examples_of_small-world_networks

Case Study: Small World Phenomenon
https://introcs.cs.princeton.edu/java/45graph/

Random Graphs: Model of Social Networks
http://www.pnas.org/content/99/suppl_1/2566

Power Law Distribution
https://en.wikipedia.org/wiki/Power_law#Power-law_probability_distributions

Poisson distribution
https://en.wikipedia.org/wiki/Poisson_distribution

Watts–Strogatz model
https://en.wikipedia.org/wiki/Watts%E2%80%93Strogatz_model

Dirac delta function
Everywhere it is zero except at zero
https://en.wikipedia.org/wiki/Dirac_delta_function

Barabási–Albert (BA):
https://en.wikipedia.org/wiki/Barab%C3%A1si%E2%80%93Albert_model#Degree_distribution

Random Graph Generators in Python Libraries
https://networkx.github.io/documentation/networkx-1.10/reference/generators.html

Random Graph Generators in R Libraries
https://rpubs.com/lgadar/generate-graphs

By
Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz

Graph Mining: Link Prediction. Learn by finding answers to the following questions.

Graph Mining: Link Prediction. Learn by finding answers to the following questions.

What is Link Prediction in Graph Mining?

How can Link Prediction help? and where? can it help in social network, research co-authorship, or spread of disease

Give uses of Link Prediction in epidemiology or any medical/biology application?

Give uses of Link Prediction in Social Network?

Give uses of Link Prediction in co-authorship?

What are the steps in Link Prediction? i.e. define link prediction problem.

Define Link Prediction Problem formally i.e. with Notations i.e. with Graph notations i.e. Graph Representations.

How is node similarity related in Link Prediction?

What are possible Node Similarity algorithms that can be used for the purpose?

Which nodes are the most possible new links? think in terms of Node Similarity Index (Highest, Lowest)

What are the types of Node Similarity? Node proximity?

What is Node Similarity based upon? i.e. Network Topology

What is Local Structure for Node Similarity?

What is Global Structure for Node Similarity?

What are examples of Local Structure for Node Similarity?

What are examples of Global Structure for Node Similarity?

Define Node Neighborhoods. Is it local or global structure?

What is Preferential Attachment Index? Is it LocaL or global structure

What are the types of Node Neighborhoods?

If two nodes do not have any common node -- what is the probable similarity (Node Neigh. aspect)?

What is Common Neighbors ? Is it local or global structure? is it Node Neighbor or Preferential Attachment? and why?

What is Jaccard Coefficient ? Is it local or global structure? is it Node Neighbor or Preferential Attachment? and why?

What is Adamic-Adar? Is it local or global structure? is it Node Neighbor or Preferential Attachment? and why?

What is the name of the Node Similarity approach where Sum of the inverse logarithmic degree centrality of the neighbors shared by the two nodes are counted?

What is the name of the Node Similarity approach where the ratio of seize of set intersection to the set union is counted?

What is the name of the Node Similarity approach where the count of common nodes are used?

If node similarity score is counted as multiplication of the number/count of outgoing edges for a node pair. And then the higher results are assumed to create new links. What is this approach called?

When similarity scores are calculated based on global link structure of graph - what is this called local structure or global structure?

What is an example of Global Structure?

What are the examples of Global Structure?

What is Kartz Index for Global Structure?

What is Simrank for Global Structure?

When Node Similarity is calculated as: Sum of count of all paths between node pairs - what is this approach called? Then how is the link prediction made?

When Node Similarity is calculated as: two nodes are similar if they are referred by similar nodes. What is the name?

How can you measure if your implemented link prediction algorithm is great or not?

Can you use train and test concept for the measurement? Can you define the steps/problems formally? with Graph Notations?

What are some measures to calculate in the train/test approach?

Answers:
What is the name of the Node Similarity approach where Sum of the inverse logarithmic degree centrality of the neighbors shared by the two nodes are counted?
Ans: Adamic-Adar

What is the name of the Node Similarity approach where the ratio of seize of set intersection to the set union is counted?
Ans: Jaccard Coefficient

What is the name of the Node Similarity approach where the count of common nodes are used?
Ans: Common Neighbors

If node similarity score is counted as multiplication of the number/count of outgoing edges for a node pair. And then the higher results are assumed to create new links. What is this approach called?
Ans: Preferential Attachment Index

What is an example of Global Structure?
Ans: Path Length > 2

What are the examples of Global Structure?
Shortest Paths – use inverse of distance as similarity, Kartz Index, SimRank

What are some measures to calculate in the train/test approach?
Accuracy, F1-score, Sensitivity, Most metrics that would work for classification

Resources
Link Prediction
https://paperswithcode.com/task/link-prediction

Similarity Index based Link. Prediction Algorithms in Social Networks: A Survey
https://pdfs.semanticscholar.org/8e72/fa77f3d788f3c67da1e1c6347c3aaf280723.pdf

Proximity-based Methods for Link Prediction
https://cran.r-project.org/web/packages/linkprediction/vignettes/proxfun.html

Evaluating Link Prediction Methods
https://arxiv.org/pdf/1505.04094.pdf

Link Prediction Algorithm
http://be.amazd.com/link-prediction/

Evaluating link prediction methods
https://www3.nd.edu/~dial/publications/yang2015evaluating.pdf

By

Sayed Ahmed

Linkedin: https://ca.linkedin.com/in/sayedjustetc

Blog: http://Bangla.SaLearningSchool.com, http://SitesTree.com
Online and Offline Training: http://Training.SitesTree.com

Affiliate Links:
Hottest Deals on Amazon USA: http://tiny.cc/38lddz

Hottest Deals on Amazon CA: http://tiny.cc/bgnddz

Hottest Deals on Amazon Europe: http://tiny.cc/w4nddz