## 23.2 Graph Terminology

A graph is $$G = (V, E)$$ consisting of vertices/nodes and edges/links.

Graph links can be represented in a matrix with individual nodes labelling the rows and columns.

An attributed graph has features associated with each node which can be captured in a data frame.

Graphs may be used for social networks where the nodes represent users, the edges identify friendship or following, the node attributes identify the user features and the node class labels are user groups or tags. Publicly available datasets include flickr and blogcatalog.

Graphs may be used for citation networks where the nodes represent research papers, edges capture citation links, the node attributes are the keywords vectors generated from the research paper or its abstract, and the node class labels capture the area of research. Publicly available datasets include cora, citeseer, and pubmed.

Your donation will support ongoing development and give you access to the PDF version of this book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984.