CS224W: Machine Learning with Graphs - 01 Introduction

本文详细探讨了传统图论方法、节点嵌入技术、GNN模型(如GCN、GraphSAGE和GAT)、知识图谱推理(TransE和BetaE)以及在生物医学、科学和工业中的应用。涵盖了节点分类、链接预测、图分类、社区检测和图生成等经典任务,以及网络组件、图的定向性和度量,以及不同类型的图和连接性概念。

Introduction

1. Course Outline

1). Traditional methods: Graphlets: Graph kernels
2). Methods for node embeddings: DeepWalk, Node2Vec
3). Graph Neural Networks: GCN, GraphSAGE, GAT, Theory of GNNs
4). Knowledge graphs and reasoning: TransE, BetaE
5). Deep generative models for graphs
6). Applications to Biomedicine, Science, Industry

2. Different Types of Tasks

  • Node level
  • Edge level
  • Community (subgraph) level
  • Graph-level prediction, Graph generation

3. Classic Graph ML Tasks

1). Node classification: predict a property of a node
  • Example: Categorize online users/items
2). Link prediction: predict whether there are missing links
  • Example: Knowledge graph completion
3). Graph classification: categorize different graphs
  • Example: Molecule property prediction
4). Clustering: detect if nodes form a community
  • Example: Social circle detection
5). Graph generation: drug discovery
6). Physical simulation

3. Components of a Network

  • Objects: nodes, vertices NNN
  • Interactions: links, edges EEE
  • Systems: network, graph G(N,E)G(N,E)G(N,E)

4. Directed vs Undirected Graphs

  • Undirected
    Links: undirected (symmetrical, reciprocal)
    Examples: Collaborations, Friendship on Facebook
  • Directed
    Links: directed (arcs)
    Examples: Phone calls, Following on Twitter

5. Node Degrees

  • Undirected
    Node degree kik_iki: the number of edges adjacent to node iii
    Average degree: kˉ=2EN\bar k=\frac{2E}{N}kˉ=N2E
  • Directed
    The total degree of a node is the sum of in-degrees and out-degrees.
    kˉ=EN\bar k=\frac{E}{N}kˉ=NE kˉin=kˉout\bar k^{in}=\bar k^{out}kˉin=kˉout

6. Representing Graphs

  • Adjacency Matrix
    Aij=1A_{ij}=1Aij=1 if there is a link from node iii to node jjj
    Aij=0A_{ij}=0Aij=0 otherwise
    The adjacency matrix of an undirected graph is symmtrical (Aij=AjiA_{ij}=A_{ji}Aij=Aji) but that of a directed graph may not (Aij≠AjiA_{ij}\neq A_{ji}Aij=Aji).
    Adjacency Matrices are sparse (filled with zeros).
  • Edge List
    (Ni,Nj)(N_i, N_j)(Ni,Nj): an edge from node iii to node jjj
  • Adjacency List
    Easier to work with if network is large and sparse.
    Allow us to quickly retrieve all neighbors of a give node
    NiN_iNi: its neighors

7. Node and Edge Attributes

  • Weight (e.g., frequency of communication)
  • Ranking (best friend, second best friend…)
  • Type (friend, relative, co-worker)
  • Sign: Friend vs. Foe, Trust vs. Distrust
  • Properties depending on the structure of the rest of the graph: Number of common friends

8. More Types of Graphs

  • Unweighted graphs: Aij∈{0,1}A_{ij} \in \{0, 1\}Aij{0,1}; Weighted graphs: Aij∈RA_{ij} \in RAijR
  • Self-edges (self-loops): Aii≠0A_{ii}\neq 0Aii=0

9. Connectivity of Undirected Graphs

Connected graph: Any two vertices can be joined by a path
A disconnected graph is made up by two or more connected components

10. Connectivity of Directed Graphs

Strongly connected directed graph has a path from each node to every other node.
Weakly connected directed graph is connected if we disregard the edge directions.

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值