ztor2/gnn_patent_link_prediction

Prediction of Defense Science and Technology Convergence using Graph Neural Networks

codes: Contains the Jupyter notebook code files and module files used in the experiments.
data: Contains the data files used in the experiments.
results: Contains the experiment result files.

<code folder>

  • construct_original_graph.ipynb: Builds the IPC network graph from all of the raw data, so that the graph contains all available information.
  • construct_reduced_graph.ipynb: Builds the IPC network graph (reduced graph) used to validate link prediction for each model. IPCs that appear in the 2019-2020 data but not in the data up to 2018 are removed, because data absent from the training set cannot be predicted in the test data.
  • validation_baseline_sc&dw.ipynb: Validates the link prediction performance of the spectral clustering and DeepWalk models on the reduced graph.
  • validation_baseline_centrality_node_emb.ipynb: Validates the performance of the centrality-based link prediction model on the reduced graph.
  • validation_baseline_topological_edge_score.ipynb: Validates the performance of the link prediction model based on network topology indices on the reduced graph.
  • validation_gae.ipynb: Validates the link prediction performance of the graph autoencoder model on the reduced graph.
  • prediction_gae.ipynb: Uses the entire original graph as the training set to predict future links (after 2020).
  • results_analysis.ipynb: Adds the predicted links to the network as actual edges and analyzes how the network changes relative to the existing one, focusing on centrality.
  • gae folder: Contains the functions needed to build the graph autoencoder model.
  • sc_dw folder: Contains the functions needed to build the spectral clustering and DeepWalk models.
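
The filtering step described for construct_reduced_graph.ipynb can be illustrated with a minimal sketch. It is not the notebook's actual code: the record layout (a year plus a list of IPC codes per patent), the sample IPC codes, the helper name `cooccurrence_edges`, and the choice to link IPCs that co-occur on the same patent are all assumptions made for illustration.

```python
from itertools import combinations

# Hypothetical records: (year, IPC codes listed on the patent).
patents = [
    (2016, ["F41A", "F41G"]),
    (2017, ["F41G", "G01S"]),
    (2019, ["F41A", "G01S"]),
    (2020, ["G01S", "H04B"]),  # H04B never appears up to 2018
]

def cooccurrence_edges(records, allowed=None):
    """Return undirected edges between IPCs that share a patent.

    If `allowed` is given, IPCs outside that set are dropped first.
    """
    edges = set()
    for _, ipcs in records:
        kept = sorted({c for c in ipcs if allowed is None or c in allowed})
        edges.update(combinations(kept, 2))
    return edges

# Split by period: up to 2018 for training, 2019-2020 for validation.
train_records = [p for p in patents if p[0] <= 2018]
val_records = [p for p in patents if 2019 <= p[0] <= 2020]

# Only IPCs seen in the training period may appear in the validation graph.
train_nodes = {c for _, ipcs in train_records for c in ipcs}
train_edges = cooccurrence_edges(train_records)
val_edges = cooccurrence_edges(val_records, allowed=train_nodes)

print(sorted(train_edges))
print(sorted(val_edges))
```

In this toy data the 2020 patent contributes no validation edge, because one of its two IPCs is missing from the training period.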

<data folder>

  • add_patent.xlsx: Raw data listing the IPCs of each patent. (Excel file)
  • idx2nodes.pkl / nodes2idx.pkl: Dictionary files that map the string-typed IPCs to plain integer indices. The string node names are lost when the network data is fed into the autoencoder model, so these files are needed for later analysis of the results. (pickle type)
  • original.graph: The IPC network graph built from all of the raw data. (json type)
  • reduced_train.graph: The train graph of the reduced graph. (json type)
  • reduced_val.graph: The validation graph of the reduced graph. (json type)
  • val_edges.pkl / val_non_edges.pkl: Lists of the edges and non-edges of the validation graph. Needed to match the data format expected by the models. (pickle type)
  • val_edges_name.pkl / val_non_edges_name.pkl: The lists in val_edges.pkl / val_non_edges.pkl expressed with the original node names (IPCs). (Not actually used in the experiments) (pickle type)
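
As a rough illustration of why the idx2nodes / nodes2idx dictionaries are kept, the sketch below builds such a pair of mappings, round-trips one through pickle, and converts index-based edges back to IPC names. The sample IPC codes, the sorted index order, and the variable names are assumptions; the actual files in the data folder may be organized differently.

```python
import io
import pickle

# Hypothetical IPC node names; the real graph has many more.
ipc_codes = ["G01S", "F41A", "F41G"]

# Map each IPC string to an integer index and back.
nodes2idx = {code: i for i, code in enumerate(sorted(ipc_codes))}
idx2nodes = {i: code for code, i in nodes2idx.items()}

# Round-trip through pickle (an in-memory buffer stands in for a .pkl file).
buffer = io.BytesIO()
pickle.dump(idx2nodes, buffer)
buffer.seek(0)
restored = pickle.load(buffer)

# A model works on integer indices; map its output back to IPC names.
predicted_edges = [(0, 2)]
named_edges = [(restored[u], restored[v]) for u, v in predicted_edges]
print(named_edges)
```

The same lookup is what would turn index-based lists such as val_edges.pkl into name-based lists such as val_edges_name.pkl.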

<results folder>

  • SC_results_ADD_patent.json: Results of the spectral clustering validation experiment. (Not included in the paper)
  • DW_results_ADD_patent.json: Results of the DeepWalk validation experiment. (Not included in the paper)
  • node_emb_results_ADD_patent.json: Results of the validation experiment for the centrality-based link prediction model.
  • topo_edge_score_results_ADD_patent.json: Results of the validation experiment for the link prediction model based on topological features.
  • GAE_results_ADD_patent.json: Results of the validation experiment for the graph autoencoder link prediction model.
