Simulated Innovation Network
Nodes: 30
Edges: 57
Average degree: 3.8
Clustering coefficient: 0.1287554
Average path length: 2.475862
A Guide to Innovation Ecosystem Analysis through Network Science
Janpieter van der Pol
March 25, 2026
Connecting Knowledge, Code, and Innovation Ecosystems
An open educational resource for researchers, students, and practitioners who want to understand and analyse innovation ecosystems using network science.
NetworkIsLife is an open educational website created by Janpieter van der Pol that bridges the gap between network theory and applied innovation ecosystem analysis. It provides a self-contained learning environment combining conceptual foundations, hands-on R programming, and structured learning tracks β all aimed at equipping researchers, students, and policy analysts with the tools to study how innovation actually happens: through connections.
The central thesis of the site mirrors a growing consensus in innovation studies: innovation is not a solo act β it is a networked phenomenon. Understanding the structure, dynamics, and properties of these networks is essential for anyone working in science, technology, and innovation (STI) policy, regional development, entrepreneurship research, or applied data science.
Why Networks? Innovation ecosystems are defined by relationships β between firms, universities, investors, policymakers, and knowledge. Network analysis provides the quantitative toolkit to make those relationships visible, measurable, and comparable.
The site is organised into four main sections, each serving a distinct but complementary purpose:
| Section | Purpose |
|---|---|
| π΅ Network Theory | Conceptual and mathematical foundations of network analysis |
| π¦ R Package | Custom R tooling for innovation ecosystem data and analysis |
| π Data Science Track | Applied data science methods for STI/innovation research |
| π ESM Track | Innovation ecosystem mapping using structural methods |
The Network Theory section provides the conceptual backbone for everything else on the site. It introduces the mathematical and structural concepts that underpin all network-based analyses of innovation ecosystems.
Key topics typically covered in this strand include:
Innovation ecosystems are, at their core, networks of relationships. A universityβindustry collaboration graph, a patent co-citation network, or a co-authorship network can all be modelled and analysed using the same underlying machinery. Network theory provides the common language for doing so rigorously.
# Example: Computing basic network statistics with igraph
library(igraph)
# Create a simple innovation network
g <- graph_from_data_frame(
d = data.frame(
from = c("Uni_A", "Firm_B", "Firm_B", "Lab_C", "Uni_A"),
to = c("Firm_B", "Lab_C", "Startup_D", "Uni_A", "Lab_C")
),
directed = FALSE
)
# Key metrics
degree(g) # How connected is each actor?
betweenness(g) # Who bridges different parts of the ecosystem?
transitivity(g) # How clustered is the network?
mean_distance(g) # Average separation between actorsThe site features a dedicated R package that provides convenience functions and data structures tailored to innovation ecosystem analysis. Rather than requiring users to piece together multiple general-purpose packages, the package offers a coherent set of tools designed specifically for the kinds of data and questions that arise in STI research.
The package is likely to include functionality for:
Reproducibility First β Packaging analytical workflows means that others can verify, replicate, and build on your innovation ecosystem analyses. This is especially valuable in policy-relevant research.
NetworkIsLife offers two structured learning tracks. These allow learners to follow a curated pathway through the material rather than navigating it ad hoc.
DS Track
Students and researchers who want to develop applied data science competencies in the context of STI and innovation analysis. This track assumes some prior exposure to R or data analysis, and progressively builds toward network-based methods.
The Data Science Track typically progresses through stages such as:
library(tidyverse)
library(igraph)
library(ggraph)
# Step 1: Load co-patent data
patents <- read_csv("co_patents.csv")
# Step 2: Build network
collab_net <- patents |>
select(applicant_1, applicant_2, n_patents) |>
graph_from_data_frame(directed = FALSE)
E(collab_net)$weight <- patents$n_patents
# Step 3: Visualise
ggraph(collab_net, layout = "fr") +
geom_edge_link(aes(width = weight), alpha = 0.4, colour = "#0f3460") +
geom_node_point(aes(size = degree(collab_net)), colour = "#e94560") +
geom_node_text(aes(label = name), repel = TRUE, size = 3) +
theme_graph() +
labs(title = "Patent Collaboration Network",
subtitle = "Node size = degree centrality; Edge width = joint patents")ESM Track
The Entrepreneurship & Systems Mapping (ESM) Track is oriented toward researchers and practitioners who want to map and analyse innovation ecosystems in a more applied, policy-facing context. It is likely suited to students of innovation management, economic geography, or science & technology studies.
This track moves from abstract network methods to concrete ecosystem analysis tasks:
# Classify actors by Quadruple Helix category
actors <- tibble(
name = c("TU Delft", "Philips", "NWO", "Rotterdam City", "StartupDelta"),
category = c("Academia", "Industry", "Government", "Government", "Civil Society")
)
# Merge into network for visualisation
V(ecosystem_net)$helix <- actors$category[match(V(ecosystem_net)$name, actors$name)]
ggraph(ecosystem_net, layout = "stress") +
geom_edge_link(alpha = 0.2) +
geom_node_point(aes(colour = helix), size = 5) +
scale_colour_manual(values = c(
"Academia" = "#2196F3",
"Industry" = "#FF5722",
"Government" = "#4CAF50",
"Civil Society" = "#9C27B0"
)) +
theme_graph() +
labs(title = "Innovation Ecosystem β Quadruple Helix View", colour = "Actor Type")Below is a concise reference of the core ideas that run throughout the site.
An innovation ecosystem is a community of interdependent actors β firms, universities, investors, public bodies, and intermediaries β whose interactions generate, diffuse, and apply new knowledge. Unlike linear models of innovation, the ecosystem metaphor emphasises co-evolution, interdependence, and emergence.
Centrality measures capture how important or influential a node is within the network. Different centrality metrics reflect different types of importance:
| Metric | What It Captures | Innovation Context |
|---|---|---|
| Degree | Number of direct connections | Most active collaborators |
| Betweenness | Bridging between groups | Knowledge brokers |
| Closeness | Average reach to all others | Information diffusion speed |
| Eigenvector | Connection to well-connected actors | Prestige/influence |
Innovation networks tend to cluster into communities β dense subgraphs with relatively sparse ties to the rest of the network. These often correspond to technological domains, geographic clusters, or research fields. Detecting them algorithmically (e.g.Β via the Louvain or Leiden algorithm) reveals the meso-structure of the ecosystem.
Actors that bridge otherwise disconnected parts of the ecosystem occupy structural holes (Burt, 1992). These brokers have a strategic advantage: they control information flows between groups and can recombine knowledge in novel ways β a classic mechanism of innovation.
Simulated Innovation Network
Nodes: 30
Edges: 57
Average degree: 3.8
Clustering coefficient: 0.1287554
Average path length: 2.475862
The site is valuable for several reasons:
For students β It provides a structured, code-first entry point into a methodologically demanding field. Rather than being overwhelmed by raw academic literature, learners can build up competency through guided exercises and reproducible examples.
For researchers β The R package and documented workflows reduce the time-to-analysis for common innovation data science tasks. Methods that might otherwise take weeks to implement from scratch become accessible in hours.
For practitioners and policymakers β The ESM Track in particular connects analytical outputs to real-world ecosystem mapping tasks, making it easier to translate network evidence into actionable insights about where to invest, which actors to connect, and how ecosystems are changing.
For the field β By making methods open and reproducible, NetworkIsLife contributes to cumulative, comparable science in the innovation studies tradition.
If you are new to the site, the following paths are suggested:
Start with Network Theory to build your conceptual foundations, then move to the Data Science Track from the beginning. Focus on getting comfortable with igraph and tidygraph in R before tackling metrics.
Jump into the Data Science Track at the network analysis modules. Consult the Network Theory pages as a reference when new concepts arise. Explore the R Package documentation in parallel.
Head directly to the ESM Track for ecosystem mapping workflows. Reference the Network Theory page for methodological grounding when needed.
The following packages form the core computational stack used across the site:
# Core network packages
install.packages(c(
"igraph", # The workhorse of network analysis in R
"tidygraph", # Tidy data principles for network objects
"ggraph", # Grammar of graphics for network visualisation
"visNetwork" # Interactive network visualisation
))
# Data science support
install.packages(c(
"tidyverse", # Data wrangling and visualisation
"bibliometrix", # Bibliometric and scientometric analysis
"ggplot2", # General-purpose visualisation
"janitor" # Data cleaning
))NetworkIsLife fills a genuine gap in the open educational landscape. Innovation ecosystem analysis sits at the intersection of network science, data science, and innovation studies β three fields that each have rich literatures but are rarely brought together in a practical, code-first format.
The site achieves this through four complementary components: theoretical grounding in Network Theory, reusable tools via the R Package, skill-building through the Data Science Track, and applied ecosystem methodology in the ESM Track. Together, these make network-based innovation analysis accessible to a much wider audience than the specialist academic literature alone can reach.
Whether you are a PhD student mapping a regional innovation ecosystem, a policy analyst tracking how knowledge flows between universities and firms, or a data scientist building reproducible STI indicators, NetworkIsLife offers a coherent set of tools and concepts to support that work.
*This document was generated with Quarto.
---
title: "NetworkIsLife"
subtitle: "A Guide to Innovation Ecosystem Analysis through Network Science"
author: "Janpieter van der Pol"
date: today
format:
html:
theme: cosmo
toc: true
toc-depth: 3
toc-title: "Contents"
number-sections: true
smooth-scroll: true
code-fold: true
code-tools: true
highlight-style: github
fig-align: center
embed-resources: true
css: |
body { max-width: 960px; margin: auto; }
.hero { background: linear-gradient(135deg, #1a1a2e 0%, #16213e 50%, #0f3460 100%);
color: white; padding: 3rem 2rem; border-radius: 12px; margin-bottom: 2rem; text-align: center; }
.hero h2 { color: #e94560; font-size: 2rem; }
.hero p { font-size: 1.1rem; opacity: 0.9; }
.card { border: 1px solid #dee2e6; border-radius: 8px; padding: 1.5rem;
margin-bottom: 1rem; box-shadow: 0 2px 8px rgba(0,0,0,0.07); }
.card h4 { color: #0f3460; margin-top: 0; }
.track-badge { display: inline-block; background: #e94560; color: white;
padding: 0.2rem 0.7rem; border-radius: 20px; font-size: 0.85rem;
font-weight: bold; margin-right: 0.5rem; }
.callout-note { border-left: 4px solid #0f3460; }
bibliography: Biblio5G.bib
---

::: {.hero}
# π NetworkIsLife
**Connecting Knowledge, Code, and Innovation Ecosystems**
*An open educational resource for researchers, students, and practitioners who want to understand and analyse innovation ecosystems using network science.*
[Visit the Website](https://jpvdp.github.io/NetworkIsLife/){.btn .btn-primary target="_blank"}
:::
---
# Overview
**NetworkIsLife** is an open educational website created by **Janpieter van der Pol** that bridges the gap between network theory and applied innovation ecosystem analysis. It provides a self-contained learning environment combining conceptual foundations, hands-on R programming, and structured learning tracks β all aimed at equipping researchers, students, and policy analysts with the tools to study how innovation actually happens: through connections.
The central thesis of the site mirrors a growing consensus in innovation studies: *innovation is not a solo act β it is a networked phenomenon*. Understanding the structure, dynamics, and properties of these networks is essential for anyone working in science, technology, and innovation (STI) policy, regional development, entrepreneurship research, or applied data science.
::: {.callout-note}
**Why Networks?** Innovation ecosystems are defined by relationships β between firms, universities, investors, policymakers, and knowledge. Network analysis provides the quantitative toolkit to make those relationships visible, measurable, and comparable.
:::
---
# Website Structure
The site is organised into four main sections, each serving a distinct but complementary purpose:
| Section | Purpose |
|---|---|
| π΅ **Network Theory** | Conceptual and mathematical foundations of network analysis |
| π¦ **R Package** | Custom R tooling for innovation ecosystem data and analysis |
| π **Data Science Track** | Applied data science methods for STI/innovation research |
| π **ESM Track** | Innovation ecosystem mapping using structural methods |
---
# Network Theory
## What Is Covered
The **Network Theory** section provides the conceptual backbone for everything else on the site. It introduces the mathematical and structural concepts that underpin all network-based analyses of innovation ecosystems.
Key topics typically covered in this strand include:
- **Graph fundamentals** β nodes, edges, directed vs. undirected networks, weighted ties
- **Network metrics** β degree, betweenness centrality, clustering coefficient, path length
- **Community structure** β detecting clusters and sub-groups within larger networks
- **Scale-free and small-world properties** β why innovation networks tend to have hubs
- **Bipartite and multilayer networks** β modelling heterogeneous ecosystems (e.g. firms *and* technologies)
- **Network dynamics** β how innovation networks evolve over time
## Why It Matters for Innovation
Innovation ecosystems are, at their core, networks of relationships. A universityβindustry collaboration graph, a patent co-citation network, or a co-authorship network can all be modelled and analysed using the same underlying machinery. Network theory provides the common language for doing so rigorously.
```r
# Example: Computing basic network statistics with igraph
library(igraph)
# Create a simple innovation network
g <- graph_from_data_frame(
d = data.frame(
from = c("Uni_A", "Firm_B", "Firm_B", "Lab_C", "Uni_A"),
to = c("Firm_B", "Lab_C", "Startup_D", "Uni_A", "Lab_C")
),
directed = FALSE
)
# Key metrics
degree(g) # How connected is each actor?
betweenness(g) # Who bridges different parts of the ecosystem?
transitivity(g) # How clustered is the network?
mean_distance(g) # Average separation between actors
```
---
# R Package
## Purpose
The site features a dedicated **R package** that provides convenience functions and data structures tailored to innovation ecosystem analysis. Rather than requiring users to piece together multiple general-purpose packages, the package offers a coherent set of tools designed specifically for the kinds of data and questions that arise in STI research.
## What It Offers
The package is likely to include functionality for:
- **Data ingestion** β loading and cleaning common innovation data sources (patents, publications, funding databases)
- **Network construction** β converting relational data into graph objects ready for analysis
- **Ecosystem metrics** β pre-built functions for computing innovation-relevant network statistics
- **Visualisation** β ggraph/igraph-based plotting helpers for publication-quality network figures
- **Reproducible workflows** β wrapper functions that make multi-step analyses scriptable and shareable
## Installation
```r
# Install from GitHub
# install.packages("remotes")
remotes::install_github("jpvdp/NetworkIsLife")
library(NetworkIsLife)
```
::: {.callout-tip}
**Reproducibility First** β Packaging analytical workflows means that others can verify, replicate, and build on your innovation ecosystem analyses. This is especially valuable in policy-relevant research.
:::
---
# Learning Tracks
NetworkIsLife offers two structured learning tracks. These allow learners to follow a curated pathway through the material rather than navigating it ad hoc.
---
## Data Science Track {.unnumbered}
::: {.track-badge}
DS Track
:::
### Audience
Students and researchers who want to develop **applied data science competencies** in the context of STI and innovation analysis. This track assumes some prior exposure to R or data analysis, and progressively builds toward network-based methods.
### Learning Arc
The Data Science Track typically progresses through stages such as:
1. **Data Foundations** β working with STI datasets (OECD, Eurostat, patent offices, Web of Science)
2. **Exploratory Analysis** β summary statistics, distributions, and visualisation of innovation indicators
3. **Relational Data** β moving from tabular data to network representations
4. **Network Analysis** β applying graph metrics to innovation data
5. **Interpretation** β linking quantitative results back to innovation theory and policy
### Sample Workflow
```r
library(tidyverse)
library(igraph)
library(ggraph)
# Step 1: Load co-patent data
patents <- read_csv("co_patents.csv")
# Step 2: Build network
collab_net <- patents |>
select(applicant_1, applicant_2, n_patents) |>
graph_from_data_frame(directed = FALSE)
E(collab_net)$weight <- patents$n_patents
# Step 3: Visualise
ggraph(collab_net, layout = "fr") +
geom_edge_link(aes(width = weight), alpha = 0.4, colour = "#0f3460") +
geom_node_point(aes(size = degree(collab_net)), colour = "#e94560") +
geom_node_text(aes(label = name), repel = TRUE, size = 3) +
theme_graph() +
labs(title = "Patent Collaboration Network",
subtitle = "Node size = degree centrality; Edge width = joint patents")
```
---
## ESM Track {.unnumbered}
::: {.track-badge}
ESM Track
:::
### Audience
The **Entrepreneurship & Systems Mapping (ESM)** Track is oriented toward researchers and practitioners who want to **map and analyse innovation ecosystems** in a more applied, policy-facing context. It is likely suited to students of innovation management, economic geography, or science & technology studies.
### Focus Areas
This track moves from abstract network methods to concrete ecosystem analysis tasks:
- **Ecosystem boundary definition** β who belongs to the ecosystem, and why?
- **Stakeholder mapping** β identifying and categorising actors (Triple/Quadruple Helix)
- **Structural analysis** β core vs. periphery, broker organisations, isolated actors
- **Knowledge flow mapping** β tracing how ideas move through the ecosystem
- **Longitudinal analysis** β how ecosystems grow, contract, or restructure over time
- **Policy implications** β using network evidence to inform intervention strategies
### Ecosystem Mapping Example
```r
# Classify actors by Quadruple Helix category
actors <- tibble(
name = c("TU Delft", "Philips", "NWO", "Rotterdam City", "StartupDelta"),
category = c("Academia", "Industry", "Government", "Government", "Civil Society")
)
# Merge into network for visualisation
V(ecosystem_net)$helix <- actors$category[match(V(ecosystem_net)$name, actors$name)]
ggraph(ecosystem_net, layout = "stress") +
geom_edge_link(alpha = 0.2) +
geom_node_point(aes(colour = helix), size = 5) +
scale_colour_manual(values = c(
"Academia" = "#2196F3",
"Industry" = "#FF5722",
"Government" = "#4CAF50",
"Civil Society" = "#9C27B0"
)) +
theme_graph() +
labs(title = "Innovation Ecosystem β Quadruple Helix View", colour = "Actor Type")
```
---
# Key Concepts
Below is a concise reference of the core ideas that run throughout the site.
## Innovation Ecosystems
An **innovation ecosystem** is a community of interdependent actors β firms, universities, investors, public bodies, and intermediaries β whose interactions generate, diffuse, and apply new knowledge. Unlike linear models of innovation, the ecosystem metaphor emphasises co-evolution, interdependence, and emergence.
## Network Centrality
**Centrality** measures capture how important or influential a node is within the network. Different centrality metrics reflect different types of importance:
| Metric | What It Captures | Innovation Context |
|---|---|---|
| Degree | Number of direct connections | Most active collaborators |
| Betweenness | Bridging between groups | Knowledge brokers |
| Closeness | Average reach to all others | Information diffusion speed |
| Eigenvector | Connection to well-connected actors | Prestige/influence |
## Community Detection
Innovation networks tend to cluster into **communities** β dense subgraphs with relatively sparse ties to the rest of the network. These often correspond to technological domains, geographic clusters, or research fields. Detecting them algorithmically (e.g. via the Louvain or Leiden algorithm) reveals the meso-structure of the ecosystem.
```r
# Detect communities in an innovation network
library(igraph)
communities <- cluster_louvain(g)
V(g)$community <- membership(communities)
cat("Number of communities detected:", max(membership(communities)), "\n")
cat("Modularity:", modularity(communities), "\n")
```
## Structural Holes & Brokerage
Actors that bridge otherwise disconnected parts of the ecosystem occupy **structural holes** (Burt, 1992). These brokers have a strategic advantage: they control information flows between groups and can recombine knowledge in novel ways β a classic mechanism of innovation.
---
# Why This Resource Matters
```{r}
#| echo: false
#| warning: false
#| message: false
# This chunk illustrates how a simple network can reveal ecosystem structure
# Run this in R to reproduce the figure
library(igraph)
set.seed(42)
# Simulate a small innovation ecosystem
g_sim <- sample_pa(30, m = 2, directed = FALSE)
V(g_sim)$type <- sample(c("Firm", "University", "Government", "Intermediary"),
30, replace = TRUE, prob = c(0.5, 0.25, 0.15, 0.1))
cat("Simulated Innovation Network\n")
cat("Nodes:", vcount(g_sim), "\n")
cat("Edges:", ecount(g_sim), "\n")
cat("Average degree:", mean(degree(g_sim)), "\n")
cat("Clustering coefficient:", transitivity(g_sim), "\n")
cat("Average path length:", mean_distance(g_sim), "\n")
```
The site is valuable for several reasons:
**For students** β It provides a structured, code-first entry point into a methodologically demanding field. Rather than being overwhelmed by raw academic literature, learners can build up competency through guided exercises and reproducible examples.
**For researchers** β The R package and documented workflows reduce the time-to-analysis for common innovation data science tasks. Methods that might otherwise take weeks to implement from scratch become accessible in hours.
**For practitioners and policymakers** β The ESM Track in particular connects analytical outputs to real-world ecosystem mapping tasks, making it easier to translate network evidence into actionable insights about where to invest, which actors to connect, and how ecosystems are changing.
**For the field** β By making methods open and reproducible, NetworkIsLife contributes to cumulative, comparable science in the innovation studies tradition.
---
# Getting Started
## Recommended Entry Points
If you are new to the site, the following paths are suggested:
::: {.card}
#### π° Beginner β No Network Background
Start with **Network Theory** to build your conceptual foundations, then move to the **Data Science Track** from the beginning. Focus on getting comfortable with `igraph` and `tidygraph` in R before tackling metrics.
:::
::: {.card}
#### π Intermediate β Some R/Data Science Experience
Jump into the **Data Science Track** at the network analysis modules. Consult the **Network Theory** pages as a reference when new concepts arise. Explore the **R Package** documentation in parallel.
:::
::: {.card}
#### πΊοΈ Applied β Policy or Ecosystem Practitioner
Head directly to the **ESM Track** for ecosystem mapping workflows. Reference the **Network Theory** page for methodological grounding when needed.
:::
## Essential R Packages
The following packages form the core computational stack used across the site:
```r
# Core network packages
install.packages(c(
"igraph", # The workhorse of network analysis in R
"tidygraph", # Tidy data principles for network objects
"ggraph", # Grammar of graphics for network visualisation
"visNetwork" # Interactive network visualisation
))
# Data science support
install.packages(c(
"tidyverse", # Data wrangling and visualisation
"bibliometrix", # Bibliometric and scientometric analysis
"ggplot2", # General-purpose visualisation
"janitor" # Data cleaning
))
```
---
# Summary
NetworkIsLife fills a genuine gap in the open educational landscape. Innovation ecosystem analysis sits at the intersection of network science, data science, and innovation studies β three fields that each have rich literatures but are rarely brought together in a practical, code-first format.
The site achieves this through four complementary components: theoretical grounding in **Network Theory**, reusable tools via the **R Package**, skill-building through the **Data Science Track**, and applied ecosystem methodology in the **ESM Track**. Together, these make network-based innovation analysis accessible to a much wider audience than the specialist academic literature alone can reach.
Whether you are a PhD student mapping a regional innovation ecosystem, a policy analyst tracking how knowledge flows between universities and firms, or a data scientist building reproducible STI indicators, NetworkIsLife offers a coherent set of tools and concepts to support that work.
---
*This document was generated with [Quarto](https://quarto.org).