NetworkIsLife

A Guide to Innovation Ecosystem Analysis through Network Science

Author

Janpieter van der Pol

Published

March 25, 2026

1 🌐 NetworkIsLife

Connecting Knowledge, Code, and Innovation Ecosystems

An open educational resource for researchers, students, and practitioners who want to understand and analyse innovation ecosystems using network science.

Visit the Website


2 Overview

NetworkIsLife is an open educational website created by Janpieter van der Pol that bridges the gap between network theory and applied innovation ecosystem analysis. It provides a self-contained learning environment combining conceptual foundations, hands-on R programming, and structured learning tracks β€” all aimed at equipping researchers, students, and policy analysts with the tools to study how innovation actually happens: through connections.

The central thesis of the site mirrors a growing consensus in innovation studies: innovation is not a solo act β€” it is a networked phenomenon. Understanding the structure, dynamics, and properties of these networks is essential for anyone working in science, technology, and innovation (STI) policy, regional development, entrepreneurship research, or applied data science.

Note

Why Networks? Innovation ecosystems are defined by relationships β€” between firms, universities, investors, policymakers, and knowledge. Network analysis provides the quantitative toolkit to make those relationships visible, measurable, and comparable.


3 Website Structure

The site is organised into four main sections, each serving a distinct but complementary purpose:

Section Purpose
πŸ”΅ Network Theory Conceptual and mathematical foundations of network analysis
πŸ“¦ R Package Custom R tooling for innovation ecosystem data and analysis
πŸ“Š Data Science Track Applied data science methods for STI/innovation research
πŸš€ ESM Track Innovation ecosystem mapping using structural methods

4 Network Theory

4.1 What Is Covered

The Network Theory section provides the conceptual backbone for everything else on the site. It introduces the mathematical and structural concepts that underpin all network-based analyses of innovation ecosystems.

Key topics typically covered in this strand include:

  • Graph fundamentals β€” nodes, edges, directed vs.Β undirected networks, weighted ties
  • Network metrics β€” degree, betweenness centrality, clustering coefficient, path length
  • Community structure β€” detecting clusters and sub-groups within larger networks
  • Scale-free and small-world properties β€” why innovation networks tend to have hubs
  • Bipartite and multilayer networks β€” modelling heterogeneous ecosystems (e.g.Β firms and technologies)
  • Network dynamics β€” how innovation networks evolve over time

4.2 Why It Matters for Innovation

Innovation ecosystems are, at their core, networks of relationships. A university–industry collaboration graph, a patent co-citation network, or a co-authorship network can all be modelled and analysed using the same underlying machinery. Network theory provides the common language for doing so rigorously.

# Example: Computing basic network statistics with igraph
library(igraph)

# Create a simple innovation network
g <- graph_from_data_frame(
  d = data.frame(
    from = c("Uni_A", "Firm_B", "Firm_B", "Lab_C", "Uni_A"),
    to   = c("Firm_B", "Lab_C",  "Startup_D", "Uni_A", "Lab_C")
  ),
  directed = FALSE
)

# Key metrics
degree(g)               # How connected is each actor?
betweenness(g)          # Who bridges different parts of the ecosystem?
transitivity(g)         # How clustered is the network?
mean_distance(g)        # Average separation between actors

5 R Package

5.1 Purpose

The site features a dedicated R package that provides convenience functions and data structures tailored to innovation ecosystem analysis. Rather than requiring users to piece together multiple general-purpose packages, the package offers a coherent set of tools designed specifically for the kinds of data and questions that arise in STI research.

5.2 What It Offers

The package is likely to include functionality for:

  • Data ingestion β€” loading and cleaning common innovation data sources (patents, publications, funding databases)
  • Network construction β€” converting relational data into graph objects ready for analysis
  • Ecosystem metrics β€” pre-built functions for computing innovation-relevant network statistics
  • Visualisation β€” ggraph/igraph-based plotting helpers for publication-quality network figures
  • Reproducible workflows β€” wrapper functions that make multi-step analyses scriptable and shareable

5.3 Installation

# Install from GitHub
# install.packages("remotes")
remotes::install_github("jpvdp/NetworkIsLife")

library(NetworkIsLife)
Tip

Reproducibility First β€” Packaging analytical workflows means that others can verify, replicate, and build on your innovation ecosystem analyses. This is especially valuable in policy-relevant research.


6 Learning Tracks

NetworkIsLife offers two structured learning tracks. These allow learners to follow a curated pathway through the material rather than navigating it ad hoc.


Data Science Track

DS Track

6.0.1 Audience

Students and researchers who want to develop applied data science competencies in the context of STI and innovation analysis. This track assumes some prior exposure to R or data analysis, and progressively builds toward network-based methods.

6.0.2 Learning Arc

The Data Science Track typically progresses through stages such as:

  1. Data Foundations β€” working with STI datasets (OECD, Eurostat, patent offices, Web of Science)
  2. Exploratory Analysis β€” summary statistics, distributions, and visualisation of innovation indicators
  3. Relational Data β€” moving from tabular data to network representations
  4. Network Analysis β€” applying graph metrics to innovation data
  5. Interpretation β€” linking quantitative results back to innovation theory and policy

6.0.3 Sample Workflow

library(tidyverse)
library(igraph)
library(ggraph)

# Step 1: Load co-patent data
patents <- read_csv("co_patents.csv")

# Step 2: Build network
collab_net <- patents |>
  select(applicant_1, applicant_2, n_patents) |>
  graph_from_data_frame(directed = FALSE)

E(collab_net)$weight <- patents$n_patents

# Step 3: Visualise
ggraph(collab_net, layout = "fr") +
  geom_edge_link(aes(width = weight), alpha = 0.4, colour = "#0f3460") +
  geom_node_point(aes(size = degree(collab_net)), colour = "#e94560") +
  geom_node_text(aes(label = name), repel = TRUE, size = 3) +
  theme_graph() +
  labs(title = "Patent Collaboration Network",
       subtitle = "Node size = degree centrality; Edge width = joint patents")

ESM Track

ESM Track

6.0.4 Audience

The Entrepreneurship & Systems Mapping (ESM) Track is oriented toward researchers and practitioners who want to map and analyse innovation ecosystems in a more applied, policy-facing context. It is likely suited to students of innovation management, economic geography, or science & technology studies.

6.0.5 Focus Areas

This track moves from abstract network methods to concrete ecosystem analysis tasks:

  • Ecosystem boundary definition β€” who belongs to the ecosystem, and why?
  • Stakeholder mapping β€” identifying and categorising actors (Triple/Quadruple Helix)
  • Structural analysis β€” core vs.Β periphery, broker organisations, isolated actors
  • Knowledge flow mapping β€” tracing how ideas move through the ecosystem
  • Longitudinal analysis β€” how ecosystems grow, contract, or restructure over time
  • Policy implications β€” using network evidence to inform intervention strategies

6.0.6 Ecosystem Mapping Example

# Classify actors by Quadruple Helix category
actors <- tibble(
  name     = c("TU Delft", "Philips", "NWO", "Rotterdam City", "StartupDelta"),
  category = c("Academia", "Industry", "Government", "Government", "Civil Society")
)

# Merge into network for visualisation
V(ecosystem_net)$helix <- actors$category[match(V(ecosystem_net)$name, actors$name)]

ggraph(ecosystem_net, layout = "stress") +
  geom_edge_link(alpha = 0.2) +
  geom_node_point(aes(colour = helix), size = 5) +
  scale_colour_manual(values = c(
    "Academia"      = "#2196F3",
    "Industry"      = "#FF5722",
    "Government"    = "#4CAF50",
    "Civil Society" = "#9C27B0"
  )) +
  theme_graph() +
  labs(title = "Innovation Ecosystem β€” Quadruple Helix View", colour = "Actor Type")

7 Key Concepts

Below is a concise reference of the core ideas that run throughout the site.

7.1 Innovation Ecosystems

An innovation ecosystem is a community of interdependent actors β€” firms, universities, investors, public bodies, and intermediaries β€” whose interactions generate, diffuse, and apply new knowledge. Unlike linear models of innovation, the ecosystem metaphor emphasises co-evolution, interdependence, and emergence.

7.2 Network Centrality

Centrality measures capture how important or influential a node is within the network. Different centrality metrics reflect different types of importance:

Metric What It Captures Innovation Context
Degree Number of direct connections Most active collaborators
Betweenness Bridging between groups Knowledge brokers
Closeness Average reach to all others Information diffusion speed
Eigenvector Connection to well-connected actors Prestige/influence

7.3 Community Detection

Innovation networks tend to cluster into communities β€” dense subgraphs with relatively sparse ties to the rest of the network. These often correspond to technological domains, geographic clusters, or research fields. Detecting them algorithmically (e.g.Β via the Louvain or Leiden algorithm) reveals the meso-structure of the ecosystem.

# Detect communities in an innovation network
library(igraph)

communities <- cluster_louvain(g)
V(g)$community <- membership(communities)

cat("Number of communities detected:", max(membership(communities)), "\n")
cat("Modularity:", modularity(communities), "\n")

7.4 Structural Holes & Brokerage

Actors that bridge otherwise disconnected parts of the ecosystem occupy structural holes (Burt, 1992). These brokers have a strategic advantage: they control information flows between groups and can recombine knowledge in novel ways β€” a classic mechanism of innovation.


8 Why This Resource Matters

Simulated Innovation Network
Nodes: 30 
Edges: 57 
Average degree: 3.8 
Clustering coefficient: 0.1287554 
Average path length: 2.475862 

The site is valuable for several reasons:

For students β€” It provides a structured, code-first entry point into a methodologically demanding field. Rather than being overwhelmed by raw academic literature, learners can build up competency through guided exercises and reproducible examples.

For researchers β€” The R package and documented workflows reduce the time-to-analysis for common innovation data science tasks. Methods that might otherwise take weeks to implement from scratch become accessible in hours.

For practitioners and policymakers β€” The ESM Track in particular connects analytical outputs to real-world ecosystem mapping tasks, making it easier to translate network evidence into actionable insights about where to invest, which actors to connect, and how ecosystems are changing.

For the field β€” By making methods open and reproducible, NetworkIsLife contributes to cumulative, comparable science in the innovation studies tradition.


9 Getting Started

9.1 Recommended Entry Points

If you are new to the site, the following paths are suggested:

9.1.0.1 πŸ”° Beginner β€” No Network Background

Start with Network Theory to build your conceptual foundations, then move to the Data Science Track from the beginning. Focus on getting comfortable with igraph and tidygraph in R before tackling metrics.

9.1.0.2 πŸ“Š Intermediate β€” Some R/Data Science Experience

Jump into the Data Science Track at the network analysis modules. Consult the Network Theory pages as a reference when new concepts arise. Explore the R Package documentation in parallel.

9.1.0.3 πŸ—ΊοΈ Applied β€” Policy or Ecosystem Practitioner

Head directly to the ESM Track for ecosystem mapping workflows. Reference the Network Theory page for methodological grounding when needed.

9.2 Essential R Packages

The following packages form the core computational stack used across the site:

# Core network packages
install.packages(c(
  "igraph",       # The workhorse of network analysis in R
  "tidygraph",    # Tidy data principles for network objects
  "ggraph",       # Grammar of graphics for network visualisation
  "visNetwork"    # Interactive network visualisation
))

# Data science support
install.packages(c(
  "tidyverse",    # Data wrangling and visualisation
  "bibliometrix", # Bibliometric and scientometric analysis
  "ggplot2",      # General-purpose visualisation
  "janitor"       # Data cleaning
))

10 Summary

NetworkIsLife fills a genuine gap in the open educational landscape. Innovation ecosystem analysis sits at the intersection of network science, data science, and innovation studies β€” three fields that each have rich literatures but are rarely brought together in a practical, code-first format.

The site achieves this through four complementary components: theoretical grounding in Network Theory, reusable tools via the R Package, skill-building through the Data Science Track, and applied ecosystem methodology in the ESM Track. Together, these make network-based innovation analysis accessible to a much wider audience than the specialist academic literature alone can reach.

Whether you are a PhD student mapping a regional innovation ecosystem, a policy analyst tracking how knowledge flows between universities and firms, or a data scientist building reproducible STI indicators, NetworkIsLife offers a coherent set of tools and concepts to support that work.


*This document was generated with Quarto.