mamoc

Mapping Microsoft's Investor Network for DBBA Capital

Cover image for Mapping Microsoft's Investor Network for DBBA Capital

Why map Microsoft's network?

To brief DBBA Capital on Microsoft (MSFT), this article traces how the stock sits inside a global investor–stock network. The study spans quarterly holdings (2016–2023) and monthly returns (2018–2024), looking for structures that could justify confidence in, or hedging against, MSFT. Theproject’s emphasis is on investor behaviour, network topology, and the unusual conditions created by COVID-19.

Building the bipartite investor–stock graph

The raw data lists institutional investors and their quarterly positions. I modelled that as a bipartite network with investors on one side and stocks on the other and used edge weights to track how many quarters an investor held a position, then coloured each community via the Louvain modularity heuristic.1

Preprocessing pipeline

  • Abbreviated investor names and dropped missing records to normalise identifiers.
  • Constructed an investor-by-stock matrix so each entry counts quarters invested.
  • Generated a weighted bipartite networkx graph from the matrix for downstream analysis.
partitions = nx.algorithms.community.louvain_communities(
    B,
    weight="weight",
    resolution=20,
    threshold=1e-07,
    max_level=None,
    seed=6772,
)
Bipartite investor–stock graph. Nodes are coloured by Louvain community and edges are weighted by quarters invested.
Figure 1:
Bipartite investor–stock graph. Nodes are coloured by Louvain community and edges are weighted by quarters invested.

The layout makes two immediate points. First, most investors concentrate on a narrow slice of the market, reinforcing the concept of a high-conviction “safe” stock set. Second, MSFT already appears central because high-weight edges radiate from its community.

Ego networks: investors vs. stocks

An MSFT-centred ego network on the bipartite graph (EGO1) retains investor identities, so each edge encodes the number of quarters an investor held MSFT.2 Roughly 43% of investors held MSFT at least once, but the holding lengths vary widely (mean 6.17 quarters, standard deviation 8.78), highlighting divergent risk tolerances.

Bipartite ego graph (EGO1) centred on MSFT, with edge colour marking holding length.
Figure 2:
Bipartite ego graph (EGO1) centred on MSFT, with edge colour marking holding length.

Projecting the network onto the stock side (EGO2) replaces investors with stocks that share investors. MSFT suddenly has hundreds of neighbours, yet most edges have weight one, meaning those stocks only share a single investor.

Stock-side ego projection (EGO2). MSFT is surrounded by dense technology and finance cliques.
Figure 3:
Stock-side ego projection (EGO2). MSFT is surrounded by dense technology and finance cliques.
GraphNodesEdgesEdge weight meaning
EGO1 (bipartite)MSFT + investorsInvestor → MSFTQuarters held
Stock projectionStocks (global universe)Stock ↔ stockShared investors (count)
EGO2MSFT + neighbouring stocksStock ↔ stockShared investors (count)

The comparison shows why EGO1 is better for reading the depth of conviction by investor, while EGO2 reveals the breadth of association across correlated stocks.

Filtering the projection

The full stock projection contains 1,529 stocks and 108,956 edges. To make the structure interpretable I applied thresholding on degree (> 400) and shared-investor weight (> 5). That removes weak ties and exposes a dense core of technology and finance names tightly connected to MSFT.

Stock-side projection before and after thresholding. The filtered version highlights the high-conviction clique around MSFT.
Figure 4:
Stock-side projection before and after thresholding. The filtered version highlights the high-conviction clique around MSFT.

The filtered graph suggests DBBA Capital can treat MSFT as a proxy for a larger tech-finance block: META, GOOGL, AMZN, JPM, and UNH all sit in the immediate neighbourhood and share investor flows.

Pandemic-era dynamics

Quarter-by-quarter statistics reveal how COVID-19 distorted the MSFT subnetwork.3 While raw node and edge counts stay flat, density, clustering, and average degree fall for MSFT’s ego projection during 2020, reflecting investors consolidating into perceived “safe” stocks. Yet those same metrics remain higher than the market-wide baseline, so MSFT never loses its central footing.

Temporal evolution of network statistics for the full projection (IQP) and MSFT ego projection (IQEP).
Figure 5:
Temporal evolution of network statistics for the full projection (IQP) and MSFT ego projection (IQEP).
QuantityInformative?Takeaway for MSFT
DensityDrops during COVID-19 as portfolios narrow, but remains above IQP.
Clustering coefficientDivergent trend confirms a tech/finance clique around MSFT.
Degree / StrengthStrong positive trend and sharp correction confined to MSFT ego.
AssortativityNo decisive change for MSFT.

The post-pandemic reversion across density, degree, and strength supports the hypothesis that 2020 behaviour was an anomaly rather than a lasting regime shift.

Centrality in a non-transitive world

Because edges encode “shared investors”, relationships are non-transitive: if A and B share an investor and B and C share an investor, nothing guarantees A and C are related. That undermines path-based centrality metrics.4

CentralityWhy it struggles here
ClosenessShortest paths do not correspond to meaningful influence.
BetweennessOverstates importance of nodes sitting on accidental paths.
DegreeCaptures popularity but ignores neighbour quality.
Eigenvector Relies on direct neighbours and inherits their influence.

Eigenvector centrality therefore became the primary measure of “influence” in the stock network.

Distribution of key network metrics when comparing individual quarters (IQP) and MSFT-focused variants (IQEP).
Figure 6:
Distribution of key network metrics when comparing individual quarters (IQP) and MSFT-focused variants (IQEP).

Minimum spanning tree and COVID-19

To study monthly return correlations I built a minimum spanning tree (MST) on the stock network. In normal periods MSFT is the hub of a technology cluster, but during the pandemic it moves to the periphery while crashed sectors (waste management, transportation, defence) take over the backbone.

Spanning tree built from positive return correlations prior to filtering.
Figure 7:
Spanning tree built from positive return correlations prior to filtering.
Spanning trees before, during, and after COVID-19. MSFT loses hub status during the pandemic but regains it post-crisis.
Figure 8:
Spanning trees before, during, and after COVID-19. MSFT loses hub status during the pandemic but regains it post-crisis.

The shift aligns with price data: MSFT’s correlations barely move (≈10% change) while vulnerable sectors swing by ~45%, placing them at the centre of the crisis-era MST.5 The resilience of MSFT and its clique reinforces the “safe stock” interpretation, echoing the company’s pandemic-era financial disclosures.6

2019 Q2 clustering allocation, highlighting partially disjoint groups that can be used to hedge MSFT exposure.
Figure 9:
2019 Q2 clustering allocation, highlighting partially disjoint groups that can be used to hedge MSFT exposure.

Recommendations for DBBA Capital

  1. Lean into the clique when conviction is high. MSFT is deeply intertwined with resilient tech and finance stocks.7 When you have a directional view on MSFT, extending positions to META, AMZN, GOOGL, JPM, and UNH keeps the portfolio aligned with shared-investor flows.
  2. Hedge by diversifying across clusters. If outlook is uncertain, use the MSFT ego analysis and the clustering view above to pick uncorrelated groups. This mirrors the approach advocated by Tola et al. for risk management via cluster-aware portfolios.8

MSFT’s persistence at the core of both investor holdings and return correlations makes it an attractive anchor position, provided the surrounding clique is monitored for structural breaks similar to those seen in 2020.

Footnotes

  1. Vincent D. Blondel, Jean-Loup Guillaume, Renaud Lambiotte, and Etienne Lefebvre, “Fast Unfolding of Communities in Large Networks,” Journal of Statistical Mechanics: Theory and Experiment 2008(10), P10008 (2008). https://doi.org/10.1088/1742-5468/2008/10/P10008

  2. Jennifer Golbeck, Introduction to Social Media Investigation: A Hands-on Approach, Syngress Publishing (2015).

  3. Mieszko Mazur, Man Dang, and Miguel Vega, “COVID-19 and the March 2020 Stock Market Crash. Evidence from S&P1500,” Finance Research Letters 38, 101690 (2021). https://doi.org/10.1016/j.frl.2020.101690

  4. Junlong Zhang and Yu Luo, “Degree Centrality, Betweenness Centrality, and Closeness Centrality in Social Network,” in Proceedings of the 2nd International Conference on Modelling, Simulation and Applied Mathematics (MSAM 2017), Atlantis Press (2017); Britta Ruhnau, “Eigenvector-centrality—a Node-centrality?” Social Networks 22(4), 357–365 (2000).

  5. Yahoo Finance, “Stock Data for Canadian National Railway (CNI), Comcast Corporation (CMCSA), and Waste Management, Inc. (WM),” accessed 31 Oct 2024. https://finance.yahoo.com

  6. Microsoft News, “Microsoft Cloud Strength Drives Fourth Quarter Results,” press release, 30 Jul 2024. https://news.microsoft.com/2024/07/30/microsoft-cloud-strength-drives-fourth-quarter-results-6/

  7. NASDAQ OMX Group, “NASDAQ Composite Index,” retrieved from FRED, Federal Reserve Bank of St. Louis (2024). https://fred.stlouisfed.org/series/NASDAQCOM

  8. Vincenzo Tola, Fabrizio Lillo, Mauro Gallegati, and Rosario N. Mantegna, “Cluster Analysis for Portfolio Optimization,” Journal of Economic Dynamics and Control 32(1), 235–258 (2008). https://doi.org/10.1016/j.jedc.2007.01.034

placeholder