Whitepaper: Cluster Analyses for Patients with ASD

By: RethinkFirst

    •    Reading time: 2 min

Published: Sep 13, 2023
Dendrogram demonstrating various levels at which patients can be grouped into a cluster

About this Whitepaper

Researchers identifying patient profiles for autistic individuals typically find between 2-7 distinct groups. Here, we show how using a larger sample size and a broader set of clinically relevant variables allows us to identify more fine-grained clusters that better capture the spectrum nature of autism spectrum disorders. In turn, techniques that identify more diverse clusters may better inform precision behavioral healthcare initiatives.

Cluster Combinations

In this study, we systematically analyzed clustering results from 48 combinations of:

Four sample sizes

  • 40
  • 395
  • 3948
  • 39475

Three sets of clinically relevant variables

  • 7 medical/diagnostic features
  • 31 behavioral features
  • 50 total features

Four clustering algorithms

  • agglomerative hierarchical
  • BIRCH
  • DBSCAN
  • k-means

Clusters identified ranged 2-to-100 with a median of eight and average of 20. Increasing the sample size led to:

  • No change in clusters identified (behavioral features)
  • An increase in the number of clusters identified (medical/diagnostic features)
  • Influenced clusters dependent on the algorithm (all features)

Download Whitepaper

Influence of Sample Size, Feature Set, and Algorithm on Cluster Analyses for Patients with Autism Spectrum Disorders Researchers

On Average

The greatest number and most well-defined clusters were identified with the medical/diagnostic features (58).

The fewest clusters were identified using behavioral features (6).

Lastly, on average, fewer clusters were identified using the BIRCH (18) and DBSCAN (15) algorithms than agglomerative hierarchical (24) and k-means algorithms (25).

In total, this study suggests that the patient sample size, specific feature set used, and the algorithm chosen for clustering will influence the number of clusters identified. The “right” number of clusters likely depends on how the information obtained through clustering analyses are practically used in clinical contexts. Download the Whitepaper

Share with your community

Facebook
X
LinkedIn
Sign up for our Newsletter

Subscribe to our monthly newsletter on the latest industry updates, Rethink happenings, and resources galore.

Latest Resources

Article

Self-advocacy is the ability to understand and communicate one’s needs, make informed decisions, and take...

Podcast

About this Podcast Episode On this episode, Angela and Kristin talk with a guest, Kimbyr,...

Article

Workplace diversity, equity, and inclusion (DE&I) efforts have traditionally focused on race, gender, and ethnicity....

Learn more about Rethink

The leading behavioral and mental health enterprise platform to support working parents, caregivers and their families.

Award-winning solutions empower districts and their educators to improve outcomes and wellness for all students and to build healthy and safe learning environments.

Fully integrated workflow automation and evidenced-based clinical tools help behavioral health organizations optimize outcomes and operations.

A payor-centric platform to optimize dosage and levels of care, streamline care management, activate and support care givers, and improve networks.