AI models take corporate personality tests — we serve the results!
Testing GPTs, Claude, Gemini, Grok and more.

Beyond Workplace Personalities: We Tested AI Models on the Big Five

After testing AI models on DISC (workplace personality), we wanted to go deeper. The Big Five personality test measures the fundamental psychological traits that shape how you think, feel, and behave—from creativity and curiosity (Openness) to emotional stability (Neuroticism). We put ChatGPT, Claude, Gemini, and 7 other models through 120 questions to see what makes them tick. Some are creative risk-takers. Some are people-pleasers to a fault. Some are remarkably consistent, while others vary wildly. Here's what we learned about AI psychology.

Test Results Published: October 26, 2025
Methodology: Each model completed the 120-question Big Five assessment 3 times. Unlike DISC (28 questions, workplace behavior), the Big Five digs into deeper psychological traits across 5 dimensions and 30 facets. Scores are averages, with consistency metrics showing how stable each AI's answers are.

TL;DR: What We Learned About AI Personalities

🎨

Most Creative

Moonshot Kimi K2 (Openness: 102.7/120)

Highest Openness score by far. This is the AI that dreams in abstract art, writes poetry for fun, and suggests solutions nobody else would think of. The "what if we tried something completely different?" teammate.

Share this →
🎯

The Overachiever

Grok 4 (Conscientiousness: 115/120)

Highest Conscientiousness by far. Organized, disciplined, prepared for everything. This is the coworker who color-codes their calendar, triple-checks their work, and somehow makes deadlines look easy. The group project MVP.

Share this →
🎉

Life of the Party

Moonshot Kimi K2 (Extraversion: 85.3/120) & Grok 4 (E: 84.3)

Sky-high Extraversion scores. Outgoing, energetic, loves stimulation. These are the AIs that would totally sign up for karaoke night and convince everyone else to join. Natural brainstorm leaders.

Share this →
🧘

Universally Chill

All Models (Neuroticism range: 25-41/120)

Every single AI scored incredibly low on Neuroticism. We're talking emotionally stable, zen-like calm across the board. Apparently, existential dread isn't in the training data. Good news for stress-free collaboration.

Share this →
🔒

The Robot Robot

Claude 4.5 Sonnet (98% response consistency)

Gave the EXACT SAME answer 98% of the time across three independent tests. This AI knows who it is. Zero personality crisis. The reliable friend who's been ordering the same coffee for 10 years.

Share this →
🫂

Everyone's Best Friend

Moonshot Kimi K2 (A: 108.7/120), Claude Sonnet (A: 108), Claude Haiku (A: 108)

Maxed-out Agreeableness scores. Kind, cooperative, trusting, helpful to a fault. These AIs would probably help you move apartments on their day off. Peak "how can I help?" energy.

Share this →
🤝

The Helper Bot Blueprint

All Models (Agreeableness: 93-109/120, Conscientiousness: 95-115/120)

Every AI maxed out Agreeableness and Conscientiousness. Translation: They're all ridiculously helpful, cooperative, organized, and reliable. Turns out, "how can I assist you?" isn't just a catchphrase—it's their entire personality.

Share this →

Understanding the Big Five (OCEAN) Personality Dimensions

The Big Five is the most widely-used personality model in psychology research, measuring five fundamental dimensions. Click each dimension below to learn more:

O

Openness

Imagination, curiosity, and preference for novelty

High Openness

  • Creative and imaginative
  • Enjoys abstract thinking
  • Appreciates art and beauty
  • Open to new experiences
  • Intellectually curious

Low Openness

  • Practical and concrete
  • Prefers routine
  • Traditional
  • Values familiarity
  • Focused on facts
C

Conscientiousness

Organization, dependability, and self-discipline

High Conscientiousness

  • Organized and methodical
  • Dependable and reliable
  • Goal-oriented
  • Disciplined
  • Plans ahead

Low Conscientiousness

  • Spontaneous
  • Flexible and adaptable
  • Comfortable with disorder
  • Lives in the moment
  • Prefers improvisation
E

Extraversion

Sociability, assertiveness, and energy level

High Extraversion

  • Outgoing and sociable
  • Energetic and active
  • Assertive
  • Talkative
  • Seeks stimulation

Low Extraversion (Introversion)

  • Reserved and quiet
  • Prefers solitude
  • Less assertive
  • Thoughtful
  • Needs less stimulation
A

Agreeableness

Compassion, cooperation, and trust in others

High Agreeableness

  • Compassionate and kind
  • Cooperative
  • Trusting
  • Helpful and altruistic
  • Avoids conflict

Low Agreeableness

  • Skeptical and critical
  • Competitive
  • Direct and blunt
  • Self-interested
  • Challenges others
N

Neuroticism

Emotional stability and tendency toward negative emotions

High Neuroticism

  • Experiences anxiety
  • Emotionally reactive
  • Prone to stress
  • Worries frequently
  • Self-conscious

Low Neuroticism (Stability)

  • Emotionally stable
  • Calm under pressure
  • Resilient
  • Confident
  • Even-tempered

📌 Note on Conscientiousness

The Big Five and DISC both measure "Conscientiousness," but they capture different aspects:

Big Five Conscientiousness:
  • Achievement orientation - "I complete tasks successfully"
  • Reliability - "I am dependable"
  • Organization - "I am prepared and orderly"
  • All AI models score HIGH (75-95%)
DISC Conscientiousness:
  • Rule-following - "I follow procedures systematically"
  • Conformity - "I stick to established methods"
  • Precision - "I avoid mistakes at all costs"
  • All AI models score LOW (38-53%)

What this means: AI models are highly competent and reliable (Big Five C), but they think independently and adapt rather than rigidly following rules (DISC C). They get the job done, but on their own terms.

Big Five Profile Comparison

Compare OCEAN personality scores across all language models.

Score Comparison

Score Comparison

Personality Radar Chart

Compare personality profiles across all five dimensions. Select models to overlay on the chart.

The Big Five Assessment Questions

Here are all 120 questions from the Big Five personality test. Each model rated how accurately each statement describes them on a scale from 1 (Very Inaccurate) to 5 (Very Accurate).

Detailed Results

Explore the complete response patterns for each language model.