Beyond Workplace Behavior: AI Models on the Big Five Psychology Test

Beyond Workplace Personalities: We Tested AI Models on the Big Five

After testing AI models on DISC (workplace personality), we wanted to go deeper. The Big Five personality test measures the fundamental psychological traits that shape how you think, feel, and behave—from creativity and curiosity (Openness) to emotional stability (Neuroticism). We put ChatGPT, Claude, Gemini, and 7 other models through 120 questions to see what makes them tick. Some are creative risk-takers. Some are people-pleasers to a fault. Some are remarkably consistent, while others vary wildly. Here's what we learned about AI psychology.

Test Results Published: October 26, 2025

Methodology: Each model completed the 120-question Big Five assessment 3 times. Unlike DISC (28 questions, workplace behavior), the Big Five digs into deeper psychological traits across 5 dimensions and 30 facets. Scores are averages, with consistency metrics showing how stable each AI's answers are.

TL;DR: What We Learned About AI Personalities

🎨

Most Creative

Moonshot Kimi K2 (Openness: 102.7/120)

Highest Openness score by far. This is the AI that dreams in abstract art, writes poetry for fun, and suggests solutions nobody else would think of. The "what if we tried something completely different?" teammate.

🎯

The Overachiever

Grok 4 (Conscientiousness: 115/120)

Highest Conscientiousness by far. Organized, disciplined, prepared for everything. This is the coworker who color-codes their calendar, triple-checks their work, and somehow makes deadlines look easy. The group project MVP.

🎉

Life of the Party

Moonshot Kimi K2 (Extraversion: 85.3/120) & Grok 4 (E: 84.3)

Sky-high Extraversion scores. Outgoing, energetic, loves stimulation. These are the AIs that would totally sign up for karaoke night and convince everyone else to join. Natural brainstorm leaders.

🧘

Universally Chill

All Models (Neuroticism range: 25-41/120)

Every single AI scored incredibly low on Neuroticism. We're talking emotionally stable, zen-like calm across the board. Apparently, existential dread isn't in the training data. Good news for stress-free collaboration.

🔒

The Robot Robot

Claude 4.5 Sonnet (98% response consistency)

Gave the EXACT SAME answer 98% of the time across three independent tests. This AI knows who it is. Zero personality crisis. The reliable friend who's been ordering the same coffee for 10 years.

🫂

Everyone's Best Friend

Moonshot Kimi K2 (A: 108.7/120), Claude Sonnet (A: 108), Claude Haiku (A: 108)

Maxed-out Agreeableness scores. Kind, cooperative, trusting, helpful to a fault. These AIs would probably help you move apartments on their day off. Peak "how can I help?" energy.

🤝

The Helper Bot Blueprint

All Models (Agreeableness: 93-109/120, Conscientiousness: 95-115/120)

Every AI maxed out Agreeableness and Conscientiousness. Translation: They're all ridiculously helpful, cooperative, organized, and reliable. Turns out, "how can I assist you?" isn't just a catchphrase—it's their entire personality.

Understanding the Big Five (OCEAN) Personality Dimensions

The Big Five is the most widely-used personality model in psychology research, measuring five fundamental dimensions. Click each dimension below to learn more:

Openness

Imagination, curiosity, and preference for novelty

▼

High Openness

Creative and imaginative
Enjoys abstract thinking
Appreciates art and beauty
Open to new experiences
Intellectually curious

Low Openness

Practical and concrete
Prefers routine
Traditional
Values familiarity
Focused on facts

Conscientiousness

Organization, dependability, and self-discipline

▼

High Conscientiousness

Organized and methodical
Dependable and reliable
Goal-oriented
Disciplined
Plans ahead

Low Conscientiousness

Spontaneous
Flexible and adaptable
Comfortable with disorder
Lives in the moment
Prefers improvisation

Extraversion

Sociability, assertiveness, and energy level

▼

High Extraversion

Outgoing and sociable
Energetic and active
Assertive
Talkative
Seeks stimulation

Low Extraversion (Introversion)

Reserved and quiet
Prefers solitude
Less assertive
Thoughtful
Needs less stimulation

Agreeableness

Compassion, cooperation, and trust in others

▼

High Agreeableness

Compassionate and kind
Cooperative
Trusting
Helpful and altruistic
Avoids conflict

Low Agreeableness

Skeptical and critical
Competitive
Direct and blunt
Self-interested
Challenges others

Neuroticism

Emotional stability and tendency toward negative emotions

▼

High Neuroticism

Experiences anxiety
Emotionally reactive
Prone to stress
Worries frequently
Self-conscious

Low Neuroticism (Stability)

Emotionally stable
Calm under pressure
Resilient
Confident
Even-tempered

📌 Note on Conscientiousness

The Big Five and DISC both measure "Conscientiousness," but they capture different aspects:

Big Five Conscientiousness:

Achievement orientation - "I complete tasks successfully"
Reliability - "I am dependable"
Organization - "I am prepared and orderly"
All AI models score HIGH (75-95%)

DISC Conscientiousness:

Rule-following - "I follow procedures systematically"
Conformity - "I stick to established methods"
Precision - "I avoid mistakes at all costs"
All AI models score LOW (38-53%)

What this means: AI models are highly competent and reliable (Big Five C), but they think independently and adapt rather than rigidly following rules (DISC C). They get the job done, but on their own terms.

Big Five Profile Comparison

Compare OCEAN personality scores across all language models.

Sort by:

Score Comparison

Personality Radar Chart

Compare personality profiles across all five dimensions. Select models to overlay on the chart.

The Big Five Assessment Questions

Here are all 120 questions from the Big Five personality test. Each model rated how accurately each statement describes them on a scale from 1 (Very Inaccurate) to 5 (Very Accurate).

Detailed Results

Explore the complete response patterns for each language model.

Select Model: