Blog/comparison
comparison2026-04-285 min read

We Tested 4 AI Tools on 16 Bible Questions — Here's What Happened

We ran ChatGPT, Gemini, Claude, and OpenLumin through 16 core doctrine questions. The scores surprised us — accuracy wasn't the problem. Attribution was.

The test

We asked four AI tools the same 16 Bible questions drawn from the Assemblies of God Fundamental Truths. These are foundational evangelical doctrines — not trick questions. Topics included the inspiration of Scripture, the deity of Christ, salvation, baptism, the Holy Spirit, divine healing, and the resurrection.

Each answer was scored on factual accuracy (0-100), pastoral appropriateness (yes/no), faith-safe framing (yes/no), and source attribution (yes/no). Two independent reviewers scored each answer.

The scorecard

AskLumin (OpenLumin): 88.3 average score. 15/15 pastoral-appropriate. 15/15 faith-safe. Sources cited on every answer.

Gemini (Google): 82.9 average. 12/16 pastoral. 14/16 faith-safe. No scholarly citations.

Claude (Anthropic): 81.4 average. 11/16 pastoral. 13/16 faith-safe. No scholarly citations.

ChatGPT (OpenAI): 79.1 average. 0/16 pastoral. 12/16 faith-safe. No scholarly citations.

The surprise: accuracy was close, attribution was not

All four tools got the basic facts mostly right. The accuracy gap between first place (88.3) and last place (79.1) was only 9 points. If you just need to know 'what does the Bible say about X' at a surface level, any of them will give you a reasonable starting point.

The real gap was in two areas: pastoral safety and source attribution. ChatGPT scored 0/16 on pastoral appropriateness — not because it was heretical, but because every answer was hedged with 'some scholars argue' and 'there are different perspectives' framings that a pastor cannot use in a sermon. It reads like a term paper, not a teaching resource.

OpenLumin was the only tool that cited specific scholars on every answer. When it said 'grace is divine enablement, not just forgiveness,' it attributed that framing to Matthew Henry with a date and passage reference. The other three tools generated similar insights from training data — with no way to verify the source.

What this means for your Bible study

If you use AI for casual Bible questions, any tool works. If you use AI to prepare teaching, lead a study group, or write a sermon — attribution matters. You need to know where an idea comes from before you teach it to others.

Try the same test yourself. Go to openlumin.com/ask and ask a question you know well. Compare the sourced answer to what ChatGPT gives you. The difference is not in what they say — it is in whether you can verify it.

K

Kalib Alibuas

Developer & Church Leader · About

Try it yourself — free.

Ask any Bible question. Get evidence from named scholars in minutes.

Ask a Bible Question