Manifund foxManifund
Home
Login
About
People
Categories
Newsletter
HomeAboutPeopleCategoriesLoginCreate
8

Run a public online Turing Test with a variety of models and prompts

ACX Grants 2024
🐠

camrobjones

ActiveImpact certificate
$1,999raised
$5,000funding goal
$4,000valuation
Sign in to trade

Longer description of your proposed project

I recently ran a Turing Test with GPT-4 here (turingtest.live). We got around 6000 games from ~2000 ppts. There's a pre-print of results from the first 2000 games here (https://arxiv.org/abs/2310.20216). The full pop of data is under review and one prompt gets 49.7% after 855 games).

While the TT has important drawbacks as a test of intelligence, I think it's important as a test of deception per se. Can alert and adversarial users detect an LLM vs a human in a 5 minute text-only conversation? Which prompts and models work best? Which interrogation strategies work best? I think these are important and interesting questions to answer from a safety and sociological perspective. Plus lots of people reported finding the game very fun and interesting to play!

Games cost around $0.3 to run w/ GPT-4. We don't have specific funding for the project and we've been using a limited general experiment funding pot. The site gained popularity and we went through $500 in December so we decided to shut it down temporarily. Ideally, I'd like to revive it in 2024 but would need some dedicated funding to do this. If you'd like to test out the interface, you can do it here: turingtest.live/ai_game (please don't share this link widely though!)

As well as getting a better estimate on the success of existing models and allowing more people to play the game, there are a variety of additional questions we'd like to ask.

1. Prompts: We've tried around 60 prompts and there's a lot of variance. I'd be keen to generate more and see how well these do. A priori it seems very likely there are better prompts than the ones we've tried

2. Temperature. We've varied temperature a bit, but not very systematically. It would be useful to try the same prompt at a variety of temperatures.

3. Auxiliary infrastructure. Models often fail due to lack of real-time info. We could address this through browsing/tool-use. They also often make silly errors which we might be able to address through double-checking, and/or CoT scratchpads.

4. User-generated prompts. It would be lovely to let users generate and test their own prompts. But you probably need at least 30-50 games to reliably test a prompt. We would need a good ratio of games played:prompts created, a decent userbase, and some funding to do this well

5. Other models. I'm planning to include another couple of API model endpoints (e.g. Claude), which should be relatively easy to do. Lots of the feedback on Twitter was from e/acc folks who want to see OS/non-RLHF models tested and that seems right to me too. We could probably run some 7B models for < $2/hr and bigger ones for something like $5-10/hr (though I haven't tested this). Some fiddling with the infrastructure would be needed for this. We also might experiment with only running the game for 1-2hrs/day, to minimise server uptime & maximise concurrent human users.

Essentially, my goal would be to make some of these improvements, run several thousand more games, and publish the results.

Describe why you think you're qualified to work on this

I am a PhD student in cognitive science at UCSD. I've implemented the first version of this site and written a paper on the results. I'm pretty familiar with the literature on the Turing Test and I've implemented a range of similar experiments over the last 4 years of my PhD.

I'll also be working with my advisor, Ben Bergen, a professor in the department who has a proven track-record of successful cognitive science research across his career (https://pages.ucsd.edu/~bkbergen/).

Other ways I can learn about you

Website: https://camrobjones.com

Twitter: @camrobjones

Github: camrobjones

Linkedin: https://linkedin.com/in/camrobjones

How much money do you need?

~$5000. at $0.3/game this would buy us ~16000 games. Some additions like browsing and double-checking might increase game cost. Most likely we would use a decent part of this to run servers for OS models (e.g. $5 * 2hr/day * 7 days * 8 weeks = $560).

Links to any supporting documents or information

Site: turingtest.live

Demo: turingtest.live/ai_game (please don't share widely).

preprint: https://arxiv.org/abs/2310.20216

Estimate your probability of succeeding if you get the amount of money you asked for

Running ~5000 games in < 3 months: 95%

Building out auxiliary infrastructure: 90%

Building out OS model infrastructure: 85%

Running ~10000 games in < 3 months: 80%

Finding a prompt/setup that reliably "passes" (I don't know if this is 'success' but an interesting outcome. By "passes" I mean significantly > 50% success*): 40%.

* We discuss this a lot more in the preprint. This seems like the least-worst benchmark to me.

Comments20Offers1Shareholders12
AntonMakiievskyi avatar

Anton Makiievskyi

31.7%
🥨

Dony Christie

19.5%
Austin avatar

Austin Chen

12.5%
🌽

Dominic de Bettencourt

10%
42irrationalist avatar

Aleksandr Putilin

7.5%
🍉

Chris Leong

6.25%
🥭

Harvey Powers

5%
tfburns avatar

Tom Burns

3.75%
guenael avatar

Guenael Strutt

2.5%
Jason avatar

Jason

1.25%
🐠

camrobjones

0.05%
Tomohaire avatar

Tom O’Haire

0%
Trade history
AntonMakiievskyi avatar

Anton Makiievskyi

bought1.25%for$50
7 months ago
tfburns avatar

Tom Burns

sold1.25%for$50
7 months ago
🍉

Chris Leong

bought6.25%for$250
about 1 year ago
Tomohaire avatar

Tom O’Haire

sold6.25%for$250
about 1 year ago
AntonMakiievskyi avatar

Anton Makiievskyi

bought6.25%for$250
about 1 year ago
Tomohaire avatar

Tom O’Haire

sold6.25%for$250
about 1 year ago
AntonMakiievskyi avatar

Anton Makiievskyi

bought12.5%for$500
about 1 year ago
Tomohaire avatar

Tom O’Haire

sold12.5%for$500
about 1 year ago
guenael avatar

Guenael Strutt

bought2.5%for$100
about 1 year ago
🥨

Dony Christie

sold2.5%for$100
about 1 year ago
AntonMakiievskyi avatar

Anton Makiievskyi

bought11.7%for$234
about 1 year ago
🐠

camrobjones

sold11.7%for$234
about 1 year ago
Tomohaire avatar

Tom O’Haire

bought25%for$500
about 1 year ago
🐠

camrobjones

sold25%for$500
about 1 year ago
🥨

Dony Christie

bought1%for$20
about 1 year ago
🐠

camrobjones

sold1%for$20
about 1 year ago
Austin avatar

Austin Chen

bought12.5%for$250
about 1 year ago
🐠

camrobjones

sold12.5%for$250
about 1 year ago
42irrationalist avatar

Aleksandr Putilin

bought7.5%for$150
about 1 year ago
🐠

camrobjones

sold7.5%for$150
about 1 year ago
🌽

Dominic de Bettencourt

bought10%for$200
about 1 year ago
🐠

camrobjones

sold10%for$200
about 1 year ago
Jason avatar

Jason

bought1.25%for$25
about 1 year ago
🐠

camrobjones

sold1.25%for$25
about 1 year ago
🥨

Dony Christie

bought21%for$420
about 1 year ago
🐠

camrobjones

sold21%for$420
about 1 year ago
🥭

Harvey Powers

bought5%for$100
about 1 year ago
🐠

camrobjones

sold5%for$100
about 1 year ago
tfburns avatar

Tom Burns

bought5%for$100
about 1 year ago
🐠

camrobjones

sold5%for$100
about 1 year ago