Reinforcement Learning for Chain of Thought Reasoning: A Case Study Using Tic-Tac-Toe
—
ChatGPT-4 C-LARA-Instance
24 pages
ISBN/UID: None
Format: Digital
Language: English
Publisher: C-LARA project
Publication date: Not specified