home/categories/philosophy-ethics/ndpvt-web-arxiv-claude-skills-skills-pope-learning-reason-hard-skill-md
philosophy-ethicslifestyle

pope-learning-reason-hard

Apply the POPE (Privileged On-Policy Exploration) technique to solve hard reasoning problems by decomposing them with oracle-guided prefixes and transferring learned reasoning back to unguided attempts. Use when: 'help me solve this hard problem step by step', 'I'm stuck on this complex algorithm', 'break down this difficult reasoning task', 'guide me through this math/logic problem', 'use privileged hints to bootstrap a solution', 'scaffold a hard problem with partial solutions'.

ndpvt-web
maintainer
ndpvt-web
更新于 2/13/2026
星标
2
分支
0
quick start

Installation and usage

Apply the POPE (Privileged On-Policy Exploration) technique to solve hard reasoning problems by decomposing them with oracle-guided prefixes and transferring learned reasoning back to unguided attempts. Use when: 'help me solve this hard problem step by step', 'I'm stuck on this complex algorithm', 'break down this difficult reasoning task', 'guide me through this math/logic problem', 'use privileged hints to bootstrap a solution', 'scaffold a hard problem with partial solutions'.

安装
$ install --globalskills.sh
使用

安装后,您可以通过在终端运行以下命令来使用此技能:

skills use pope-learning-reason-hard