
pope-learning-reason-hard

Apply the POPE (Privileged On-Policy Exploration) technique to solve hard reasoning problems by decomposing them with oracle-guided prefixes and transferring learned reasoning back to unguided attempts. Use when: 'help me solve this hard problem step by step', 'I'm stuck on this complex algorithm', 'break down this difficult reasoning task', 'guide me through this math/logic problem', 'use privileged hints to bootstrap a solution', 'scaffold a hard problem with partial solutions'.
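
The description above mentions oracle-guided prefixes and a transfer back to unguided attempts. The following is a minimal sketch of that idea, not the skill's actual implementation: it assumes a reference solution is available as a list of steps, and the names `Problem`, `guided_prompt`, and `prompt_schedule` are hypothetical. It builds a series of prompts that reveal progressively less of the reference solution, ending with the bare question.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Problem:
    # Hypothetical container: a question plus a reference ("oracle") solution
    # that has already been split into ordered steps.
    question: str
    oracle_steps: List[str]


def guided_prompt(problem: Problem, fraction: float) -> str:
    """Return the question with the first `fraction` of oracle steps as a hint prefix."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    # Number of oracle steps to reveal, rounded down.
    k = int(len(problem.oracle_steps) * fraction)
    prefix = problem.oracle_steps[:k]
    if not prefix:
        # No guidance: this is the unguided attempt.
        return problem.question
    hint = "\n".join(prefix)
    return f"{problem.question}\n\nPartial solution:\n{hint}\n\nContinue from here."


def prompt_schedule(problem: Problem,
                    fractions: Sequence[float] = (0.5, 0.25, 0.0)) -> List[str]:
    """Build prompts from most guided to unguided (assumed schedule, for illustration)."""
    return [guided_prompt(problem, f) for f in fractions]


if __name__ == "__main__":
    demo = Problem(
        question="Show that the sum of the first n odd numbers is n^2.",
        oracle_steps=[
            "Write the k-th odd number as 2k - 1.",
            "Sum 2k - 1 for k = 1..n.",
            "Split the sum into 2 * sum(k) - n.",
            "Use sum(k) = n(n + 1) / 2 and simplify.",
        ],
    )
    for prompt in prompt_schedule(demo):
        print(prompt)
        print("-" * 40)
```

The decreasing schedule is one design choice among several; the fractions and the prompt wording here are placeholders to adapt to the problem at hand.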

Maintainer: ndpvt-web
Updated: 2/13/2026
Stars: 2
Forks: 0
Quick start

Installation and usage

Installation
$ install --globalskills.sh
Usage

After installation, run the following command in your terminal to use this skill:

skills use pope-learning-reason-hard