pope-learning-reason-hard
Apply the POPE (Privileged On-Policy Exploration) technique to solve hard reasoning problems by decomposing them with oracle-guided prefixes and transferring learned reasoning back to unguided attempts. Use when: 'help me solve this hard problem step by step', 'I'm stuck on this complex algorithm', 'break down this difficult reasoning task', 'guide me through this math/logic problem', 'use privileged hints to bootstrap a solution', 'scaffold a hard problem with partial solutions'.
Installation and usage
Apply the POPE (Privileged On-Policy Exploration) technique to solve hard reasoning problems by decomposing them with oracle-guided prefixes and transferring learned reasoning back to unguided attempts. Use when: 'help me solve this hard problem step by step', 'I'm stuck on this complex algorithm', 'break down this difficult reasoning task', 'guide me through this math/logic problem', 'use privileged hints to bootstrap a solution', 'scaffold a hard problem with partial solutions'.
설치 후 터미널에서 다음 명령을 실행하여 이 스킬을 사용할 수 있습니다:
skills use pope-learning-reason-hard