hc 5136723d62 feat(skills): add paper-parsing skill

2026-03-31 17:38:17 +08:00

3.8 KiB

Raw Blame History

name	description
paper-parsing	Use when analyzing ML/DL papers to ensure comprehensive extraction of all relevant information

Paper Parsing Methodology

Overview

Systematic approach to parsing ML/DL papers for replication. Emphasizes completeness and openness to avoid missing critical details.

Announce at start: "I'm using the paper-parsing skill to ensure comprehensive paper analysis."

Core Philosophy

Completeness over speed: Better to extract too much than miss something
Open-ended discovery: Papers contain unique insights; don't force into templates
Cross-reference: Information appears in multiple places; cross-check
Explicit uncertainty: Mark unclear items rather than guessing

Paper Sections Checklist

Abstract

Core contribution identified
Key results/numbers extracted
Problem domain understood

Introduction

Problem motivation clear
Gap in existing work identified
Proposed solution summarized
Claimed contributions listed

Key prior methods identified
Differences from this work noted
Potential baselines for comparison

Method / Approach

Architecture fully described
All components identified
Mathematical formulation complete
Training procedure detailed
Loss functions specified
Hyperparameters listed

Experiments

Datasets listed with sizes
Evaluation metrics defined
Baseline comparisons noted
Ablation studies cataloged
Key numerical results extracted

Appendix / Supplementary

Additional implementation details
Extended results
Proofs or derivations
Code references

Information Extraction Patterns

Architecture Details

Look for:

Layer types and configurations
Activation functions
Normalization methods
Attention mechanisms
Skip connections
Input/output dimensions

Common locations:

Method section figures
Architecture diagrams
Table of hyperparameters
Appendix implementation details

Training Configuration

Parameter	Typical Locations
Learning rate	Experiments, Appendix
Batch size	Experiments, Appendix
Optimizer	Method, Appendix
Epochs	Experiments
Hardware	Experiments, Appendix
Training time	Experiments

Numerical Results

Extract from:

Main results tables
Comparison figures
Ablation tables
Training curves (approximate values)

Format as:

Metric	Dataset	Value	Conditions
Accuracy	CIFAR-10	95.2%	ResNet-50 backbone

Common Omissions to Watch For

Initialization: Often in appendix or not mentioned
Data augmentation: May be standard but unspecified
Early stopping criteria: Often implied
Evaluation protocol: Train/val/test split details
Random seeds: Reproducibility details
Software versions: PyTorch, CUDA versions

Quality Verification

Before completing analysis:

Coverage check: Every section reviewed?
Consistency check: Numbers match across sections?
Completeness check: Could someone implement from this?
Ambiguity check: Unclear items marked?

Output Quality Markers

Good analysis:

Specific numbers, not "good performance"
Exact layer configs, not "standard ResNet"
Explicit uncertainty markers
Cross-references between sections

Poor analysis:

Vague descriptions
Missing hyperparameters
No numerical targets
Assumptions without noting them

Red Flags

If you notice:

"Implementation details in code" → Check GitHub link
"Standard settings" → Look up the standard
"Following [citation]" → May need to read that paper
Inconsistent numbers → Note the discrepancy

3.8 KiB Raw Blame History