3.8 KiB
3.8 KiB
| name | description |
|---|---|
| paper-parsing | Use when analyzing ML/DL papers to ensure comprehensive extraction of all relevant information |
Paper Parsing Methodology
Overview
Systematic approach to parsing ML/DL papers for replication. Emphasizes completeness and openness to avoid missing critical details.
Announce at start: "I'm using the paper-parsing skill to ensure comprehensive paper analysis."
Core Philosophy
- Completeness over speed: Better to extract too much than miss something
- Open-ended discovery: Papers contain unique insights; don't force into templates
- Cross-reference: Information appears in multiple places; cross-check
- Explicit uncertainty: Mark unclear items rather than guessing
Paper Sections Checklist
Abstract
- Core contribution identified
- Key results/numbers extracted
- Problem domain understood
Introduction
- Problem motivation clear
- Gap in existing work identified
- Proposed solution summarized
- Claimed contributions listed
Related Work
- Key prior methods identified
- Differences from this work noted
- Potential baselines for comparison
Method / Approach
- Architecture fully described
- All components identified
- Mathematical formulation complete
- Training procedure detailed
- Loss functions specified
- Hyperparameters listed
Experiments
- Datasets listed with sizes
- Evaluation metrics defined
- Baseline comparisons noted
- Ablation studies cataloged
- Key numerical results extracted
Appendix / Supplementary
- Additional implementation details
- Extended results
- Proofs or derivations
- Code references
Information Extraction Patterns
Architecture Details
Look for:
- Layer types and configurations
- Activation functions
- Normalization methods
- Attention mechanisms
- Skip connections
- Input/output dimensions
Common locations:
- Method section figures
- Architecture diagrams
- Table of hyperparameters
- Appendix implementation details
Training Configuration
| Parameter | Typical Locations |
|---|---|
| Learning rate | Experiments, Appendix |
| Batch size | Experiments, Appendix |
| Optimizer | Method, Appendix |
| Epochs | Experiments |
| Hardware | Experiments, Appendix |
| Training time | Experiments |
Numerical Results
Extract from:
- Main results tables
- Comparison figures
- Ablation tables
- Training curves (approximate values)
Format as:
| Metric | Dataset | Value | Conditions |
|---|---|---|---|
| Accuracy | CIFAR-10 | 95.2% | ResNet-50 backbone |
Common Omissions to Watch For
- Initialization: Often in appendix or not mentioned
- Data augmentation: May be standard but unspecified
- Early stopping criteria: Often implied
- Evaluation protocol: Train/val/test split details
- Random seeds: Reproducibility details
- Software versions: PyTorch, CUDA versions
Quality Verification
Before completing analysis:
- Coverage check: Every section reviewed?
- Consistency check: Numbers match across sections?
- Completeness check: Could someone implement from this?
- Ambiguity check: Unclear items marked?
Output Quality Markers
Good analysis:
- Specific numbers, not "good performance"
- Exact layer configs, not "standard ResNet"
- Explicit uncertainty markers
- Cross-references between sections
Poor analysis:
- Vague descriptions
- Missing hyperparameters
- No numerical targets
- Assumptions without noting them
Red Flags
If you notice:
- "Implementation details in code" → Check GitHub link
- "Standard settings" → Look up the standard
- "Following [citation]" → May need to read that paper
- Inconsistent numbers → Note the discrepancy