Create bibliography citations from web page URLs with automatic Wayback Machine archival and metadata extraction. Use when the user asks to cite a website, create a citation for a URL, archive and cite a web page, or generate a bibliography entry from a web address.
View on GitHubSelect agents to install to:
npx add-skill https://github.com/Mearman/marketplace/blob/main/plugins/bib/skills/bib-cite-web/SKILL.md -a claude-code --skill bib-cite-webInstallation paths:
.claude/skills/bib-cite-web/# Web Page Citation Creator
Create bibliography citations from web page URLs with automatic archival snapshot and metadata extraction.
## Features
- **Wayback Machine Integration**: Automatically submits URLs to the Internet Archive for preservation
- **Metadata Extraction**: Extracts title, author, description, site name, and publish date from semantic HTML
- **Multiple Formats**: Outputs citations in BibTeX or CSL JSON format
- **Smart Citation Keys**: Generates citation keys from domain + author + year
## Usage
```bash
npx tsx plugins/bib/scripts/cite-web.ts <url>
npx tsx plugins/bib/scripts/cite-web.ts <url> --format=bibtex
npx tsx plugins/bib/scripts/cite-web.ts <url> --no-wayback
npx tsx plugins/bib/scripts/cite-web.ts <url> --output=citations.bib
```
## Metadata Extraction
The script extracts metadata from semantic HTML tags:
### Title
- `<title>` tag
- Open Graph: `<meta property="og:title">`
- Twitter Card: `<meta name="twitter:title">`
- Standard: `<meta name="title">`
### Author
- `<meta name="author">`
- Open Graph: `<meta property="og:author">` or `<meta property="article:author">`
- Twitter Card: `<meta name="twitter:creator">`
### Description
- `<meta name="description">`
- Open Graph: `<meta property="og:description">`
- Twitter Card: `<meta name="twitter:description">`
### Site Name
- Open Graph: `<meta property="og:site_name">`
- `<meta name="application-name">`
### Published Date
- Open Graph: `<meta property="article:published_time">`
- `<meta name="publish-date">` or `<meta name="date">`
## Arguments
- **Positional argument**: URL to cite
- **`--file <path>`**: Read URL from file (uses first line)
- **`--format <format>`**: Output format (default: bibtex)
- `bibtex` or `bib`: BibTeX format
- `csl`, `json`, or `csl-json`: CSL JSON format
- **`--no-wayback`**: Skip Wayback Machine submission (faster, but no archive)
- **`--output <file>`**: Write output to file (default: stdout)
## Output Formats
### BibTeX
```bibtex
@online{s