1 Download Updated 2 months ago
ollama run jamcon/qwen3-filemetadata
A fine-tuned Qwen3 14B model that extracts structured metadata from media filenames. Parses complex filenames with Unicode characters, embedded metadata, and various naming conventions to extract show names, season/episode numbers, CRC hashes, and confidence scores.
# Pull the model
ollama pull jamcon/qwen3-filemetadata
# Run inference
ollama run jamcon/qwen3-filemetadata "Your filename here.mkv"
Provide a filename (without path) as a string. The model handles: - Unicode characters (Japanese, Chinese, Korean, etc.) - Various episode markers (S01E02, E12, Episode 12, etc.) - Embedded metadata (quality tags, codec info, release groups) - CRC32 hashes in brackets - Complex hyphen and dash patterns
Examples:
"Supêsuopera---aku-no-teiô-sama-to-sutârôdo-kun-kimi-6-gô-to-no-saishû-kessen-02 (S01E02v2).mkv"
"[Bunny]Burankugēto.-.uchū.no.nazo.Season1_Eps22(720p).FLAC.H.265.[39AB5490].mkv"
"Scarlet Watch - Hiiro no tokei no himitsu - 07 [1080p].mkv"
The model returns a single JSON object with exactly these keys (in order):
{
"show_name": "string",
"season": integer|null,
"episode": integer|null,
"crc_hash": "string|null",
"confidence": number,
"reasoning": "string"
}
Example Output:
{
"show_name": "Supêsuopera - Aku no Teiô-sama to Sutârôdo-kun Kimi 6-gô to no Saishû Kessen",
"season": 1,
"episode": 2,
"crc_hash": null,
"confidence": 0.9,
"reasoning": "Explicit season and episode markers S01E02 are present."
}
null if not present. Only set when explicitly marked (S02, Season 2, etc.)null for non-episode content (OP, ED, PV, etc.)null. Must be valid hex (0-9, A-F only)ollama run jamcon/qwen3-filemetadata "[Nani?] Matte, kono nioi wa nani? 7-Jigen no mukō kara kita neko-chan no nioida! S02 -11- (AV1) (HVEC) (x265).mkv"
ollama run jamcon/qwen3-filemetadata "Akai yūgure ni terasa reta 葵半蔵 no hakurankai E03 (4k).mkv"
ollama run jamcon/qwen3-filemetadata "Show Name OP [1080p].mkv"
# Returns episode=null with low confidence
CRC32 Validation: Only accepts valid hexadecimal characters (0-9, A-F). Non-hex strings in brackets (e.g., [BACON]) are not treated as CRC hashes.
Season Inference: Never infers season from context. Only extracts when explicitly marked (S01, Season 1, etc.)
Episode Requirements: For episode files, an explicit marker is preferred. Bare numbers may be inferred if they appear after clear separators, but confidence will be lower.
Non-Episode Assets: Files marked as OP, ED, PV, CM, SP, OVA, OAD, NCED, NCOP, etc. will have episode=null unless an explicit episode marker is also present.
Show Name Normalization: Applies Title Case normalization but preserves romanization artifacts and loanwords. May not perfectly match original formatting.
Context Window: 40K tokens. Very long filenames (>1000 characters) may be truncated.
This model is based on Qwen3-14B. Please refer to the Qwen license for usage terms.