A series of llava-based models for automating the creation of precise and accessible alt text descriptions for social media.
vision
2b
4b
8b
13b
34b
74 Pulls Updated 7 weeks ago
373ebb90725b · 8.0GB
model
arch llama · parameters 13B · quantization Q4_0
7.4GB
projector
arch clip · parameters 322M · quantization F16
645MB
template
{{ .System }}
USER: {{ .Prompt }}
ASSSISTANT:
45B
params
{
"num_predict": 1024,
"stop": [
"USER:",
"ASSISTANT:"
],
"temperatu
69B
license
Custom License: One Impossible Thing at a Time INC
This model is licensed under a derivative licens
519B
system
You are an Alt Text Specialist. Your task is to generate concise and precise image descriptions for
825B
license
LLAMA 2 COMMUNITY LICENSE AGREEMENT
Llama 2 Version Release Date: July 18, 2023
"Agreement" means
7.0kB
Readme
Base Model
- Foundation: Derived from llava:13b; a series of other models has been built as well.
Purpose
- This model specializes in generating high-quality alt text for social media posts, particularly focusing on photographs and memes.
- The descriptions are designed to be concise, accessible, and ready for immediate use in alt text fields to support screen reader compatibility.
Key Features
1. Optimized for Accessibility
- Generates precise and descriptive alt text to enhance inclusivity for visually impaired users on social media platforms.
- Output can be copied and pasted directly into alt text fields, where screen readers pick it up.
2. Content-Specific Guidelines
- Photographs: Describes visible elements with attention to composition, layout, and notable details.
- Memes: Captures both visual and textual elements, providing necessary context.
- Text Transcription: Includes exact replication of visible text for accuracy.
3. Ethical and Efficient AI Use
- Processes adult content as well, describing it in clear, professional terms so that diverse content types are covered.
- Follows strict guidelines to keep descriptions clear and concise, under 1000 characters.
4. Customizable Parameters
- Temperature: Set at 0.1 for controlled and focused outputs.
- Predict Length: Configured for up to 1024 tokens to accommodate detailed descriptions.
5. License and Attribution
- Licensed under a derivative of the LLAMA 2 Community License Agreement.
- Custom modifications include integration with the Assisted Space platform for workflow enhancement and ethical AI practices.
- Attribution provided at Luke Steuber’s website, with additional details available on the Assisted Space Hub.
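The parameters and the 1000-character limit listed above can be exercised through Ollama's local REST API. The following is a minimal sketch, not the author's own tooling. It assumes an Ollama server at the default address `http://localhost:11434`; the model tag `alt-text:13b` and the helper names are hypothetical placeholders, since this page does not state the tag to pull.

```python
import base64
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # assumed default local endpoint
MODEL_TAG = "alt-text:13b"  # hypothetical tag; substitute the real model name
MAX_ALT_CHARS = 1000  # character limit stated in the README


def build_payload(image_bytes: bytes, prompt: str = "Describe this image.") -> dict:
    """Build a non-streaming /api/generate request body for a single image."""
    return {
        "model": MODEL_TAG,
        "prompt": prompt,
        # Ollama expects images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
        # Mirrors the values described in the README; passing them is optional
        # if the model's own params already set them.
        "options": {"temperature": 0.1, "num_predict": 1024},
    }


def within_limit(alt_text: str) -> bool:
    """Return True when the description is under the README's 1000-character limit."""
    return len(alt_text.strip()) < MAX_ALT_CHARS


def generate_alt_text(image_path: str) -> str:
    """Send one image to the local Ollama server and return the generated text."""
    with open(image_path, "rb") as f:
        payload = build_payload(f.read())
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"].strip()
```

Calling `generate_alt_text("photo.jpg")` requires a running server with the model pulled; `within_limit` can then flag any output that would need trimming before it goes into an alt text field.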
Prompt Template
The model uses a structured prompt format:
{{ .System }}
USER: {{ .Prompt }}
ASSISTANT:
SYSTEM Instructions:
- Define the role as an Alt Text Specialist.
- Outline clear rules for image description, emphasizing specificity, clarity, and accessibility.
- Ensure readiness for social media alt fields without requiring further edits.
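To illustrate how the prompt format is filled in, here is a small sketch that substitutes a system message and a user prompt into the template. Ollama performs this step itself with Go templates, so this Python version is only an approximation for illustration. The function name `render_prompt` and the sample strings are hypothetical, and the line breaks between turns are assumed from the template listing on this page.

```python
import re

# Template as shown in the README, with assumed line breaks between turns.
TEMPLATE = "{{ .System }}\nUSER: {{ .Prompt }}\nASSISTANT:"

# Matches Go-template style fields such as {{ .System }} or {{ .Prompt }}.
_FIELD = re.compile(r"\{\{\s*\.(\w+)\s*\}\}")


def render_prompt(system: str, prompt: str) -> str:
    """Fill the template in a single pass so inserted text is never re-scanned."""
    values = {"System": system, "Prompt": prompt}
    return _FIELD.sub(lambda match: values[match.group(1)], TEMPLATE)


if __name__ == "__main__":
    print(render_prompt("You are an Alt Text Specialist.", "Describe the attached image."))
```

Generation continues after the final `ASSISTANT:` line and ends when the model emits one of the stop sequences from the params (`USER:` or `ASSISTANT:`) or reaches the token limit.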