Instructions to use ncls-p/Qwen2.5-3B-blog-key-points with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use ncls-p/Qwen2.5-3B-blog-key-points with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="ncls-p/Qwen2.5-3B-blog-key-points", filename="unsloth.Q4_K_M.gguf", )
llm.create_chat_completion( messages = "\"The tower is 324 metres (1,063 ft) tall, about the same height as an 81-storey building, and the tallest structure in Paris. Its base is square, measuring 125 metres (410 ft) on each side. During its construction, the Eiffel Tower surpassed the Washington Monument to become the tallest man-made structure in the world, a title it held for 41 years until the Chrysler Building in New York City was finished in 1930. It was the first structure to reach a height of 300 metres. Due to the addition of a broadcasting aerial at the top of the tower in 1957, it is now taller than the Chrysler Building by 5.2 metres (17 ft). Excluding transmitters, the Eiffel Tower is the second tallest free-standing structure in France after the Millau Viaduct.\"" )
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use ncls-p/Qwen2.5-3B-blog-key-points with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M # Run inference directly in the terminal: llama-cli -hf ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M # Run inference directly in the terminal: llama-cli -hf ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
Use Docker
docker model run hf.co/ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
- LM Studio
- Jan
- Ollama
How to use ncls-p/Qwen2.5-3B-blog-key-points with Ollama:
ollama run hf.co/ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
- Unsloth Studio
How to use ncls-p/Qwen2.5-3B-blog-key-points with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ncls-p/Qwen2.5-3B-blog-key-points to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ncls-p/Qwen2.5-3B-blog-key-points to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for ncls-p/Qwen2.5-3B-blog-key-points to start chatting
- Pi
How to use ncls-p/Qwen2.5-3B-blog-key-points with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use ncls-p/Qwen2.5-3B-blog-key-points with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
Run Hermes
hermes
- Docker Model Runner
How to use ncls-p/Qwen2.5-3B-blog-key-points with Docker Model Runner:
docker model run hf.co/ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
- Lemonade
How to use ncls-p/Qwen2.5-3B-blog-key-points with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull ncls-p/Qwen2.5-3B-blog-key-points:Q4_K_M
Run and chat with the model
lemonade run user.Qwen2.5-3B-blog-key-points-Q4_K_M
List all available models
lemonade list
Qwen2.5-3B-blog-key-points
This model is fine-tuned from Qwen/Qwen2.5-3B-Instruct on the ncls-p/blog-key-points. It specializes in extracting key points from blog articles and web content, providing concise bullet-point summaries that capture the essential information.
Model Description
Qwen2.5-3B-blog-key-points is a 3B parameter model fine-tuned specifically for the task of extracting key points from articles. It can process a full article and generate a concise, bullet-point summary highlighting the most important information.
Model Details
- Model Type: Qwen2.5 (3B parameters)
- Base Model: Qwen/Qwen2.5-3B-Instruct
- Training Dataset: ncls-p/blog-key-points
- Language: English
- License: CC-BY-4.0
- Finetuning Approach: Instruction fine-tuning on article-summary pairs
Uses
Direct Use
This model is designed for extracting key points from articles. You can use it directly for:
- Summarizing blog posts
- Extracting important information from news articles
- Creating bullet-point summaries of long-form content
- Generating concise overviews of research papers
Example Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
model_id = "ncls-p/Qwen2.5-3B-blog-key-points"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
article = """
[Your article text here]
"""
prompt = f"""
Extract the key points from the following article:
{article}
"""
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_length=1024)
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)
Training
The model was fine-tuned on the blog-key-points dataset, which contains 200 article-summary pairs. Each pair consists of a full article and a bullet-point summary of key points extracted using AI.
Training Procedure
- Fine-tuning Framework: Unsloth
- Training Data Format:
{ "instruction": "", "input": "Full article content", "output": "Here are the key points of the article:\n* Key point 1\n* Key point 2\n* Key point 3\n..." }
Evaluation
The model was evaluated on its ability to extract relevant key points from articles not seen during training. Evaluation metrics focused on:
- Relevance: How well the extracted points capture the main ideas of the article
- Conciseness: The ability to summarize information in a clear, bullet-point format
- Completeness: Whether all important information is captured in the summary
Limitations and Biases
- The model may inherit biases present in the training data, including potential biases in the source articles or in the key point extraction process.
- Performance may vary depending on the length, complexity, and domain of the input article.
- The model is primarily trained on English-language content and may not perform well on content in other languages.
- As with any summarization model, there is a risk of omitting important information or misrepresenting the original content.
How to Cite
If you use this model in your research, please cite:
@misc{qwen25-3b-blog-key-points,
author = {ncls-p},
title = {Qwen2.5-3B-blog-key-points},
year = {2024},
publisher = {Hugging Face},
journal = {Hugging Face model repository},
howpublished = {\url{https://huggingface.co/ncls-p/Qwen2.5-3B-blog-key-points}},
}
Dataset Creation
The dataset used to train this model was created using the llm-to-blog-key-points-dataset, a CLI tool that extracts key points from web articles using AI and adds them to a dataset in a structured format.
- Downloads last month
- 103
4-bit
5-bit
8-bit