AI & ML interests

None defined yet.

prithivMLmodsย 
posted an update 3 days ago
view post
Post
3665
QIE-Object-Remover-Bbox Demo removes objects and artifacts from selected regions using bounding box grounding. Built on Qwen-Image-Edit-2509 with Rapid Diffusers acceleration, it delivers fast 4-step inference via the QIE-2509 adapter. ๐Ÿค—๐Ÿ”ฅ

๐Ÿ”—Demo Space: prithivMLmods/QIE-Object-Remover-Bbox
๐Ÿ”—Qwen-Image-Edit-Rapid-AIO: prithivMLmods/Qwen-Image-Edit-Rapid-AIO-V4
๐Ÿ”—Adapter-(LoRA): prithivMLmods/QIE-2509-Object-Remover-Bbox

๐Ÿ”—Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-layout-bbox

To learn more, visit the app page or the respective model pages.
  • 1 reply
ยท
prithivMLmodsย 
posted an update 8 days ago
view post
Post
2463
FireRed-Image-Edit-1.0 (Rapid) Fast Experimental Demo is Out! ๐Ÿš€๐Ÿค—

Demo: prithivMLmods/FireRed-Image-Edit-1.0-Fast

-> Paired the EditPlusPipeline with the Diffusers-compatible transformer weights of Rapid AIO from Qwen-Image-Edit. (experimental)
-> This fusion delivers more accurate instruction following, higher image quality, and consistent visual coherence @ 4-step fast inference.
-> Better maintains text styles with high fidelity, along with high-quality old photo restoration, enhancement, and best-in-class virtual try-on.

prithivMLmodsย 
posted an update 13 days ago
prithivMLmodsย 
posted an update 17 days ago
view post
Post
2572
Dropping the Qwen3 VL Series of Unredacted MAX-VL models. These models have undergone multi-stage training to minimize refusal rates through continuous abliterated optimization. You can find the models in BF16, FP8-Dynamic, and GGUF formats at the links below.๐Ÿ”ฅ๐Ÿš€

Unredacted MAX - VL:
โžœ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX
โžœ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX
โžœ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX
โžœ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX

Unredacted MAX - VL [FP8]
โžœ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX-FP8
โžœ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX-FP8
โžœ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX-FP8
โžœ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX-FP8

Unredacted MAX - VL [GGUF]
โžœ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX-GGUF
โžœ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX-GGUF
โžœ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX-GGUF
โžœ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX-GGUF

Unredacted MAX - VL [Collection]
โžœ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl-fp8
โžœ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl
โžœ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl-gguf

To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update 25 days ago
view post
Post
2938
Introducing FLUX.2-Klein-LoRA-Studio, a demo for image editing using specialized LoRA adapters built for the FLUX.2-Klein-Distilled model. It features an edit-style gallery for multi-style image editing, including de-light, face swap, mannequin, and more. Try the demo below.

๐Ÿค—Demo: prithivMLmods/FLUX.2-Klein-LoRA-Studio
๐Ÿค—Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection
๐Ÿค—GitHub: https://github.com/PRITHIVSAKTHIUR/FLUX.2-Klein-LoRA-Studio

To learn more, visit the app page or the respective model pages.
prithivMLmodsย 
posted an update 28 days ago
view post
Post
867
GLM OCR, a multimodal OCR model for complex document understanding, built on the GLM-V encoderโ€“decoder architecture. It delivers high accuracy and strong generalization with a blazing-fast inference pipeline. The demo is live . Try it now. ๐Ÿค—๐Ÿš€

โœจ Demo: prithivMLmods/GLM-OCR-Demo
โœจ Multimodal Implementations: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
โœจ GitHub: https://github.com/PRITHIVSAKTHIUR/GLM-OCR-Demo
prithivMLmodsย 
posted an update 29 days ago
view post
Post
2172
Introducing the Qwen-Image-Edit-3D-Lighting-Control app, featuring 8ร— horizontal and 3ร— elevational lighting positions for precise 3D lighting control. It enables studio-level lighting using fast Qwen Image Edit fast inference, paired with Multi-Angle-Lighting adapters. ๐Ÿ”ฆ

๐Ÿ”ฅ Space: prithivMLmods/Qwen-Image-Edit-3D-Lighting-Control
โœ… Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection
๐Ÿ“‚ GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-3D-Lighting-Control
prithivMLmodsย 
posted an update about 1 month ago
view post
Post
3654
Daggr UI version of the Qwen3-TTS demo.๐Ÿ”ฅ
(custom voice, voice design, qwen3-asr and voice cloning) nodes.
No remote spaces used for API inference; all functions run in-app fn.
Powered by t4-m and built with daggr@0.5.2 and gradio@6.

๐Ÿ‘‰Demo: prithivMLmods/Qwen3-TTS-Daggr-UI
โญGithub: https://github.com/PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI
  • 1 reply
ยท
prithivMLmodsย 
posted an update about 1 month ago
view post
Post
2704
Qwen-Image-Edit-Object-Manipulator Space is now featured in Hugging Face Space of the Week. It enables object manipulation such as extracting objects, adding designs, and removing objects or designs from the red highlighted area using specialized adapters.

๐Ÿ”ฅDo enjoy the demo! ~ prithivMLmods/Qwen-Image-Edit-Object-Manipulator

Collections:
๐ŸงจAdapters-1: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-exps
๐ŸงจAdapters-2: https://huggingface.co/collections/prithivMLmods/qie-jan-23-26
๐ŸงจAdapters-3: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-object-manipulator

โญGithub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-Object-Manipulator

To learn more, visit the app page or the respective model pages.
  • 1 reply
ยท
prithivMLmodsย 
posted an update about 1 month ago
view post
Post
3053
Introducing QIE-2511-Zoom-Master for highlight-guided area zoom-in, enabling lossless zooming within a drawn square area, and QIE-2511-Object-Remover-v2 for precise object or highlight-guided area cleanup. These experimental adapters are trained based on QIE-2511. Find the adapters below.

๐Ÿ•น๏ธQIE-2511-Zoom-Master : prithivMLmods/QIE-2511-Zoom-Master
๐Ÿ•น๏ธQIE-2511-Object-Remover-v2: prithivMLmods/QIE-2511-Object-Remover-v2

๐Ÿค—Demo: prithivMLmods/Qwen-Image-Edit-Object-Manipulator

๐Ÿ“‚Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-exps

To learn more, visit the app page or the respective model pages.
  • 2 replies
ยท
telcomย 
posted an update about 2 months ago
telcomย 
posted an update about 2 months ago
view post
Post
1581
MAD-GRPO: https://huggingface.co/blog/telcom/mad-grpo
In R1-Zero-Like Training *, Dr.GRPO treats GRPOโ€™s by dropping std, but that often comes with a hidden side effect: length-weighted updates that can nudge model toward verbosity.
MAD-GRPO provides robust scale (MAD + epsilon) per-token normalization stability without verbosity bias.

*https://huggingface.co/papers/2503.20783

prithivMLmodsย 
posted an update about 2 months ago
view post
Post
5584
LTX-2 Camera-Control LoRA demo with dolly-in/out and dolly-left/right is now available on Hugging Face, paired with ltx-2-19b-distilled-lora for fast inference. It also includes dynamic GPU duration adjustments for long video generations. Click the related Space links below.

๐Ÿค—Try it now on : prithivMLmods/LTX-2-LoRAs-Camera-Control-Dolly
โญGithub: https://github.com/PRITHIVSAKTHIUR/LTX-2-LoRAs-Camera-Control-Dolly
๐Ÿ•น๏ธCollection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

To learn more, visit the app page or the respective model pages.
  • 2 replies
ยท
telcomย 
posted an update about 2 months ago
prithivMLmodsย 
posted an update 2 months ago
view post
Post
2480
Dropping Image Edit (Object Manipulator): Add or remove specified objects/designs, with flexible support for both single-image and multi-image modes.

๐Ÿค— Demo: prithivMLmods/Qwen-Image-Edit-Object-Manipulator

Qwen-Image-Edit-2511-Object-Remover is an adapter (LoRA) developed for Qwenโ€™s Qwen-Image-Edit-2511 image-to-image model. It is specifically designed for precise object removal from images.

โญ Model: prithivMLmods/Qwen-Image-Edit-2511-Object-Remover

Qwen-Image-Edit-2511-Object-Adder is an adapter (LoRA) developed for Qwenโ€™s Qwen-Image-Edit-2511 image-to-image model. It is specifically designed for precise object addition to images.

โญ Model: prithivMLmods/Qwen-Image-Edit-2511-Object-Adder

๐Ÿ•น๏ธ Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-object-manipulator
๐Ÿ•น๏ธ github: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-Object-Manipulator

To learn more, visit the app page or the respective model pages.
telcomย 
posted an update 2 months ago
view post
Post
235
if you are interested in HUB (https://saemi410.github.io/HUB/ I recommend the fork I have created with some updates to make it smooth in running a smoke test git@github.com:javadtaghia/HUB.git) and you want to run the UCE (https://unified.baulab.info), please check:
- Model weights for UCE here: telcom/uce_NSFW
- Model weights for ESD here: telcom/esd_NSFW
- datasets and more download materials from: telcom/HUB_reference_dataset

Please read the notes in the model card.
prithivMLmodsย 
posted an update 2 months ago
view post
Post
4231
Update: TRELLIS.2 (Text to 3D, Image to 3D) Gradio with Rerun Embedded demo with improved visualization of the 3D model previewer is now available on Hugging Face. Generate assets and view them in the 3D viewer, powered and streamlined with Microsoftโ€™s TRELLIS.2 and Tongyi-MAIโ€™s Z-Image-Turbo models.

๐Ÿค— TRELLIS.2 (Demo): prithivMLmods/TRELLIS.2-Text-to-3D
๐Ÿ•น๏ธ GitHub: https://github.com/PRITHIVSAKTHIUR/TRELLIS.2-Text-to-3D-RERUN
๐Ÿ•น๏ธ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations

To know more about it, visit the app page or the respective model page!
prithivMLmodsย 
posted an update 2 months ago
view post
Post
4283
Introducing the Qwen-Image-Edit-2511-LoRAs-Fast demo, featuring image property comparison and contrast, built on top of Gradio and the combined Rerun SDK. It supports single and multi-image edits with existing LoRAs that are lazily loaded. (Note: This is still an experimental Space for Qwen-Image-Edit-2511.)

โญ Space Demo: prithivMLmods/Qwen-Image-Edit-2511-LoRAs-Fast
โญ GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-2511-LoRAs-Fast-Multi-Image-Rerun
โญ Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

To know more about it, visit the app page or the respective model page!
  • 2 replies
ยท
telcomย 
posted an update 2 months ago
view post
Post
268
NVIDIAโ€™s Groq deal ... I think, inference efficiency is becoming the main driver of profitability, and NVIDIAโ€™s Groq deal is evidence the market is moving from โ€œwho can train biggestโ€ to โ€œwho can serve cheapest and fastest at scale.โ€ That points to a maturing phase of AI, not necessarily the end of a bubble, but definitely a correction in what โ€œwinsโ€ long-term.
What do you think?
  • 2 replies
ยท
telcomย 
posted an update 2 months ago
view post
Post
186
CIFAR-10 your handing image dataset ...
CIFAR-10 is a small, standard computer-vision dataset used to quickly test and compare ideas.

- 60,000 color images, each 32ร—32 pixels, labeled into 10 classes: airplane, automobile, bird, cat, deer, dog, frog, horse, ship, truck.
- Label mapping (important):

- 0 airplane
- 1 automobile
- 2 bird
- 3 cat
- 4 deer
- 5 dog
- 6 frog
- 7 horse
- 8 ship
- 9 truck
- Split: 50,000 train and 10,000 test.
- Why people use it: fast benchmarking for image classifiers (small CNNs, ResNet, ViT), and quick experiments for training pipelines, augmentation, regularization, pruning, distillation, and demos.
- Sizes (downloads): Python version about 163 MB, binary about 162 MB. Hugging Face shows about 144 MB for the dataset files.
- Where to get it: the official CIFAR page (University of Toronto) and the Hugging Face CIFAR-10 dataset page.
uoft-cs/cifar10
If you want something more, check the table below
| Dataset | Resolution | Classes | Best For |
| ImageNet 1K | 224โ€“256ร—256 | 1000 | Real-world large-scale classification |
| ImageNet-256. | 256ร—256 | 1000 | Direct high-res training |
| TinyImageNet | 64ร—64 | 200 | Mid-range benchmark |
| UC Merced Land Use | 256ร—256 | ~21 | Higher resolution small classification |
| MS COCO | >256ร—256 | ~80 objects | Detection / segmentation |