David Berenstein

davidberenstein1957

AI & ML interests

Everything NLP and knowledge graphs

Articles

Organizations

Posts 17

view post
Post
1891
🧢 We are launching distilabel DataCraft: get started with synthetic data using clicks and natural language!

🌊 Workflow
- Write down your custom GenAI usecase
- Automatically generate system prompts
- Create sample datasets for quick iteration
- Produce full-scale datasets with customizable parameters
- Push generated datasets directly to the Hugging Face Hub

⚑️ Powered by Argilla's distilabel and open source LLMs
πŸ†“ Uses Free Serverless HF Inference Endpoints

πŸ’‘ Use Cases:
- Fine-tuning language models for specific domains
- Creating diverse datasets for robust model training
- Rapid prototyping of AI applications
- Generating synthetic data for privacy-sensitive projects

πŸš€ Start crafting your custom datasets today and do it quicker, easier and more private with distilabel DataCraft!
argilla/distilabel-datacraft
view post
Post
1629
πŸ¦€ Is your SQL a bit rusty? I just created theText To SQL Hub dataset explorer. To write SQL queries based on natural text input. Uses DuckDB, Llama 3.1 70B and the Hugging Face dataset-server API.

davidberenstein1957/text-to-sql-hub-datasets