Pelayo Arbués
Search
Search
Dark mode
Light mode
Explorer
attachments
Data Science Fundamentals
Resources
A/B Testing
Business Understanding
Causal Inference
Communicate with impact
Data Science Ethics
Data Visualization
Good Data Analysis
Introduction to Computer Science
Linear Algebra
Machine Learning 101
Numerical optimization
Programming Language
Rules of ML
Shell Script and others
SQL
Statistical Learning
Statistics 101: Probability
Technical Writing
The Data Science Process
The Ultimate Guide to Deploying ML Models
Time Series Analysis
Time Series Analysis
Literature Notes
Articles
LoRA: Low-Rank Adaptation of Large Language Models
![CDATA[Not Boring by Packy McCormick]]>
¡Socorro, Mi Barrio Se Esta Poniendo Precioso! Asi Es La Gentriansiedad, El Miedo a Las Mejoras Urbanas
¿Cuantas Ingestas De Proteína Debes Hacer Al Día? ¿Y Con Ayuno Intermitente?
¿De Verdad Queremos Lo Que Creemos Anhelar? Sobre Tentaciones, Prohibiciones Y Frustraciones
¿En qué nos gastamos el dinero? La IA detrás de la clasificación de gastos e ingresos
¿Han Arruinado Los Móviles La Salud Mental De Los Jóvenes? La Ciencia Busca Explicaciones a Un Problema Universal
¿Por Qué Está Condenado El Alquiler Tradicional De Viviendas? Se Gana Hasta Cuatro Veces Más Al Mes Con Un Piso Turístico
¿Por Que Los Salarios Son Mas Altos en Las Ciudades Grandes? La Importancia Del Poder De Monopsonio en El Mercado Laboral Español
¿Qué Es La Academia?
¿Que Es Product Management? Por Shreyas Doshi
¿Qué Es Un Tech Lead?
‘There Was All Sorts of Toxic Behaviour’: Timnit Gebru on Her Sacking by Google, AI’s Dangers and Big Tech’s Biases
[Insights for Intermediates] - How to Craft the Images You Want With A1111
"A Theory of Everyone" by Michael Muthukrishna
"Single Panes of Glass" Are Horrible
'Enshittification' Is Coming for Absolutely Everything
'Proyecto Trinity' O La Importancia De Que La Sociedad Entienda La Necesidad De Invertir en Vivienda
'Urbanalización' O Por Qué Todos Los Centros De Ciudad Parecen El Mismo Sitio
#158 Capitalismo Para El S. XXI
#177 Entrenamiento Y Complejidad
#206 Por Qué Son Tan Difíciles Los Productos Basados en Software
⚖️ Create a Legal Preference Dataset
⚗️ 🧑🏼🌾 Let's Grow Some Domain Specific Datasets Together
⚡️ IBM Granite 3.0 8B Surpasses Llama 3.1 on OpenLLM Benchmark
⚡️ the First AI PC and Lightest LLMs: Microsoft Is Back
⚡️ This Repo Makes LLMs 40% Faster
⚡️ This Was AI's Busiest Week
✍️ Es Peor Que Un Crimen, Es Un Error. O No. Incentivos
✍️ Ser Data-Driven No Es De Guapas
🌋 LLaVA-Plus: Large Language and Vision Assistants That Plug and Learn to Use Skills
🌌 AI Drones Are Paving the Way for Autonomous Spaceflight
🌟 Our ChatGPT Retrieval Plugin is #1 trending on GitHub!
🌟 Thrilled to introduce vLLM with @woosuk_k!
🌲 Secondary Sources Are Pretty Great, Actually
🎮 La Unrealidad Ya Está Aquí
🏆 the New Claude Is 2x Faster, 80% Cheaper
🏖️ Your Guide to AI: July 2024
📄 Microsoft's New SpreadsheetLLM
🔥 Breakthrough: Matrix Multiplication Free LLMs
🔥 Mistral Launches Its First Ever Multimodal Model, Pixtral 12B
🔦 Shedding Light on History With AI
🔫 Zero-Shot and Few-Shot Classification With SetFit
🗳️ How to Control Your Computer With Language Models
🤯 New Technique Allows Open-Source LLM to Beat Top Closed-Source Models
🦍 LLMs Are Taking Over APIs + Other Top Repos
🦙⚗️ Using Llama3 and Distilabel to Build Fine-Tuning Datasets
🧙 Create an Evol-Instruct Dataset¶
🧨 Diffusers Welcomes Stable Diffusion 3.5 Large
🧰 Claude's New AI Agent Toolbox
🙌 Analyzing Annotation Metrics With FastFit Model Predictions
🚨 Llama 3.1: Open-Source Finally Beats GPT
1 Introducing Pytimetk: Simplifying Time Series Analysis for Everyone
3-2-1: Healthy Self-Esteem, How to Build an Exercise Habit, and Improving by 1%
3-2-1: Judging potential, negotiating, and balancing life
3-2-1: On Endless Pursuits, the Value of Courage, and How to Buy Back Your Time
3-2-1: On the Biggest Barrier to Learning, Powerful Self-Talk Strategies, and the Ripples We Leave Behind
3-2-1: One of the most valuable skills in life, and starting before you feel ready
3. Fine-Tune LLaMA 13B With QLoRA on Amazon SageMaker
4 Ways to Test ML Models in Production
5 Thoughts on The 2023 MAD (Machine Learning, Artificial Intelligence and Data) Landscape
6 Habits of High-Performing Teams
7 Powers
7 Things You Need to Know About Fine-Tuning LLMs
7 Ways to Speed Up Inference of Your Hosted LLMs
8's Enough 40's Plenty
10 Ways to Go From Unhealthy to Healthy Ad-Hoc Request Cycles
10,000 Microwave Enthusiasts to Attend Annual Microwave Conference in Las Vegas
2022 Recap: Every Random Idea I Had
A Builder's Guide to Evals for LLM-based Applications
A Deepdive Into Aya Expanse: Advancing the Frontier of Multilinguality
A Dive Into Vision-Language Models
A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using Hugging Face Transformers, Accelerate and bitsandbytes
A Gentle Introduction to Vector Databases
A Guide to Structured Generation Using Constrained Decoding
A Humanoid Robots Gets Behind the Wheel 🚕
A Large Language Model Walks Into an Archive...
A Lesser-Known Detail of Dropout
A New Home for Python-Build-Standalone
A Philosopher for Our Times
A Practical Guide to Human-in-the-Loop Distillation
A Roadmap for Creating a Data Literacy Program
A Short Introduction to the Underlay
A Short Summary of Chinese AI Global Expansion
A Snapshot of AI-powered Reminiscing in Google Photos
A Survey of Generative Search and Recommendation in the Era of Large Language Models
A Visual and Intuitive Guide to What Makes ReLU a Non-Linear Activation Function
A Writing Inbox for Transient and Incomplete Notes
About nownownow.com
Acceso a La Vivienda, Un Elemento Clave en La Calidad De Vida
Accurately Valuing Homes With Deep Learning and Structural Inductive Biases
Acquiring Listing Media via Web API
Actively Listening
Adaptive Raises $19M to Transform Construction Finance With AI
Add Film Grain and Vignette to Images
Advanced Query Transformations to Improve RAG
Advanced RAG Techniques: An Illustrated Overview
Advancing Product Categorization With Vision Language Models: The Power of Fine-Tuned LLaVA
Agentes Autónomos: Salesforce Aterriza en España La Gran Revolución Empresarial De La IA en La Gestión De Clientes
AI Achieves Silver-Medal Standard Solving International Mathematical Olympiad Problems
AI Agents With Low/No Code, Hallucinations Create Security Holes, Tuning for RAG Performance, GPT Store's Lax Moderation
AI Distillation #7
AI engineers report burnout and rushed rollouts as ‘rat race’ to stay competitive hits tech industry
AI Has Unlocked a Level of Facebook Pandering Previously Unknown to Science
AI in Organizations: Some Tactics
AI Landlord Screening Tool Will Stop Scoring Low-Income Tenants After Discrimination Suit - The Verge
AI Leapfrogging: How AI Will Transform “Lagging” Industries
AI Optimism vs. AI Arms Race
AI Safety Advocates Tell Founders to Slow Down
AI Safety and the Age of Dislightenment
AI-Based Property Platform, Propcorn, Secures €600,000 to Transform Urban Development and Property Investment
AI-enhanced Development Makes Me More Ambitious With My Projects
AI-Powered Architectural Platform Raises to Bring 2D Sketches to Life
AI-powered Siri Is Here
AI’s $600B Question
Alex Karp Has Money and Power. So What Does He Want?
Algunas Reflexiones Acerca Del Mundo Real De Uno Que Echó Un Vistazo Y Se Marchó
Alternatives to Product Managers
Amazon Wishlist Web Scraper
An Analysis of Chinese LLM Censorship and Bias With Qwen 2 Instruct
An Avalanche Really Is Coming This Time
An Ethical AI Never Says "I"
An image is worth 16x16 words. Transformers for image recognition at scale
Añadir Diez “Buenos Años” a Tu Vida Enseñando Nuevos Trucos a Viejas Células
Analysis | How Walkable Is Your Neighborhood? Use Our Interactive Map to Find Out. - Washington Post
Analysis: 7 Trends That Will Dominate Real Estate Portals and Proptechs in 2025
Andrew Huberman’s Mechanisms of Control
Announcing Data Wrangler: Code-Centric Viewing and Cleaning of Tabular Data in Visual Studio Code
Announcing FLUX1.1 [Pro] and the BFL API
Announcing New Dataset Search Features
Anthropic Is Neck-and-Neck With Rivals After Its Latest Release
Apdex: Measure user satisfaction
Apple's Huge AI Announcement Is a Chatbot and an Image Generator, Which Is the Exact Same Boring Offering as Microsoft, Google and Meta
Applied LLMs - What We ve Learned From A Year of Building with LLMs
Are You Ready to Hire Your First Data Scientist?
Aria: First Open Multimodal Native MoE Model
Aristotle — How to Live a Good Life
Artificial General Intelligence Is Already Here
Artificial Intelligence Plus Human Expertise - A Combination That Can Transform RE Value
Atkinson Hyperlegible
Autonomous AI Agents: A Progress Report
Autonomous Coding Agents, Instability at Stability AI, Mamba Mania, What Users Do With GenAI
Balancing the Scales: A Comprehensive Study on Tackling Class Imbalance in Binary Classification
Balancing the Weight of Variables in a Decision Tree
BBRR Programa investigo
Be a Thermostat, Not a Thermometer
Beating Proprietary Models With a Quick Fine-Tune
Becoming a Data Engineering Force Multiplier
Becoming a Parent Made Me a Better Person
Becoming Data Driven, From First Principles
Been working on LLMs in production lately
Being Glue talk
Ben Franklin: The Thirteen Necessary Virtues
Benzodiacepinas: La Adicción Camuflada Del Consumo De Ansiolíticos en España
Better Tools, Bigger Companies
Beware the Data Science Pin Factory: The Power of the Full-Stack Data Science Generalist and the Perils of Division of Labor Through Function
Bge M3
Big Tech Goes Nuclear.
Blog Writing for Developers
Book Review – Data Culture by Dr. Shorful Islam
Books Read in 2024
Bootstrapping to $150K MRR by Doing Less, Better.
Bradley–Terry model - Wikipedia
Brain Food: Listening to Win
Breaking the Inertia of Mediocrity
Bridging the Hard and the Soft
Bringing It Up to Twelve! Going Deep Into Quality.
Buddhist Economics: How to Start Prioritizing People Over Products and Creativity Over Consumption
Build a More Open Lakehouse With Unity Catalog
Build a Search Engine, Not a Vector DB
Build a Serverless Customer Service Voicebot
Build and Deploy Generative AI and Machine Learning Models in an Enterprise
Build Interactive Data Apps of Scikit-Learn Models Using Taipy
Building a Collaborative Asynchronous Work Environment
Building a First Team Mindset
Building a Machine Learning Platform [Definitive Guide]
Building a Million Dollar Data Analytics Service
Building an Antilibrary: The Power of Unread Books
Building an Intent Router With Langchain and Zep
Building LLM Applications for Production
Building Personal and Organizational Prestige
Building Services Platform, Vehya, Raises $2.1m to Grow AI Capacity
Buy Wisely
Cada Vez Hay Más Voces Dentro De La Industria Tecnológica Que Piensan Que La IA Es Una Gigantesca Burbuja. Y Tienen Buenos Argumentos
Can Humanity Survive AI?
Carol Dweck: A Summary of Growth and Fixed Mindsets
Causal Inference in R
CDO Agenda 2024
Chain of Thought Prompting for LLMs
Changing Guidelines: Best Practices for Maintaining Data Quality
Changing Your Life Takes More Than Just Ideas
ChartGemma: Visual Instruction-Tuning for Chart Reasoning in the Wild
Chat, Discover and Find Your Next Home With Zillow’s Plugin on ChatGPT
Chatbots Remind Us That Natural Conversation Is Artificial Too
ChatGPT Gets Its “Wolfram Superpowers”!
ChatGPT Plugins
Chess Without Checkmate: The Portal Wars
Chexy Raises $3m to Disrupt Canadian Rental Market With Online Reward Platform
Chips All the Way Down
Classifieds Are Getting Smarter: How Artificial Intelligence Will Help You Buy an Apartment
Claude’s Character
Clean ML Datasets With Cleanlab
ClickHouse as Part of ETL/ELT Process
ClickHouse: A Blazingly Fast DBMS With Full SQL Join Support - Part 1
ClickHouse: A Blazingly Fast DBMS With Full SQL Join Support - Under the Hood - Part 2
Cloud Next 2024: More Momentum With Generative AI
Code Review for Statisticians, Data Scientists & Modellers
Cohere Compass Private Beta: A New Multi-Aspect Embedding Model
Cohort Revenue & Retention Analysis
Coinbase Is a Mission Focused Company
Cold Sourcing: Hire Someone You Don't Know.
Common LoRA parameters in PEFT
Cómo Afecta El Reglamento De IA a Los Modelos De Lenguaje
Cómo Aprende a Leer Nuestro Cerebro: De La Mecánica Lectora a La Comprensión
Cómo Aprendí Economía
Cómo Franco Convirtió a España en Un País De Propietarios
Cómo Preparar a Un Adolescente Para El Futuro Y Que Su Trabajo No Sea Automatizado
Company Culture Is the Last 50 Days
Competición Mimética
Comprehensive Guide to Ranking Evaluation Metrics
Computing Out-of-Sample Predicted Probabilities with Cross-Validation#
Con La Automatización De Tareas Con IA, Perdemos La Inteligencia Genuinamente Humana Para Quedarnos Sólo Con La Artificial
Concepts LLMOps
Conformal Predictions: Build Confidence in Your ML Model's Predictions
Confusing Git Terminology
Contextualized Recommendations Through Personalized Narratives Using LLMs
Contrary Research Rundown #45
ControlNet v1.1: A complete guide
Creando Una Cultura De Producto en Ingeniería
Create Capacity Rather Than Capture It.
Creating a LLM-as-a-Judge That Drives Business Results
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Crystal Ball for Interest Rates
Cuando Todos Los Coches Se Fabriquen en China. O Por Marcas Chinas en Europa
Cuando Tu IA Son en Realidad 1000 Indios Anónimos
Customer Insights Extraction With Transformers and NLP
Daniel Dennett, 1942-2024
Data Alone Is Not Enough
Data and Reality
Data Arithmetic
Data as a Product vs Data Products. What Are the Differences?
Data Career Insights: Lessons From Four Senior Leaders in the Data Space
Data Collection
Data Contracts in DataHub: Combining Verifiability With Holistic Data Management
Data Debt Is Not Evil: A Pragmatic Perspective
Data Engineering Is Evolving, but Most Data Engineers Aren’t
Data Is Better Together
Data is not a Microservice
Data Mesh and Strategy Tech Stack Alignment
Data Platform Explained Part I
Data Platform Explained Part II
Data Products and the Journey to Data-Driven
Data Scientists Work Alone and That's Bad
Data Version Control
Data VS intuition at Basecamp with Jane Yang
Data-Twitter Is Having a MOMENT on Bluesky Right Now
Database Remote-Copy Tool for SQLite
David Hume—Why We Change Our Mind
Daylight at the End of the Tunnel
Dear Stakeholder
Death by Situationship
Decoding Kaggle’s 2023 AI Report: Essential Tips for Machine Learning With Tabular Data 🔍📈
Deferred Happiness and the Retirement Trap
Demystifying LLMOps: A Practical Database of Real-World Generative AI Implementations
Designing a Culture of Reinvention
Designing a Real Estate Agent using OpenAI & Qdrant
Developing Domain Expertise: Get Your Hands Dirty.
Diffusion models have amazing image creation abilities
DINOv2: State-of-the-Art Computer Vision Models With Self-Supervised Learning
Direct Preference Optimization: Your language model is secretly a reward model
Directly Responsible Individuals
Directly Responsible Individuals: The What, How and Why of DRIs
Disband the Analytics Team
Discounted Cumulative Gain
Discovering Language Model Behaviors with Model-Written Evaluations
Discovering the Next Google Thanks to ETF Analysis
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Dive Into Anything
Do AI Companies Work?
Do Data Teams Have Product-Market Fit?
Do Your Own Thinking
Docmatix - A Huge Dataset for Document Visual Question Answering
Documentation Refresh for LangChain V0.2
Don’t Believe the Hype: AGI Is Far From Inevitable
Don’t Build the Thing, Build the Thing That Builds the Thing!
Don’t Let That Crybaby in Here Again
Don't Mock Machine Learning Models in Unit Tests
Doorvest Acquires Getaway to Revolutionize Real Estate Investing
Drag Your GAN: Interactive Point-Based Manipulation on the Generative Image Manifold
Driving Operational Clarity
Driving Value With Sprint Goals
DuckDB vs Dplyr vs Base R
During yesterdays webinar hosted by @LangChainAI and @hwchase17 there were...
E T C H E D I S M a K I N G T H E B I G G E S T B E T I N a I
easystats: An R Framework for Easy Statistical Modeling, Visualization, and Reporting
EDS Wrap Up: Learn, Deliver, Communicate
EEUU Quiere Regular La Inteligencia Artificial Pero No Sabe Cómo: “Europa Va Por Delante. Necesitamos Liderar”
Efficient Open-Domain Question-Answering on Vespa.ai
Efforts to Expand the Lifespan Ignore What It’s Like to Get Old
Ego Is the Enemy: The Legend of Genghis Khan
El "Cultural Fit"aumenta El Éxito De La Contratación, Pero Engendra Monoculturas Discriminatorias Y Favorece El Pensamiento De Rebaño
El Aburrimiento No Nos Hace Más Creativos, Y Esta Es La Razón
El Amateur Dopado Convierte El Deporte en Una Competición De Quién Está Dispuesto a Arriesgar Más Su Salud Por El Éxito
El Desafio Cuantico De La Conciencia Humana
El Deseo Nos Hace Únicos a Los Humanos. El Equilibrio: Ni Vapear, Ni Sumarse Al Movimiento “No Fap”
El Efecto Ringelmann: Por Qué Vamos Más Lentos Cuántos Más Somos
El Futuro De La Innovación Es Azul
El Gran Logro Científico De Nuestro Tiempo Es La Modificación Y El Control Del Deseo
El Hombre Que Se Siente Como Un Castaño De 200 Años
El Mapa Del Alquiler Calle a Calle en La Región De Murcia
El Pleasurable MVP De Hey Calendar
El Premio Nobel De Economía 2024 a Daron Acemoglu, Simon Johnson and James A. Robinson
El Prisas
El Psicólogo Ramón Nogueras: "Si Un Niño Da La Vara Y Le Das Un Móvil, Aprenderá a Dar La Vara"
El Reino Unido Se Renueva
El Rendimiento Es Una Feature
El Vagon Más Lento
Eladlev/AutoPrompt
Elbows of Data
Elon Musk: 1; Yann LeCun: 1; Humanity: 0
Embedding Adapters
Embrace Complexity
Embrace the randomness
Employee Stock Options Guide
En La Era De La Inteligencia Artificial, Más Pensamiento Crítico.
Encoding Spatial Patterns as Variables
Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department
Enhancing Prompt Engineering: Evaluating System Messages With AzureML and GPT-4
Entrata Acquires Colleen AI to Enhance Autonomous Property Management
Entregar No Es Suficiente
Ep 46: DBT Labs on DBT
Epum Raises $1.6M for Urban Zoning and Planning Data Platform
Es El Fin De La Clase Media. De La Clase Media De Los Medios De Comunicación.
Es El Mercado De La Atención, Amigo. Y El New York Times Lo Pelea Con Sus Videojuegos
España No Tiene FIRE
España Tiene Agua De Sobra, Hay Que Saber Aprovecharla
Evaluating and Uncovering Open LLMs
Evaluation of LLM question+answering chains can be challenging: here's @huggingface...
Evaluation-Driven Development: Improving WandBot, our LLM-Powered Documentation App
Even if You Think AI Search Could Be Good, It Won't Be Good
Evergreen Notes Are a Safe Place to Develop Wild Ideas
Everyone Loves the Idea of AI, but Not the Reality
Ex-Deliveroo Execs Secure Google AI Funding for Property Search Engine
Exclusive: Google Workers Revolt Over $1.2 Billion Contract With Israel
Executive’s guide to developing AI at scale
Experimenting With Local Alt Text Generation in Firefox Nightly
Experts vs. Imitators
Explaining Why Data & Models Aren’t Always Right & Getting Leaders to Act on Them
Exploratory Data Analysis in R
Exploring the Potential of the Segment Anything Model
Extending the Context Length to 1M Tokens!
Extending Transformer Layers as Painters to DiT's
Facebook Is Already Mistakenly Tagging Real Photos as "Made With AI"
Faster Text Generation With Self-Speculative Decoding
Feature Discretization
Feature Engineering for Personalized Search
Feature Interpretation With the GapEncoder
Feelings of Presence in Sleep Paralysis and Other Conditions
Finally, a Replacement for BERT
Financial Statement Analysis With Large Language Models
Find Noisy Labels in Regression Datasets#
Findigs Raises $27m to Simplify Rental Screening and Leasing Decisions
Fine-Tune BERT for Text Classification on AWS Trainium
Fine-Tuning BERT for an Unbalanced Multi-Class Classification Problem
Fine-Tuning Florence-2 - Microsoft's Cutting-Edge Vision Language Models
Fine-Tuning Llama 2 70B Using PyTorch FSDP
Fine-Tuning the Multimodal Marvel: Qwen-2 VL With LlamaFactory
Fine, I'll Run a Regression Analysis. But It Won't Make You Happy.
Finetuning an LLM: RLHF and Alternatives
Finetuning an LLM: RLHF and Alternatives
Finetuning an LLM: RLHF and Alternatives
FineVideo: Behind the Scenes
First Came ‘Spam.’ Now, With A.I., We’ve Got ‘Slop’
Fixing Faces in Stable Diffusion Using ADetailer Extension
Flux AI: A Beginner-Friendly Overview
FLUX1.1 [Pro] Is Here
Flyhomes Acquires Real Estate AI Startup, ZeroDown, and Launches AI Home Search Portal
Focus Time for Developers and Everybody Else
FOMO and AI Anxiety in Davos
For Immediate Release: Doubling Down on Our Data Investments
Founder Mode
From Business to Data: Boosting AI Teams
From Data Scientist to ML / AI Product Manager
From Data to Product
From Pandas to Production: How We Built DLT as the Right ELT Tool for Normies
From PDFs to AI-ready Structured Data: A Deep Dive
From PyTorch to PyTorch Lightning
Fuzzy Ownership
Gemini 1.5 Flash-8b Is Now Production Ready
Gemini’s Big Upgrade: Faster Responses With 1.5 Flash
Generating Worlds
GENERATIVE AI AT WORK
Generative AI Strategy
Generative AI’s Act O1
Generative NLP Models in Customer Service: Evaluating Them, Challenges, and Lessons Learned in Banking
Geoffrey Hinton and the Existential Threat From AI
German Startup Syte Raises Seed Funding for Its AI-powered Data Platform
Getting Started With Hybrid Search
Getting Your Child to Love Reading in 2024
GitHub - DorsaRoh/Machine-Learning: Machine Learning From Scratch
Going Beyond Chatbots: How to Make GPT-4 Output Structured Data Using LangChain
Going With Your Gut Feels Good, but It’s Not Always Wise
Good Product Manager/Bad Product Manager
Google "We Have No Moat, and Neither Does OpenAI"
Google CEO Says Easy AI Gains Are Over
Google DeepMind: Bringing Together Two World-Class AI Teams
Google Shopping’s Getting a Big Transformation
Google Strategist Quits, Slams Company's AI Work as Motivated by Greed and Fear
Google’s Search AI Recommends Changing Your Car’s Blinker Fluid, Which Is a Made Up Thing That Does Not Exist
GPT and the Economics of Cognitively Costly Writing Tasks
GPT-4o Mini
GPT-5 Could Be Months Away
Grandes Exitos Y Fracasos De La Cultura De La Cancelacion
Granularity, consistency and scalability in morphological studies
Grid Search and Random Search Are Outdated. This Approach Outperforms Both.
Growing With Your Company's Complexity.
Guesty Acquires Barcelona-Based Rentals United
Guides / Present to Executives
Guides / Staying aligned with authority
Guides / Work on What Matters
Gwyneth Windflower
Handling Mislabeled Tabular Data to Improve Your XGBoost Model
Happiness Is Bullshit
Happy New Year: GPT in 500 Lines of SQL
Hello GPT-4o
Hello Product Data Team, Goodbye Ad-Hoc Work
Here Lies the Internet, Murdered by Generative AI
Hey Claude, Help Me Analyze Bluesky Data.
High-Quality Upscaling Made Easy in Stable Diffusion
Homemove Raises $1.5m to Build AI-Driven Moving Service
How (And How Not) to Disagree With Your Manager
How a String of Pearls Can Change Your Life
How AI Is Personalizing Customer Service Experiences Across Industries
How AI Is Reshaping Recruiting
How Analytics Can Make a Massive Impact on the Bottom Line
How Business Can Lead in the Age of Generative AI
How CEOs Can Lead a Data-Driven Culture
How Did We Get to a World of Hyper-Surveillance?
How Google Lost Its Way
How I Ship Projects at Big Tech Companies
How I Try to Keep Up With the Data Tech World
How Meta Measures the Management of Its AI Ecosystem
How Nesta Uses NLP to Process 7m Job Ads and Shed Light on the UK’s Labor Market
How Neuroscience Can Help You as a Software Engineer - Motivation
How Pinterest Leverages Realtime User Actions in Recommendation to Boost Homefeed Engagement Volume
How Prototyping Can Help You to Get Buy-In
How the Guinness Brewery Invented the Most Important Statistical Method in Science
How the Mighty Fall
How to "Correctly" Obtain Images for a Dataset.
How to Assess Correlation on Ordinal Data?
How to automatically fix faces and hands (adetailer)
How to Be Strategic
How to Become a More Effective Engineer
How to Become a Time Billionaire
How to Build a Chatbot With ChatGPT API and a Conversational Memory in Python
How to Build a Custom Text Classifier Without Days of Human Labeling
How to Build an Open-Domain Question Answering System?
How to Build Data Literacy in Your Company
How to Disrupt a System That Was Built to Hold You Back
How to Do Great Work
How to Do Mental Time Travel
How to Effectively Manage Low Performers: The CARES Framework
How to Evaluate, Compare, and Optimize LLM Systems
How to Feel Less Lonely as You Get Older
How to Find Optimal Epsilon Value for DBSCAN Clustering?
How to Fine-Tune Google Gemma With ChatML and Hugging Face TRL
How to Fine-Tune LLaVA on a Custom Dataset
How to Fine-Tune Multimodal Models or VLMs With Hugging Face TRL
How to Fine-Tune PaliGemma 2
How to Generate Consistent Style With Stable Diffusion Using Style Aligned and Reference ControlNet
How to Get More Headcount.
How to Go Get Your Next Job in Tech
How To Gradio: Components — Image
How to Interrupt (And Be Interrupted) Respectfully in the Workplace
How to Label 1M Data Points/Week
How to Learn the Most About a Candidate From a Single Interview Question
How to Leave a Tech Job
How to Make a LoRA
How to make Britain’s AI dreams reality
How to Organize Continuous Delivery of ML/AI Systems: A 10-Stage Maturity Model
How to Post on LinkedIn API With Python
How to Read a Book: The X-Ray Method for Achieving a Sustainable “Book-Life Balance”
How to Rewild Yourself
How to Safely Query Enterprise Data With LangChain Agents + SQL + OpenAI + Gretel
How to Summarize Books Using ChatGPT: 7 Experiments in AI Distillation
How to Train Flux LoRA Models
How to Train Your Dream Machine
How to Train Your Own Large Language Models
How to Use InstantID to Copy Faces
How to Use LoRA With Flux AI
How to Write a Git Commit Message
How to Write Better With the Why, What, How Framework
How We Built Something Useful
How Well-Structured Should Your Data Code Be?
Humane Ingenuity 46: Can Engineered Writing Ever Be Great?
Hybrid Search Explained
Hypnagogic Hallucinations
I Am an Article About the Speaking Objects of Ancient Greece
I Asked My Followers:
I Can Now Run a GPT-4 Class Model on My Laptop
I Got Early Access to ChatGPT API and Then Pushed It to It’s Limits. Here’s What You Need to Know.
I Made Stable Diffusion XL Smarter by Finetuning It on Bad AI-Generated Images
I’ve Never Had a Goal
IBM X JLL's New ESG Reporting and Data Management Solution for CRE
Iceberg + Single Node Engines
Ideas on how to generate an image of an empty room from one with furniture?
IDEs With GenAI Features That Software Engineers Love
Ilya Returns to Take on OpenAI
Image ALT Text: Best Practices, Examples & SEO
Image Similarity With Hugging Face Datasets and Transformers
Improving Results by Using Multiple Models of the Same Concept (Turning It to 11!).
Improving Zero-Shot Ranking With Vespa Hybrid Search
Improving Zero-Shot Ranking With Vespa Hybrid Search - Part Two
In 2009, a Man Asked Ted Kaczynski if Nuclear Weapons...
In Defense of the Office
In Leaked Audio, Amazon Cloud CEO Says Human Developers Will Soon Be a Thing of the Past
In Praise of Boring AI
In-Depth Guide for Training Logos With AI
IndieWeb Examples
Innovación Invita a Emprendedores a Diseñar Una Herramienta De IA Frente a Pisos Turísticos Ilegales
Inpainting: A Complete Guide
Inside the Matrix: Visualizing Matrix Multiplication, Attention and Beyond
Inspect
InsPix2Pix
Instantly Deploy Generative AI With NVIDIA NIM
InstructPix2Pix is built straight into the img2img tab of A1111 now. Load the checkpoint and the "Image CFG Scale" setting becomes available.
Integrating R With Modern Tech Stacks
Inteligencia Artificial Y Poder | Antonio Ortiz | TEDxMálaga
Introducing Apple’s on-Device and Server Foundation Models
Introducing ChatGPT and Whisper APIs
Introducing Code Llama, a State-of-the-Art Large Language Model for Coding
Introducing Command R+: A Scalable LLM Built for Business
Introducing Computer Use, a New Claude 3.5 Sonnet, and Claude 3.5 Haiku
Introducing Devin, the First AI Software Engineer
Introducing Gemini: Our Largest and Most Capable AI Model
Introducing Idefics2: A Powerful 8B Vision-Language Model for the Community
Introducing Intuit Assist
Introducing Llama 3.1: Our Most Capable Models to Date
Introducing Marqo Specialized Embedding Models for Ecommerce: Powering Multimodal AI Search
Introducing Multimodal TextImage Augmentation for Document Images
Introducing OpenAI O1-Preview
Introducing RAGs: Your Personalized ChatGPT Experience Over Your Data
Introducing Sphere: Meta AI’s Web-Scale Corpus for Better Knowledge-Intensive NLP
Introducing SREs, TPMs and Other Specialized Roles.
Introducing Stable LM 2 12B
Introducing the Analysis Tool in Claude.ai
Introducing the Model Context Protocol
Introducing the Realtime API
Introducing Voyager: Spotify’s New Nearest-Neighbor Search Library
Introducing Writebook
Introduction#
Investing in Tennr
Is Seattle a 15-Minute City? It Depends on Where You Want to Walk
Ismael Clemente (CEO De Merlin): "Bajar Los Tipos De Interés Demasiado Pronto Puede Ser Peligroso"
Israel Has a History of Killing Hamas Leaders Who Are Trying to Secure Ceasefires
It Is Starting to Get Strange.
It May Be the Best TAG EDITOR So Far!
It's Time to Build
It's Time to Merge Analytics and Data Engineering
Jevons paradox
Joshua Angrist – Econometrics Is the Original Data Science
Judge Arena: Benchmarking LLMs as Evaluators
Jung’s Five Pillars of a Good Life
Just the Two of Us
Kafka on Friendship and the Art of Reconnection
KE Holdings: The Most Interesting Real Estate Company in the World
Keynes y los orígenes de la inversion en valor
Klarna AI Assistant Handles Two-Thirds of Customer Service Chats in Its First Month
Knowledge Graphs & LLMs: Fine-Tuning vs. Retrieval-Augmented Generation
Knowledge Retrieval Architecture for LLM’s
Knowledge Without Goodness Is Dangerous
La #Bonilista De Guillermo: ¿Un Problema Sin Solución?
La #Bonilista: El Círculo De Caca 💩
La advertencia de Johann Hari: "Hemos perdido el superpoder de nuestra especie y no es solo por culpa del móvil"
La Bonilista — El Proyecto De Dos Amigos 👥
La Bonilista — La Cláusula Que Nadie Debería Firmar 📄
La Bonilista — Liderazgo Del Bueno 🔄
La Bonilista: ¿Contratarías a Alguien Que Trabaja Para Un Amigo? 😬
La Distribución Es Más Importante Que El Producto
La Edad Subjetiva: El Misterio Por El Que Una Persona Se Siente Más Joven De Lo Que Es
La Equivocada Idea De Que Queremos Dejar Nuestros Teléfonos Móviles
La Formula 10-3-2-1-0 Quiere Que Duermas Mejor Y, De Paso, Volverte Mas Productivo
La Gran Crisis De Las Carreras De Humanidades en La Universidad: Más Informáticos, Es La Guerra
La Hipótesis Del Escalado De La Inteligencia Artificial Hasta Llegar a La AGI
La Humanidad Se Va a Reducir. Es Hora De Aceptarlo Y De Apostar Por Los Robots Camareros
La Incapacidad Para Formar Equipos
La Isla
La Ley De Vivienda Un Año Después: La Oferta Cae a Mínimos Y Los Precios Suben a Máximos
La Mayor Mentira Sobre Product Management
La Mayoria De Las Empresas Confunden Eficiencia Operativa Con...
La OMS Recomienda “Mapas Urbanos De Calor” Para Reducir Las Víctimas Del Calentamiento Global
La Trampa De Las Pruebas De Concepto (PoC) en Proyectos De Machine Learning
Landlords Now Using AI to Harass You for Rent and Refuse to Fix Your Appliances
LangChain + Aim: Building and Debugging AI Systems Made EASY!
LangSmith. A Review of How to Make Interaction With LLM Prompts Easier
Large Language Labor Markets
Large language models are having their Stable Diffusion moment
Large Language Models Beyond Dialogue
Large Models Are Expensive to Fine-Tune on Downstream Tasks
Las Mujeres Jóvenes Van Más a La Universidad, Ganan Más Dinero Por Hora Trabajada Y Votan Más a La Izquierda Que Los Hombres Jóvenes
Las Provincias Desaparecidas De España
Las Ventajas De Que Tus Padres Aparezcan en Azul en La Wikipedia
Last Mile Data Processing With Ray
Lead Innovate and Grow With AI and Generative AI
Leadership Is a Research Project
Learning Path to Build LLM Based Solutions — for Practioning Data Scientists
Learning With Not Enough Data Part 1: Semi-Supervised Learning
Learning with not Enough Data Part 2: Active Learning
Learning with not Enough Data Part 3: Data Generation
Learnings From Fine-Tuning LLM on My Telegram Messages
Lessons from Peter Thiel
Lex-GPT
License to Call: Introducing Transformers Agents 2.0
Living Through the Next American Political Order: Institutions Will Comply, and You Will Be Made Complicit
Llama 2 Learns to Code
Llama 3.2: Revolutionizing Edge AI and Vision With Open, Customizable Models
Llama-3 8B 1xL4 24GB -63% VRAM
Llegar a Los 35 Años Y Empezar a Vigilar Los Niveles De Testosterona
LLM Agents
LLM Mixture of Experts Explained
LLM Powered Autonomous Agents
LLMs and Data-to-Text
LLMs to Transform Data
Local Spatial Autocorrelation
LoRA Training Guide
Lora: Low-Rank Adaptation of Large Language Models
Ⓜ️ New Mistral Large Beats Every LLM - GPT4
Machine Learning of Spatial Data
Machines of Mind: The Case for an AI-powered Productivity Boom
Make Timeline Tradeoffs Using Iterative Elimination Tournaments.
Make Your Peers Your First Team.
Making a Lora Is Like Baking a Cake.
Making Sense of Deming
Mamba Explained
Manage Process Before People
Management Seat Time
Managing People 🤯
Managing the First Year
Manual De Sabotaje De Organizaciones
Mapas Turísticos De España, Por Jacques Liozu
Mapping AI’s Rapid Advance
Mapping Travel Times With malariaAtlas and Friction Surfaces
Marketplaces Inmobiliarios en España- ¿Idealista Tiene Rival?
Master Thesis MIIS - 02-07-24.docx
Mastering Dashboard Design: From Good to Unmissable Data Visualizations
Mastering RAG Pipelines: A Guide to Optimisation and Hyperparameter Tuning
Measuring Children's and Adolescents' Accessibility to Greenspaces From Different Locations and Commuting Settings
Mechanisms for Effective Machine Learning Projects
Meeting People.
Memory and New Controls for ChatGPT
Memory in LangChain: A Deep Dive Into Persistent Context
Message From CEO Andy Jassy: Strengthening Our Culture and Teams
Microsoft Announces Feature That Records Everything You Do on Your Computer for AI
Microsoft Brags That With the Powers of AI, You'll Be Able to Attend Three Meetings at the Same Time
Microsoft builds the bomb
Microsoft Deploys Powerful New AI Completely Disconnected From the Internet
Microsoft: Unlocking AI Benefits Will Require Cultural Changes for Enterprises
Midjourney's Style Reference for Photographers
Millenarianism
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset With One Trillion Tokens
Mirascope-Python's Alternative to Langchain
Mis Libros Favoritos
Mistral Large 2
MLE vs. EM — What’s the Difference?
MLX-VLM
Model Commoditization and Product Moats
Modelplotr V1.0 Now on CRAN: Visualize the Business Value of Your Predictive Models
Modelprop: Empowering UK Estate Agents With AI
Molmo
Moonlighting Managers Ain’t Got No Time for Bullshit
More Design Patterns for Machine Learning Systems
More Than 50% of Managers Feel Burned Out
Most Data Work Seems Fundamentally Worthless
MSCI 2023 Real Estate Market Size
Multimodality and Large Multimodal Models
Muppy Raises €2.3m to Lead Spanish Flexible Rental Market
Music Industry Titan Targets AI, End-to-End Multimodality, Millions of Tokens of Context, More Responsive Text-to-Image
Musings on Building a Generative AI Product
Mxbai-Embed-Large-V1
My Advice for How to Use LLMs in Your Product.
My AI Lover
My Binary Vector Search Is Better Than Your FP32 Vectors
My Favorite Decision-Making Frameworks
My Review of the Apple Vision Pro After One Month
Nada Raises Post Seed and Appoints New CEO, to Connect Homeowners With Investors
Nadie Sabe Nada
Navigating the Complexity of Real Estate Asset Management
Navigating the Future of Office Space Utilization With Data
Necesitamos Más Y Mejores Engineering Managers
Nemawashi – Toyota Production System guide
New Autonomous Agents Scale Your Team Like Never Before
New Models and Developer Products Announced at DevDay
New Phi-3 Models: Small, Medium and Vision
New Week #109
NEW: it’s become popular to say that supply and demand...
No sacred masterpieces
No Time to Lead? Then Be Prepared to Fail.
No Wrong Doors.
No, Elon Musk Is Not Our Henry Ford
No, Sora Has Not “Learned Physics”
Noam Chomsky: The False Promise of ChatGPT
Noob's Guide to Using Automatic1111's WebUI
Nosotros, Los Luditas
NotebookLM now lets you listen to a conversation about your sources
Notes on OpenAI's New O1 Chain-of-Thought Models
Nuestra Civilización Se Acaba: Disfruta Del Espectáculo
Numbers to Know for Managing
NVIDIA Releases Open Synthetic Data Generation Pipeline for Training Large Language Models
NVIDIA Transitions Fully Towards Open-Source GPU Kernel Modules
NVLM: Open Frontier-Class Multimodal LLMs
NYC's Newest Unicorn - $75M Raise for AI-Driven Real Estate Comms Platform
Office Politics Isn’t Something You Can Sit Out
On AI Anthropomorphism
On Hiring, Rehiring, and One Question to Answer Them All
On Why LLMs Are Just Like Medical Doctors
One of the World Most Innovative Supply Chains
Online Marketplaces Exclusive: Zoopla CEO Charlie Bryant and COO Richard Hayes
Open Data Maps for AWS
Open Source AI Is the Path Forward
Open Source Hooliganism and the TypeScript Meltdown
Open-LLMs - A List of LLMs for Commercial Use
OpenAI Deactivates All Real Estate Plugins — Redfin and Zillow Affected
OpenAI DevDay: Let’s Build Developer Tools, Not Digital God
OpenAI Licenses News Archives, Generative Coding From Plan to Pull Request, Recognizing Landmines, Streamlined Inference
OpenAI Scientist Ousted After Failed Coup Against Sam Altman Is Starting a New AI Company
OpenAI: Introducing Structured Outputs in the API
Opinion: 5 Things I’d Like to See More of on Real Estate Portals
Opinion: Portal AI Bandwagon Needs to Fix Data Before Fixing Search
OPP
Organizational Prototyping With Design Fiction
Oscar, an Open-Source Contributor Agent Architecture
Our Approach to Human Data Annotation in the Age of Gen AI
Over 25% of Google’s Code Is Now Written by AI—and CEO Sundar Pichai Says It’s Just the Start
Overpromising & Stumbling Bambis
Palantir’s Military AI Tech Conference Sounds Absolutely Terrifying
Pandas 2.0 and Its Ecosystem
Paper Break: HyperDreambooth
Patchwork #5. Excel, LLMs, Turbocapitalismo Y Otras Fuentes De Ansiedad
Pchunduri6/Rag-Demystified
Philosophy for Happiness, Q1 Relationships, & More
Phone Provider Deploys "State-of-the-Art AI Granny" to Waste Scammers' Time
Plan-and-Execute Agents
Planning for AGI and Beyond
Platform Products for Machine Learning
Play in Control - ControlNet Training Setup Guide
Playing for Ourselves
Please Refer to [About Training Data preparation](./train_README-ja.md).
Plentiful, High-Paying Jobs in the Age of AI
Pluralistic: Three AI Insights for Hard-Charging, Future-Oriented Smartypantses
Pokémon Go Players Have Unwittingly Trained AI to Navigate the World
Pop Song Generators, 3D Mesh Generators, Real-World Benchmarks, AI for Manufacturing
Por Qué, De Pronto, Estamos Dejando De Vernos Cara a Cara
Portable Quarto Reports
Portal War ‘24: Traffic Is a Non-Zero-Sum Game
POSSE: A Better Way to Post on Social Networks - The Verge
Post-Tuning the Decision Threshold for Cost-Sensitive Learning
Powering Feature Stores With ClickHouse
PPW Notes Final
Practical Tips for Finetuning LLMs Using LoRA
Prices for Used EVs Are Cratering
Private Cloud Compute: A New Frontier for AI Privacy in the Cloud
Probablemente El Móvil Y Las Plataformas Sean Un Gran Problema Para Los Chavales
Procrastination and the Fear of Not Being 'Good Enough'
Product Engineers
Product Engineers: El Secreto Mejor Guardado De Las Startups Con Buena Cultura De Producto
Producto Debe Entregar Con Calidad Y en Tiempos, ¿por Qué Cuesta Tanto?
Prompt Engineering
Prompt Engineering vs. Blind Prompting
Property Finder CEO & Founder, Michael Lahyani Reveals Portal's Beginnings and Future Plans in Candid PPW Pod Interview
Property Finder Launches SuperAgent, MENA’s FIRST AI- Driven Ranking System for Agents
Property Finder Raises $90m to Buy Out Early Investor and Continue Western Growth
Protegeme De Lo Que Quiero
Prototype-based models for real estate valuation
QASource’s Comprehensive Guide to Chatbot Testing
Question Answering on Documents Locally With LangChain, LocalAI, Chroma, and GPT4All
Quoting Dare Obasanjo
Quoting Paul Kedrosky and Eric Norlin
Quoting Yuval Harari, Tristan Harris and Aza Raskin
Qwen2-Vl
Qwen2-Vl: To See the World More Clearly
Qwen2.5-Coder Series: Powerful, Diverse, Practical.
Rafael Yuste, Ideologo Del Proyecto Brain: "La Humanidad Se Subira a La Chepa De La Inteligencia Artificial"
RAGs to Riches: Bringing Wandbot Into Production
Re-Implementing LangChain in 100 Lines of Code
Re-Ranking
Readers Absolutely Detest AI-Generated News Articles, Research Shows
Real Estate’s Hidden AI Revolution
reAlpha Acquires Hyperfast Title, to Vertically Integrate the Homebuying Process
Red-Teaming Large Language Models
Reduccion Del Error en Tests a/B
Reducing the Lottery Factor, for Data Teams
Reflections on Foundation Models
Reflections on Palantir
Reimagining LinkedIn in the New Era of AI
Reminiscing: The Retreat to Comforting Work.
Renting Forever and Trying to Create a Strong Financial Future
Rerankers and Two-Stage Retrieval
Rerankers: A Lightweight Python Library to Unify Ranking Methods
Researchers Say There’s a Vulgar but More Accurate Term for AI Hallucinations
Rethinking Property Management: How AI Is Transforming Maintenance, Documentation, and Compliance
Rethinking Real Estate - 'Space-as-a-Service'
Retrieval
Retrospectives Antipatterns
Revenge of the Humanities
Revolutionizing Search: How Hypothetical Document Embeddings (HyDE) Can Save Time and Increase Productivity
Rightmove Acquires Reviews Platform HomeViews
Rise Europe, Una Alianza Internacional Para Impulsar El Emprendimiento Tecnológico
RLHF and Alternatives: ORPO
Role and Mandate of National Productivity Boards
Rough FAQ for 東方Project AI
Run Your Data Team Like a Product Team
Running an Engineering Reorg
SAM + Stable Diffusion for Text-to-Image Inpainting
Sam Altman Ignoring Scarlett Johansson's Lack of Consent Shows Us Exactly What Type of Person He Really Is
Sam Altman Replaces OpenAI’s Fired Safety Team With Himself and His Cronies
Sampling with SQL
Scale AI Raises $1B / First European AI Rules to Take Effect in Weeks / Google to Show Ads in AI-generated Search Summaries
Scaling GAIA-1: 9-Billion Parameter Generative World Model for Autonomous Driving
Scaling Monosemanticity: Extracting Interpretable Features From Claude 3 Sonnet
Scaling: The State of Play in AI
Scatter
sd-scripts/docs/train_lllite_README.md at Main · Kohya-Ss/Sd-Scripts
SDXL in 4 Steps With Latent Consistency LoRAs
SearchGPT Prototype
Searching for the Exit Routes
Sébastien Dubois
SELF-CONSISTENCY IMPROVES CHAIN OF THOUGHT REASONING IN LANGUAGE MODELS
Self-Serve Analytics and Other Corporate Accountability Sinks
Self-Supervised Learning With Vision Transformers
Sentence Transformers and Embeddings
SentriLock Partners DirectOffer to Incorporate AI to Access Management Solutions
Setting Engineering Org Values.
Seven Failure Points When Engineering a Retrieval Augmented Generation System
Shape-Up
Share Your Data Insights to Engage Your Colleagues
Should You Call People Resources?
Should You Measure the Value of a Data Team?
Sierra Speaks
Similarities and Differences Between Evergreen Note-Writing and Zettelkasten
Simple Workflow That Is Fast and Highly Detailed
Simpson's Paradox and Existential Terror
Single-Threaded Leaders at Amazon
Sizing Engineering Teams.
Skyrocketing Delistings and the Pricing Imbalance
Sleep Paralysis
SmolVLM - Small Yet Mighty Vision Language Model
Sneaky Virus Uses ChatGPT to Send Human-Like Emails to Your Contacts to Spread Itself
So AI Won't Scale, Now What?
Sobre Raíles
Something New: On OpenAI's "Strawberry" and Reasoning
Sora, Groq, and Virtual Reality
Spanish Co-Living PropTech, Enso, Raises €8.2M to Expand in the US and Mexico
Spanish Developer Neinor Acquires Stake in Habitat
SQL Is All You Need
Stable Diffusion 3.5 Large Fine-Tuning Tutorial
Stable Diffusion Ultimate Guide Pt. 7: Tips and Tricks
Stable-Diffusion-Guide
StackLlama: A Hands-on Guide to Train LlaMa With RLHF
Stanford Alpaca, and the Acceleration of on-Device Large Language Model Development
Stanford Researcher Examines Earliest Concepts of Artificial Intelligence, Robots in Ancient Myths
State of AI Report - 2024
State of the Software Engineering Job Market in 2024
Statistical Process Control: A Manager’s Guide
Stay Out of the Uncanny Valley
Staying on the Path to High Performing Teams.
Steel, Servers and Power: What it Takes to Win the Next Phase of AI
Steve Jobs Swore the 10-Minute Rule Made Him Smarter. Modern Neuroscience Is Discovering He Was Right
Stonal Raises €100M~ From Aareon for Real Estate Data Management
Stop Doubling Down on your failing strategy
Stop Trying to Make a "Good" Social Media Site
Structured Outputs
Super Reimagined Hi-Res Upscaling With Magnific 🪄
Supercharging Discovery in Search With LLMs
Supercharging ML Workflows: Integrating Metaflow With Ray
SVPG Newsletter: Good Product People
Tantek Çelik
Teachers Are Leading an AI Revolution in Korean Classrooms
Teaching Language Models to Use Tools
Team Charters Are a Trap.
Team Topology for Machine Learning
Tech Spec
Tech Spec Review
Technical Considerations for Complex RAG
techniques_to_improve_reliability.md
Techstructive Blog
Teen Dies by Suicide After Becoming Obsessed With AI Chatbot
Tenets at Amazon
Terror in the Skynet
Textual Inversion / Embedding Training Guide
TGI Multi-LoRA: Deploy Once, Serve 30 Models
The “It” in AI Models Is the Dataset.
The /Ai 'Manifesto'
The Advent of the Open Data Lake
The Age of AI Has Begun
The AI Workforce Is Here: The Rise of a New Labor Market
The AI-First Marketplace
The Art of Focus – New York Times David Brooks
The Art of Pushback for Data Product Managers and Leaders
The Best Alternative to GitHub Copilot: Continue.dev + Free AI
The Birth of chDB
The Brain’s Twilight Zone: When You’re Neither Awake Nor Asleep
The Briefing: Mike Tyson, Jake Paul and Netflix
The Cathedral Effect, Joy Generator, & More
The Changelog Podcast: LLMs Break the Internet
The Complex and Fascinating Journey to Build AI Products
The Concept of the Ruliad
The Creepy New Digital Afterlife Industry
The Data Science Mentor
The Data Team Playbook: 50+ Resources for High-Performing Data Teams
The DelPrete Probability Paradox
The Developer Voice Guide: How to Write Articles Engineers Want to Read
The Dopamine Economy
The Dual LLM Pattern for Building AI Assistants That Can Resist Prompt Injection
The Dumb Reason Your AI Project Will Fail
The End of Middle Management: Analytics and Automation Are Replacing Entire Leadership Layers
The Enlightenment Trap
The Era of Inescapable AI
The Expanding Dark Forest and Generative AI
The Final Sprint in the Generative Interior Design Challenge - New Dev Data Released
The First Step to Feature Scaling Is NOT Feature Scaling
The Future of Education in a World of AI
The Future of Lineage
The Hidden Costs of Complexity in Data Science
The History of Pets vs Cattle and How to Use the Analogy Properly
The Illustrated Guide to a Ph.D.
The Illustrated Transformer
The Internet of Maps and Oracles
The Knowledge-Creating Company
The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
The Looking Glass: Get Over Yourself
The Looking Glass: Managing Your Manager
The Looking Glass: Sharpening Judgement
The Looking Glass: So You Want to Write Better?
The Looking Glass: The Craft of Creating
The Looking Glass: The Curse of Perfect
The Looking Glass: The Paradoxes of Data
The Looking Glass: The Year of Everyday Risks
The Looking Glass: What Company Politics Actually Is
The Low-Trust Election
The Magic of Small Engineering Teams
The Man Who Thought Too Fast
The ManfredDeveloper Career Report
The Matthews Correlation Coefficient (MCC) Should Replace the ROC AUC as the Standard Metric for Assessing Binary Classification
The Maze Is in the Mouse
The Medieval Notion That Shows Why Even Experts Should Be Humble
The Most Common Misconception About Python OOP
The Most Important Decision of Your Life
The Motivation Behind Using KernelPCA Over PCA for Dimensionality Reduction
The Neoliberal Tenant Dystopia: Digital Polyplatform Rentierism, the Hybridization of Platform-Based Rental Markets and Financialization of Housing
The OpenAI Board Was Right
The Organization as a Network of Teams
The Pandemic Didn’t Upend US Geography
The Pentagon Wants to Flood Social Media With Fake AI People
The Pleasure of Being Left Alone
The Polars vs Pandas Difference Nobody Is Talking About
The Power of Defaults
The Problem With Founders
The Problem With LangChain
The Product-Minded Software Engineer
The Psychology Behind Meeting Overload
The Public Imagination
The Pursuit Map: 3 Steps to Choose Your Life Pursuits
The Question of Nine, Return on Hassle, & More
The Religion of Techno-Optimism
The Right Kind of Stubborn
The Rise (And Fall?) of Data Debt
The Semantic Apocalypse
The Story of Three Bricklayers – A Parable About The Power of Purpose
The Terrible Costs of a Phone-Based Childhood - The Atlantic
The Therapeutic Potential, and Addictive Lure, of Losing Yourself
The Truth About Happiness
The Ultimate Guide on Engineering Operations | Ryan Atkins
The Ultimate Guide to Camera Shots
The Ultimate Guide to Hiring Your Data Team
The Ultimate Guide to Writing Online
The Unrealised Promise of HTAP
The Washington Post Tells Staff It’s Pivoting to AI
The Window-Knocking Machine Test
There's a Small Problem With the AI Industry: It's Making Absolutely No Money
There's Something Deeply Wrong With Perplexity
Think of Language Models Like ChatGPT as a "Calculator for Words"
Thinking About High-Quality Human Data
Thinking Companion, Companion for Thinking
Thinking Place
This ChatGPT Plugin Is Truly Groundbreaking
This Stock Is Crushing Salesforce, MongoDB and Snowflake in AI Revenue
Thoughts on Techno-Optimism
Tiko Acquires Housell to Redefine the Real Estate Landscape in Iberia
Time to Check Your LinkedIn Settings
Time to Upgrade Your Monitor
Tímidos Éxitos Del Anti Turismo Cuando El Cine Ya No Es Central en Nuestra Cultura
Tinder Es El Nuevo Idealista: Así Se Usa Para Buscar Un Sitio Donde Vivir
To Build Muscle, It’s the Sets That Count
To Thrive, Children Need to Experience Awe – And You Can Help
To Worry Is to Work
Too Many Meetings Is Not Your Problem
Tool Use
Toolformer: LMs can teach themselves to use tools
Train and Run Stanford Alpaca on Your Own Machine
Train Your ControlNet With Diffusers
Training a LoRA Model for Stable Diffusion XL With Paperspace
Training Machine Learning Models With ClickHouse and Featureform
Transformer Models: An Introduction and Catalog
Try Unrolling a Thread Yourself!
Turing Test
Turning the Tables on AI
Two Ways of Measuring Data Team Value
Types of Data Products
Ultra fast ControlNet with 🧨 Diffusers
Un Día Cualquiera
Un Eclipse Solar Tiene La Culpa De Que Mis Planes De Verano en 2026 Pasen Por Soria
Un error que cometimos en el equipo fue asumir que...
Underrated Ways to Change the World
Understanding Business Needs - Staying Relevant as a Data Team
Understanding CUPED
Understanding LoRA With a Minimal Example
Understanding Multimodal LLMs
Understanding Question Answering
Understanding Visual Instruction Tuning
Unexpected Tips for Data Managers
Unpacking the Buzz Around ClickHouse
UPerNet
Use After Detailer and LoRA to Control Face
Useful to the Point of Being Revolutionary: Introducing Wolfram Notebook Assistant
Using and Finetuning Pretrained Transformers
Using LoRA for Efficient Stable Diffusion Fine-Tuning
Using NLP to Automate Customer Support, Part Two
Using Sentence Embeddings to Automate Customer Support, Part One
Using Uv to Develop Python Command-Line Applications
Uv Under Discussion on Mastodon
VAE RAW to Obtain Greater Detail
Validating LLM Outputs
Valkey's Expanding Ecosystem
Valuebase Raises $6.3m for Data-Driven Property Valuations
Vector Database Benchmarks
Version 1 Is for You
Vision Language Models: Everything About It
Visualizing a Neural Machine Translation Model
Vivienda Presenta El Nuevo Índice De Precios Para Poner Límites a Los Alquileres Y Entrará en Vigor El 13 De Marzo
Vuelve El Malthusianismo, Vuelve La Eugenesia
Wabi-Sabi
Wall Street Wants Your Home
WandBot: GPT-4 Powered Chat Support
Want to Improve Your Memory? Try These Unexpected Tips.
Warren Buffett Saw an AI That Scared Him
We Finally Have an ‘Official’ Definition for Open Source AI
We re All the #dataBS, but Individually Not
We’re Already Living in the Post-Truth Era
We’re the New Renewables
Weaviate 125 Release
Welcome 🎉 Let's Confirm Your Email!
Welcome FalconMamba: The First Strong Attention-Free 7B Model
Welcome, Gradio 5
What Apple's AI Tells Us: Experimental Models⁴
What Are Real Estate Aggregators and Are They About to Disappear?
What Are the Chances of Homes.com and OnTheMarket Dethroning Zillow and Rightmove to Become Costar’s Market Leaders?
What Are Tracer Bullets in Software Development?
What Are Word and Sentence Embeddings?
What Can We Remove?
What Does It Actually Take to Build a Data-Driven Culture?
What Exactly to Caption for Flux LoRa Training?
What Happens When AI Reads a Book 🤖📖
What I Do Before a Data Science Project to Ensure Success
What Is "Data as a Product" Really?
What is a Transformer?
What is a Vector Database?
What Is Denoising Strength?
What Is LyCORIS and How to Use Them in Stable Diffusion
What Is Prompt Optimization?
What Is Rightmove's Strategy Against the Threat From OnTheMarket and CoStar? We Analyse the Portal's Capital Markets Day
What Kind of Writer Is ChatGPT? | the New Yorker
What We Gain by Recognising the Role of Chance in Life
What We’ve Learned From a Year of Building With LLMs
What, How, and When: Personalizing Commercial Offers With AI
What's Next for AI Agentic Workflows Ft. Andrew Ng of AI Fund
Whatever, Do the Secondary Sale
When and Why to Automate: A Data Engineer's Perspective
When Product Markets Become Collective Traps: The Case of Social Media
When Product Teams Think Anecdotes Is Research
When to Choose CatBoost Over XGBoost or LightGBM [Practical Guide]
When to Write Strategy, and How Much?
Where Does Kudos Come From? The Origin of Kudos
Who killed non-contrastive image-text pretraining?
Who Runs Engineering Processes?
Who’s Behind All the ‘Pussy in Bio’ on X?
Why A.I. Isn’t Going to Make Art | the New Yorker
Why AI Companies Are Recruiting Hackers for Help 💻
Why AI Will Save the World
Why AI Won't Cause Unemployment
Why Are We Using LLMs as Calculators
Why backlogs are harmful, why they never shrink, and what to do instead
Why books don’t work
Why Chatbots Are Not the Future
Why Did I Leave Google Or, Why Did I Stay So Long?
Why Do We Say “Top Management” Yet Never “Bottom Management”?
Why Engineers Should Focus on Writing
Why Is “Data Scientist” Such a Controversial Title?
Why Is Paid Social Media a Bad Idea?
Why It's Easier to Manage 4 People Than It Is to Manage 1 Person
Why Local?
Why OpenAI May Well Be Completely Zuck’d
Why You Need a "WTF Notebook"
Why You Should Be Prototyping
Why Your Company Needs Data-Product Managers
Will We Ever Have Clean Data?
Windows Returns
Winning Over Hearts and Minds at Work: ADKAR My Favorite Change Management Approach
Wisereads Vol. 43 — Apple's Private Cloud, Cal Newport on Note-Taking, and More
Wisereads Vol. 47 — Bob Doto's System for Writing, Sequoia on AI's $600B Question, and More
Wisereads Vol. 63 — Dr. Ali Binazir's 5 Hidden Love Questions, 18 Life Learnings From Maria Popova, and More
Wisereads Vol. 68 — Main Street Millionaire by Codie Sanchez, Facebook's Little Red Book, and More
WizardLM 2
Wolfram|Alpha as the Way to Bring Computational Knowledge Superpowers to ChatGPT
Worse is better
Write Code With Your Alphabet Radio On
Writer Alarmed When Company Fires His 60-Person Team, Replaces Them All With AI
Writing Strategies and Visions.
X-Ray of Vienna Social Housing
XetHub Is Joining Hugging Face!
XLSCOUT Unveils ParaEmbed 2.0: A Powerful Embedding Model Tailored for Patents and IP With Expert Support From Hugging Face
YaFSDP
You Are Probably Building Inconsistent Classification Models Without Even Realizing
You Are What You Read, Even if You Don’t Always Remember It
You Are Your Body: Here’s How to Feel More at Home in It
You Can’t Sit Out Office Politics
You Can’t Spell Diffusion without U
You Need Two Leadership Gears
You’re Never Going to Be “Caught Up” at Work. Stop Feeling Guilty About It.
Young Workers Don't Want to Become Managers — And This Study Uncovers the Reason Why.
Your AI Product Needs Evals
Your Estimates Suck
YouTubers Furious After Apple and Anthropic Steal Their Data to Train AI
Zillow and Moody’s Team Up to Enhance Multifamily Rental Analytics
Zillow Group Acquires AI Company for Virtual Staging
Zillow’s Most Interesting Product
Zillow’s Transition to “Super App” Driving Revenue Growth
Zillow’s Upgraded AI Search Will Show You More Homes You Can’t Afford - The Verge
Zoopla CEO Has "No Desire" for Portal War With Rightmove and OnTheMarket
þÿSeven Failure Points When Engineering a Retrieval Augmented Generation System
Books
10% Happier
An Elegant Puzzle. Systems of Engineering Management
Austrian Perspective on the History of Economic Thought
Build. An Unorthodox Guide to Making Things Worth Making
Creativity, Inc.
Crucial Conversations. Tools for Talking When Stakes Are High
Data for All
Deep Work
Designing Machine Learning Systems. An Iterative Process for Production Ready Applications
Ego Is the Enemy
Four thousand weeks. Time management for mortals
Good strategy Bad strategy
Good to Great
High Output Management
High Output Management
How to Fail at Almost Everything and Still Win Big
How to lead in Data Science
How to take smart notes
How to Win Friends and Influence People
Inspired How to Create Products Customers
Las Gafas De La Felicidad
Leaders Eat Last. Why Some Teams Pull Together and Others Dont
Leadership BS
Leading change
Leonardo Da Vinci
Manna
Mindfulness
Misbehaving
Never split the difference
Peace Is Every Step
Peopleware. Productive Projects and Teams
Power. Why some people have it and others dont
Problem Solving Estrategico
Radical Candor Be a Kick-Ass Boss Without Losing Your Humanity
Real Happiness
Scaling People. Tactics for Management and Company Building
Sprint. How to Solve Big Problems and Test New Ideas in Just Five Days
Talking to Strangers. What We Should Know About the People
Team of teams
The Culture Code
The Five Dysfunctions of a Team
The Manual of Design Fiction
The Phoenix Project
The Winter of Frankie Machine
Think Again the Power of Knowing What You Don't Know
Tomorrow, and Tomorrow, and Tomorrow
Trillion Dollar Coach. The Leadership Playbook of Silicon Valley Bill Campbell
Un Caballero en Moscú -- Towles, Amor -- 2016 -- A431e0bdff91b0db80abcd2954075792 -- Anna’s Archive
Courses
A Roadmap for Creating a Data Literacy Program
Building and Evaluating Advanced RAG
ChatGPT Prompt Engineering for developers
Creacion de una organizacion preparada para la IA Generativa
Langchain-Chat-with-your-data
Multi AI agent systems with CrewAI
Multi AI agent systems with CrewAI
The Data Strategy Course. Building a data-driven business
Documentaries
Light and Magic
Podcasts
Corporate culture and peak human performance with Dr Larry Senn
El fin de la interfaz
Langchain Agents in Production Webinar
Langchain Document QA Webinar
Management Humanista
PPW Podcast Episode 4. The Future of Real Estate Search
PPW PPW Real Estate Portals and AI
Talks
Creando experiencias personalizadas con ciencia de datos con Christine Doig-Cardet Netflix
Ds Managers Guide. Managing Stakeholders
Sin Machirulos no hay paraiso
Map Of Contents
AI
Bio
Digital Garden
Management
Management
Now
Public Appearances
Research
Permanent Notes
attachments
Data Science Fundamentals
Resources
A/B Testing
Business Understanding
Causal Inference
Communicate with impact
Data Science Ethics
Data Visualization
Good Data Analysis
Introduction to Computer Science
Linear Algebra
Machine Learning 101
Numerical optimization
Programming Language
Rules of ML
Shell Script and others
SQL
Statistical Learning
Statistics 101: Probability
Technical Writing
The Data Science Process
The Ultimate Guide to Deploying ML Models
Time Series Analysis
Time Series Analysis
10 Years Later. Lessons from My PhD Experience
A Balanced Approach to Seeking Help
Advanced Course on Product Engineering
Agents explained
Agile for Data Science
AI Enhanced Knowledge Management
Bayesian reasoning
Be helpful
Bluesky feels like a breath of fresh air for data folks
Building to Forecast in Data Science
Buscas tu primer empleo de Ciencia de Datos?
Career advice on skill acquisition
Change Resistance
Change Resistance as a Corporate Autoimmune Disease
Como contratar DS y no desesperar en el intento
Corporate antibodies
Data is not objective
Data Paranoid
Data Science Fundamentals
Data Science job crafting
Data teamwork as a transport service
December Always Hits Hard
Deferred Responsability
Different managerial styles
Dont get too rusty
Dopamine rush
DRI
Embracing Incompetence
Energy Management Confession
Essential Books for New Managers in Tech
Explaining AI-infused products
First solo data scientist
Fostering collaboration between teams
Generating images with your LoRA like a Pro
Glue work
Growth mindset
Headspace for managers
How I Manage Myself and My Team Using Obsidian Tasks
Ideal data to solve a problem
Internal Networking
LoRA. Low-Rank Adaptation of LLMs
MacBook Pro preparation for SD training and inference
Make'em talk with prototypes
Manage the data before thinking of AI
Mentoring as a form of leverage
Mentors and me
My failure resume
My workflow for my public second brain
New icon for the blog
No Data Product Management
No public speaking in 2024
Of innocents and criminals
Office hours
Other People Problems
Owl Drawing and Data Generation
POSSE against Platform Nudges on Content Creation
Public Speaking is a Game-Changer for Networking
Radical Candor and Crucial Conversations
Rationality takes us closer to the truth
Rethinking Our Contributions to Social Media Platforms
Reversible and irreversible decision making
Rock stars vs Superstars
Short term and long term metrics
Stable Diffusion technicalities
Sucessful Model
Taming Impostor Syndrome
Team size trade-off. Coordination costs Vs collective intelligence
The bitter feeling of publishing a Peer-Reviewed Paper Again
The Data Vantage Point
The Power of Yet
The Rational Company
The Rise of the Dataset Engineer
There is always going to be something you cannot fix
Time to manage
Training a LoRa of your face with Stable Diffusion 1.5
Training a Personal LoRA on Replicate Using FLUX.1-dev
Two books I wish I had read before starting my PhD
Understanding low level data science
Verbund
Verbund in Data Science
When Management Communication Techniques Enter Personal Life
Why You Should Dive into Hand-Labeling Yourself
Write well to solve problems
You need a growth mindset to get honest feedback
Photography
Photography
Public Appearances
2019
Databeers-XXX
Busco Pisco
Spanish Cadastre h2o
Boosting Spanish Cadastre with Machine Learning
2020
Doing-DS-Nuevos-Profesionales-Digitales
Doing Data Science: Lessons Learned
Pensamiento-Digital
Podcast Pensamiento Digital
Spatial Autocorrelation
Spatial Autocorrelation is everywhere
2021
buscando-vocaciones
Buscando Vocaciones
business-applications-DS-uam
Business Applications of Data Science
carto-2021
Automatic Valuation of Spanish Cadastre
nova-ds-beyond-the-hype
Data Science beyond the Hype
open-expo-mesa-redonda-ai
El futuro inmediato (y real) de la Inteligencia Artificial
x-talks-ai
Interview at podcast xTalks.AI
2022
BdE-2022
ML en modelos hedonicos de valoracion inmobiliaria
Cruzando-datos-2022
Datos cruzados
enel-ninja-talk
Está la casa de mis sueños sobrevalorada o es un chollo
epi-gijon-lecciones-aprendidas
Lecciones aprendidas haciendo Ciencia de Datos
geoawesomeness
Using location Data to create amazing user experiences online
nuclio-data-science-sin-humo
Data Science Sin Humo
talent-hackers-interview
Data Science Sin Humo
2023
data-on-the-rocks
Lecciones Aprendidas haciendo productos de datos
de-economistas-a-ds
De Economistas a Data Scientists
dive-data
Data Science al descubierto
luce-gijon
Inteligencia Artificial, smart cities y uso de datos
mesa-redonda-ai
De Economistas a Data Scientists
mioti-ds-mitos
Data Science al descubierto
Research
A dynamic approach to road freight flows modeling in Spain
A geo-referenced micro-data set of real estate listings for Spain’s three largest cities
A Stochastic Frontier Analysis Approach for Estimating Energy Demand and Efficiency in the Transport Sector of Latin America and the Caribbean
Determinants of ground transport modal choice in long-distance trips in Spain
Intra-urban house prices in Madrid following the financial crisis, an exploration of spatial inequality
The spatial productivity of transportation infrastructure
Using machine learning to identify spatial market segments
Home
❯
Public Appearances
❯
2023
❯
dive data
Folder: appearances/2023/dive-data
1 item under this folder.
Dec 20, 2024
Data Science al descubierto
speaking