RAG Microservices: Getting to "AI Everywhere"
Chat opened up the floodgates for AI, but it's far from the future. The future is task-specific, interdependent models. Small context windows, reliably accurate, cheap and fast.
The AI Bifurcation
1. On one hand you have LLMs context windows expanding to the point where they'll swallow up your Snowflake Data Warehouse. This is the progression, as evident by Google's Gemini 1 million context window LLM.
2. On the other you have smaller, task-specific models that are fine-tuned to perform fas