In this post, I’m reviewing Making LLMs Better Many-to-Many Speech-to-Text Translators with Curriculum Learning (ACL 2025), a recipe that needs no exotic training tricks. The recipe, called LLM-SRT, keeps a frozen Whisper encoder, adds a lightweight speech adapter (Q-Former + MLP), and fine-tunes in three stages: (1) ASR, (2) speech-aided MT (SMT/MMT), and (3) the target many-to-many speech-to-text translation task (SRT).
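
To make the architecture concrete, here is a minimal PyTorch sketch of what such an adapter could look like: a set of learnable queries cross-attends over the frozen Whisper encoder states, and an MLP projects the result into the LLM's embedding space. The dimensions, the query count, and the single cross-attention block standing in for a full Q-Former are my assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

class SpeechAdapter(nn.Module):
    """Q-Former-style speech adapter (sketch, not the paper's exact module).

    Learnable query tokens attend over frozen Whisper encoder states;
    an MLP then projects the pooled queries into the LLM embedding space.
    """

    def __init__(self, enc_dim=768, llm_dim=4096, num_queries=64, num_heads=8):
        super().__init__()
        # Learnable queries -- their count fixes the length of the
        # speech prefix handed to the LLM, regardless of audio length.
        self.queries = nn.Parameter(torch.randn(num_queries, enc_dim) * 0.02)
        # One cross-attention block standing in for the Q-Former stack.
        self.cross_attn = nn.MultiheadAttention(enc_dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(enc_dim)
        # MLP projector into the LLM's hidden size.
        self.mlp = nn.Sequential(
            nn.Linear(enc_dim, llm_dim),
            nn.GELU(),
            nn.Linear(llm_dim, llm_dim),
        )

    def forward(self, enc_states: torch.Tensor) -> torch.Tensor:
        # enc_states: (batch, frames, enc_dim) from the frozen Whisper encoder.
        q = self.queries.unsqueeze(0).expand(enc_states.size(0), -1, -1)
        attn_out, _ = self.cross_attn(q, enc_states, enc_states)
        # Residual + norm, then project to LLM space:
        # output shape (batch, num_queries, llm_dim).
        return self.mlp(self.norm(q + attn_out))

if __name__ == "__main__":
    adapter = SpeechAdapter()
    # Dummy Whisper-small-sized features: 30 s of audio -> 1500 frames of dim 768.
    feats = torch.randn(2, 1500, 768)
    print(adapter(feats).shape)  # torch.Size([2, 64, 4096])
```

Only the adapter's parameters (plus whatever is unfrozen in the LLM) would be trained across the three stages; the encoder stays fixed throughout, which is what keeps the recipe lightweight.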