Posts

Showing posts from December, 2025

Week 8 - BALT 4364 - Language Models

Image
       Chapter 8 takes a deep dive into Large Language Models (LLMs), one of the biggest breakthroughs in artificial intelligence. These models—like ChatGPT and DALL·E 2—are changing the way machines understand and generate human-like text and images. In this chapter, I explore how LLMs work, how they’re trained, and why they matter, along with a hands-on exercise to help me actually apply what I’m learning. LLMs are built on transformer architecture, which allows them to recognize patterns and relationships in language. That’s what makes them so powerful for tasks like answering questions, summarizing long passages, writing emails, or even creating poetry. The chapter also breaks down the two major stages of training: pre-training, where the model learns general language patterns from massive amounts of text, and fine-tuning, where it’s shaped for a specific task. Understanding this process helps me see why LLMs are so capable—and where their limitations come from. ...

Week 7 - BALT 4364 - TensorFlow vs. PyTorch — What It’s About and Why It’s Usef

Image
TensorFlow vs. PyTorch — What It’s About and Why It’s Useful             When you get into deep learning, the two frameworks you hear about the most are TensorFlow and PyTorch. Both are powerful, open-source tools, but each one has its own strengths. Knowing the difference helps you decide which one fits your goals. TensorFlow was created by Google, while PyTorch was developed by Facebook. TensorFlow has a bigger ecosystem with lots of pre-trained models and tools, but PyTorch has grown fast, especially in research. Many people find PyTorch easier to learn because it works more like regular Python, making it simple to test ideas and fix errors. TensorFlow used to be harder to use, but TensorFlow 2.0 made it much more beginner-friendly. For deployment, TensorFlow is usually the better choice because it has strong tools for servers, mobile devices, and even web apps. PyTorch has deployment options too, but they’re not as mature. TensorFlow also has Tenso...