Elena's AI Blog

Neural Network Architecture Series

Elena Daehnhardt

Image credit: Illustration created with Midjourney, prompt by the author.
Image prompt: “An illustration representing cloud computing”

This series explores how neural network architecture evolved from residual connections to modern experiments with dense links, transformers, and automated design. Each post builds on the previous one while still standing on its own.

Series Progress

1 of 9 posts published


All Posts in This Series

Part 0: Artificial Neural Networks

Artificial neural networks (ANNs) are the cornerstone of deep learning algorithms. Both the name and the architecture are borrowed from the neural networks of the human brain. ANNs are designed to simulate human reasoning based on how neurons communicate. An ANN consists of a set of interconnected artificial neurons.

Part 1: Understanding Neural Network Architecture: From Basics to Residual Connections

Coming Soon

Before diving into cutting-edge architectural innovations, understand the foundations: how neural networks are structured, why depth creates problems, and how residual connections solved them.

This post is currently being written and will be published soon.

Part 2: DeepSeek's mHC: Making Neural Networks Learn Better by Preserving Identity

Coming Soon

DeepSeek's mHC architecture solves a fundamental tension in neural networks: how to allow rich information flow between layers while maintaining training stability. The result is measurably better performance on reasoning tasks.

This post is currently being written and will be published soon.

Part 3: Dense Connections and Information Flow: From DenseNet to Modern Variants

Coming Soon

Explore dense connectivity patterns, why they improve gradient flow, and when dense connections outperform residual links.

This post is currently being written and will be published soon.

Part 4: Attention Mechanisms Meet Residual Connections: How Transformers Use Skip Connections

Coming Soon

A practical guide to residual connections in transformers: equations, Pre-LN vs Post-LN trade-offs, failure modes, and implementation details for stable training.

This post is currently being written and will be published soon.

Part 5: Neural Architecture Search: Automating the Discovery of Better Connections

Coming Soon

How NAS explores connection patterns and what it reveals about architecture design.

This post is currently being written and will be published soon.

Part 6: Bottlenecks, Inverted Residuals, and Mobile Architectures

Coming Soon

How mobile architectures adapt residual designs to fit edge devices.

This post is currently being written and will be published soon.

Part 7: The Future of Connections: From Manifold Constraints to Dynamic Routing

Coming Soon

A survey of emerging connection patterns and where architecture research may head next.

This post is currently being written and will be published soon.

Part 8: Transformers in Practice: From Architecture to a Chatbot Implementation

Coming Soon

A practical bridge from transformer architecture concepts to implementation: build and evaluate a chatbot workflow with TensorFlow and HuggingFace, with clear trade-offs and deployment guidance.

This post is currently being written and will be published soon.

Getting Started

New to this series? Start with Part 0: Artificial Neural Networks.

The posts build on one another, so reading them in order is recommended, but each one also works on its own if you need to jump to a specific topic.
