Artificial Intelligence Fundamentals

ForgEd deep-dive — computation, machine learning, large language models, ethics, and human-AI collaboration

Chapters: 10
Read time: ~105 min
Format: Textbook
Depth: Academic

Preface

Artificial intelligence is not a single invention but a long scientific and engineering project to build systems that perceive, reason, learn, and act in ways that resemble—or exceed—human cognitive performance on defined tasks. From early symbolic programs that manipulated logical formulas to today's large neural networks trained on vast corpora, the field has repeatedly redefined what 'intelligence' means in machines and what society should expect from automated decision-making.

This ForgEd textbook presents ten chapters that move from foundational ideas about intelligence and computation through the history of AI research, core machine learning and neural network theory, contemporary large language models and tool use, multimodal systems, ethical and safety considerations, workplace applications, and plausible futures for human-AI partnership. The tone is academic and explanatory: definitions, historical context, mechanisms, and implications—not hype or product marketing.

Readers should treat this material as general education, not as legal, medical, or professional licensing advice. AI capabilities change quickly; frameworks such as the NIST AI Risk Management Framework and OECD AI Principles are cited where they help structure responsible practice. Use the table of contents to study sequentially or to revisit specific topics before projects, coursework, or workplace conversations about AI adoption.

How to use this guide: scroll through all chapters in order, or jump via the table of contents. Each chapter includes learning objectives, cited sources, and section navigation—like a reference textbook, not a slideshow of bullet summaries.

Table of contents

Your progress

Chapter quizzes 0/10 · 0/50 sections read

Ch. 1 · 10mIntelligence, computation, and what machines can represent
Ch. 2History: From symbolic AI to deep learning
Ch. 3Machine learning foundations
Ch. 4Neural networks and deep learning
Ch. 5Large language models and generative AI
Ch. 6Prompting, tools, and agentic workflows
Ch. 7Computer vision, speech, and multimodal perception
Ch. 8Ethics, bias, safety, and governance
Ch. 9AI in the workplace and industry applications
Ch. 10The future of human-AI partnership

Read all sections in a chapter, then pass the chapter quiz to unlock the next chapter.

Course review quiz →

Jump to section

Chapter 1

Intelligence, computation, and what machines can represent

Foundations from Turing to rational agents

Estimated reading time · 10 min · Pass the chapter quiz below to unlock the next chapter

1.1 Defining intelligence in natural and artificial systems

Intelligence, in everyday language, suggests the ability to adapt, solve problems, and achieve goals in varied environments. Psychologists and cognitive scientists debate whether intelligence is a single general capacity (often labeled g) or a bundle of specialized faculties such as memory, language, spatial reasoning, and social inference. For artificial systems, researchers operationalize intelligence through task performance: can a program play chess, classify images, translate text, or plan routes under uncertainty?

Alan Turing proposed an influential behavioral test in 1950: if a human interrogator cannot reliably distinguish a machine's typed answers from a person's, the machine may be said to exhibit intelligent behavior for practical purposes. Critics note that the Turing Test emphasizes linguistic mimicry and can be gamed without genuine understanding. Modern benchmarks instead evaluate accuracy, robustness, calibration, and generalization across datasets and tasks.

A useful working definition for this course treats intelligence as goal-directed competence supported by representation, inference, and learning. Representation encodes facts and uncertainty; inference derives conclusions or plans; learning improves performance from experience. Whether a thermostat, spreadsheet macro, or billion-parameter neural network counts as 'intelligent' depends on the difficulty of the environment and the flexibility of the system's responses.

1.2 Computation, algorithms, and complexity

Every AI system runs on physical computers that execute algorithms—finite, step-by-step procedures that transform inputs into outputs. The Church-Turing thesis holds that any effectively computable function can be computed by a Turing machine, which grounds the mathematical study of what machines can, in principle, calculate. In practice, hardware limits, memory bandwidth, and energy consumption determine which algorithms are feasible at scale.

Computational complexity classifies problems by resource requirements. Class P contains problems solvable in polynomial time relative to input size; NP contains problems whose solutions can be verified quickly, though finding solutions may be exponentially hard. Many planning, scheduling, and combinatorial optimization tasks encountered in AI are NP-hard, motivating heuristics, approximation algorithms, and learned policies that trade optimality for speed.

Algorithm design in AI spans search (exploring state spaces), dynamic programming (reusing subproblem solutions), constraint satisfaction, probabilistic inference, and gradient-based optimization for neural networks. Understanding complexity helps explain why brute-force reasoning fails on large problems and why learning from data can outperform hand-coded rules when patterns are statistical rather than purely logical.

Key points

Algorithm: a precise procedure that terminates and produces an output for each valid input
State space: the set of configurations a problem solver can occupy while searching for a goal
Heuristic: a rule of thumb that guides search toward promising regions without guaranteeing optimality
Big-O notation: describes how runtime or memory grows as input size increases
Trade-off: exact reasoning versus scalable approximation or learned behavior

1.3 Representation, knowledge, and uncertainty

Intelligent behavior requires internal structures that stand for the world. Early AI used symbolic representations—facts in predicate logic, semantic networks, and frames—to support deductive inference. Expert systems encoded if-then rules distilled from human specialists. Such approaches excel when knowledge is stable and rules are explicit, but they struggle when perception is noisy and rules are incomplete.

Probability theory provides a language for uncertainty. Bayesian networks compactly represent joint distributions over many variables and support diagnostic reasoning. Markov decision processes model sequential decisions under stochastic transitions. Modern machine learning often learns distributed representations—vectors in high-dimensional space—rather than human-readable symbols, yet the underlying question remains: what does the system believe, and with what confidence?

Knowledge acquisition—the bottleneck of expert systems—motivated learning from data. Representation choices shape what a system can learn, explain, and audit. As AI enters regulated domains, the tension between opaque embeddings and interpretable structures becomes a design and governance issue, not merely a technical detail.

1.4 Limits of classical AI and the case for learning

Classical AI pursued general problem solvers and logical theorem provers. Results were impressive on toy domains but brittle in open worlds. The frame problem—how to represent what stays unchanged when an action occurs—illustrated the difficulty of commonsense reasoning with explicit logic alone. Perception, motor control, and language understanding resisted hand-crafted rule sets.

Moravec's paradox observes that high-level reasoning requires little computation in early robots, while low-level sensorimotor skills demand enormous resources. Children learn visual and motor skills through interaction; symbolic AI often skipped that grounding. Connectionism and later deep learning proposed that many competencies emerge from statistical regularities in data rather than from manually authored axioms.

Contemporary systems combine learning with structured modules: retrieval over documents, tool APIs, planners, and safety filters. The lesson of Chapter 1 is conceptual: intelligence in machines is not magic but engineered representation, search, inference, and learning—constrained by computation, data, and the environments we ask systems to master.

1.5 Agents, environments, and rational action

AI textbooks often model an agent—anything that perceives and acts—situated in an environment described along dimensions such as observability, determinism, episodic versus sequential structure, and single-agent versus multi-agent interaction. A rational agent selects actions that maximize expected utility given its percept history and built-in performance measure.

Simple reflex agents map percepts directly to actions; model-based agents maintain internal state; goal-based agents plan toward objectives; utility-based agents trade off competing goals under uncertainty. Learning agents improve the mapping from percepts to actions over time. Robotics and autonomous vehicles instantiate these ideas under real-time safety constraints.

Designers must align the performance measure with human values; a misspecified reward can produce harmful shortcut behavior, a theme revisited in ethics and safety chapters. Framing AI as agent design clarifies why data, sensors, actuators, and feedback loops matter as much as model architecture.

Key points

Agent: entity that perceives via sensors and acts via actuators
Fully observable versus partially observable environments
Episodic tasks: independent trials; sequential tasks: current actions affect future outcomes
Utility: numerical score capturing preferences over world states
Rationality: optimal action given available information, not omniscience

Locked