Introduction

Changelog

2024-08-24: Init for Ubuntu 24.04
2024-09-10: Add writeup on the general structure and concepts in Ansible

Getting started

Ansible is an orchestration tool for performing remote code execution on a networked client, by relying on the remote Python interpreter. By default it works by:

Opening an SSH connection to the remote
Transporting execution code over the connection (i.e. agent-less)
Executing said code using the remote Python
Execution results are collected as JSON output, and returned

Installation

Documentation here: https://docs.ansible.com/ansible/latest/installation_guide/intro_installation.html#control-node-requirements Reproduced below:

user:~$ sudo apt install pipx  # pip for isolated CLI apps
user:~$ pipx install --include-deps ansible  # core + collections
user:~$ pipx ensurepath
user:~$ pipx inject --include-apps ansible argcomplete

Set up configuration file:

user:~$ sudo mkdir -p /etc/ansible
user:~$ ansible-config init --disabled > ansible.cfg
user:~$ sudo mv ansible.cfg /etc/ansible

These executable programs (modules) are run as individual tasks. A first test is usually sending a script to localhost that writes an output string: this is provided with the built-in ping module¹⁾.

user:~$ ansible localhost -m ping
localhost | SUCCESS => {
    "changed": false,
    "ping": "pong"
}

Explanation

This hides a lot of the under-the-hood magic²⁾, which really corresponds to:

user:~$ ansible localhost -m ansible.builtin.ping -a "data=pong"
localhost | SUCCESS => {
    "changed": false,
    "ping": "pong"
}

The documentation for the ping module can be found on the Ansible Community Documentation, which in turn points to the ping.py module on Github (simplified here for clarity):

ansible/lib/ansible/modules/ping.py

from ansible.module_utils.basic import AnsibleModule
 
def main():
    module = AnsibleModule(
        argument_spec={'data': {'type': 'str', 'default': 'pong'}},
        supports_check_mode=True,
    )
    if module.params['data'] == 'crash':
        raise Exception("boom")
    module.exit_json(ping=module.params['data'])

The AnsibleModule is a template for creating basic Python scripts, with the exit_json() method documented here:

ansible/lib/ansible/module_utils/basic.py

...
    def exit_json(self, **kwargs):
        self.do_cleanup_files()
        self._return_formatted(kwargs)
        sys.exit(0)
 
    def _return_formatted(self, kwargs):
        ...  # emit warnings, handle logging, etc.
        print('\n%s' % self.jsonify(kwargs))
...

We see that "pong" is the default value to the "data" parameter, and a JSON is generated to return this value. Another "changed" key is observed, which, from the list of common values, indicates that the system state did not change after successfully running the task.

Small aside: there is actually no SSH connection to "localhost" being opened here, unless this is specified in an inventory (what an inventory is will be described later).

We can create a YAML declarative script (playbook) to repeat this command, with abbreviations and defaults explicit for clarity:

user:~$ cat mytasks.yml
- hosts: localhost
  tasks:
    - ansible.builtin.ping:
        data: pong
user:~$ ansible-playbook mytasks.yml

This is pretty much the core idea of what Ansible is designed to do, and grouping of tasks into playbooks. There is a set of best practices that is likely a good read.

Playbooks and modules

Ansible has a set of reserved keywords for use in playbooks, which can be found here. Learn the common ones to avoid mistaking modules from keywords³⁾. The next most common keyword is arguably name for labelling tasks (and coexists with playbook comments):

mytasks.yaml

# For hello-world equivalent tutorial
- name: Connectivity check with localhost
  hosts: localhost
  tasks:
    - name: Run ansible ping
      ansible.builtin.ping:
        data: pong

Most modules run through the Python interpreter (Ansible is designed to work with Python for easier scripting). To run shell commands, one of the command, shell, or raw modules should be used⁴⁾, in order of decreasing preference:

command executes simple scripts via Python, without importing any environment variables⁵⁾
shell opens a new shell via Python and allows for shell syntax, e.g. piping⁶⁾
raw executes low-level commands via SSH without using a Python interpreter. Only ever recommended for systems without Python and/or Python installation.

Note however that the use of shell scripting means error handling still needs to be done, i.e. it takes away from what makes Ansible easy to use due to its declarative nature. Use the right tool for the right job.

Common tasks are likely to have already been implemented by the community as a module. To search for modules, consider one of these methods:

Performing a Google search
Grepping ansible-doc -l for the full list of installed modules.
Searching on Ansible Galaxy, then using ansible-galaxy to install (see below section on "collections" and "roles")
Failing which, write one yourself.

Organization concepts

Ansible defines a couple more terms to allow for more effective categorization of tasks, namely "inventories", "plays", "collections", "roles".

Inventories group hosts into a file, with additional subgroup delineation. This can then be specified as a possible host in the playbook to run tasks against, e.g. "hosts: local", or "hosts: all" to run against all hosts in the inventory. The default inventory is defined in /etc/ansible/hosts.

user:~$ cat myinventory
[local]
localhost
 
[webserver]
192.168.1.12
webserver.internal
 
user:~$ ansible-playbook -i myinventory mytasks.yml  # 'hosts:' need to be changed

Playbooks can contain many independent groups of tasks. Each group is called a play. This can be used to run different tasks across different hosts, e.g. running a playbook updating all machines to update webservers and database servers separately. In short, they tie tasks to a set of hosts.

- name: Update web servers
  hosts: webservers
  remote_user: root
  tasks:
    - name: Ensure apache is at the latest version
      ansible.builtin.yum:
        name: httpd
        state: latest

- name: Update db servers
  hosts: databases
  remote_user: root
  tasks:
    - name: Ensure postgresql is at the latest version
      ansible.builtin.yum:
        name: postgresql
        state: latest

Modules are typically not standalone, e.g. managing a webserver may involve modules for updating, starting, stopping, etc. These are typically grouped and distributed as a collection of modules. These collections reside in namespaces, with some special ones being "ansible", "community", "local".

For example, the community.grafana.grafana_dashboard refers to the Grafana Dashboard management module, as part of the Grafana collections in the Community namespace.
The ansible.builtin (default) and ansible.legacy collections are not available on Ansible Galaxy, hence they reside in different documentation.

# List available collections
user:~$ ansible-galaxy collection list
 
# Show documentation for a module
user:~$ ansible-doc community.grafana.grafana_dashboard

Only a subset of modules in a collection need to be run, in order to, say, setup an NTP service. This is where Ansible roles come in, which are essentially like a self-contained group of tasks, and also additionally enforces an organization structure to store variables and template files for reuseability.

An example would be the timesync role which installs and configures NTP, and the methods/paths will need to vary based on which platform is being configured.
The key idea is reuseability of a group of tasks, so that the same role can be used across different environments, or even across different playbooks. The element of role customization comes in the form of variables and template files.
Roles can be packaged as part of collections⁷⁾.
Note that roles do not map to specific hosts - this is done in a play/playbook.

Summary of concepts

Playbooks contain a set of plays that runs tasks using modules.
Playbooks reference hosts collected in an inventory.
Groups of tasks are consolidated into roles.
Roles and modules are distributed in a collection, under a predefined namespace.

Other resources

Consider Geerling's book: Ansible for DevOps, which seems to be popular in this circle. Also a small commentary on why Ansible, reproduced below:

As someone who has used all of them (yes all of them, puppet, chef, ansible, salt, cfengine). They each have pros and cons. The best one is the one that fits your organization business requirements.

* If you are a windows shop and you can pay for enterprise, then chef is the most mature.
* If you are a linux shop with money, then go for RedHat Ansible Automation Platform.
* If you are a home user or don't have money, then open source Ansible is the easiest to setup.

However the industry is moving to immutable infrastructure where you don't even need to worry about VMs at all. In that case Terrafom, Pulumi, Helm are the go-to tools.

- u/dev_all_the_ops (Dec 2023)

Lastly, a quick example of using Ansible for VM deployment on Proxmox.

¹⁾

Note some tutorials specify ansible -m ping localhost, but the convention is to have the host pattern be the first argument

²⁾

This is partly why I took so long to pick up motivation to learn Ansible: many tutorials jump straight into playbooks and rules without actually dissecting where this module was coming from, or why the return value is a pong. It took me trial-and-error to even figure out -a was the required parameter and its "key=value" format. The main confusion behind plain "ansible" is arguably the loose syntax rules, e.g. ansible localhost -m "ping data=pong" is also valid, without the use of the module arguments parameter.

³⁾

Note that special variables that are non-user-assignable also exist

⁴⁾

belonging to the ansible.builtin collection

⁵⁾

similar to Python subprocess.run()

⁶⁾

similar to Python subprocess.run(shell=True)

⁷⁾

some sources online hint at roles being deprecated, but this is really more of standalone roles being deprecated in favor of packaging them as part of collections - not the same thing!