5 things I wish I'd known about Git
Hunter Johnson
Posted on August 22, 2022
Article was written by Ian Miell, author of the Educative course "Learn Git the Hard Way".
Git can be utterly bewildering to someone who uses it casually or is not interested in things like directed acyclic graphs.
It's difficult to get started, but you'll find git has many useful tricks and layers that can bring your project to the next level. The best way to learn is with the help of an expert who started where you are now.
Today, I'll briefly go through five things I wish someone had explained to me before I started using git.
Here’s what we’ll cover today:
- 1. The Four Stages
- 2. What is a Reference?
- 3. What's a Fast-Forward?
- 4. What's a Rebase?
- 5. The power of git log
- Next steps
1. The Four Stages
Having come from using CVS as a source control (an older example of a Version Control System (VCS)), one of the most baffling things about git was its different approach to the state of content.
If like me, you use git commit -am "checkin message"
to commit your work, then the second "adding/staging" state is more or less invisible to you. Instead, the -a
does it for you. It’s for this reason that I encourage new users to drop the -a
flag and git add
by hand so that they understand these distinctions.
One subtlety is that the -a
flag doesn’t add new files to the content tracked by git – it just adds changes made.
These states exist so that people can work independently and offline, syncing later. This was the driving force behind the development of git.
From this comes another key point: all git repositories are created equal.
My clone of your repository is not dependent on yours for its existence. Each repository stands on its own and is only related to others if you configure it so. This is another key difference between git and more traditional (nay, obsolete) client/server models of content history management.
This results in a far more flexible but potentially more complicated workflow. The workflow now looks more like this:
2. What is a Reference?
Git docs and blogs keep talking about references, but what is a reference?
A reference is just that: a pointer to a commit. And a commit is a unique reference to a new state of the content.
Once you understand this, a few other concepts make more sense.
HEAD
is a reference to "where you are" in the content history. It’s the content you’re currently looking at in your git repo.
When you git commit
, the HEAD
moves to the new commit.
A git tag
reference is one that can have arbitrary text and does not move when a new commit is seen.
A git branch
is a reference that moves with the HEAD
whenever you commit a new change.
A couple of other confusing things then become clearer. For example, a detached HEAD
is nothing to panic about despite its scary name – it just means that your HEAD
is not pointed at a branch.
Take a look at this diagram:
It represents a series of commits.
Confusingly, with git diagrams, the arrows go backward in time. A
is the first commit, then B
, and so on to the latest commit (H
).
There are three references
– master
(which is pointed at C
), experimental
, which is pointed at H
, and HEAD
, which is also pointed at H
. Remember, HEAD
essentially means "where we are".
3. What’s a Fast-Forward?
Now that you understand what a HEAD
reference is, understanding what a fast-forward is pretty simple.
Usually, when you merge two branches together, you get a new commit:
In the above diagram, I
is a commit that represents the merging of H
and G
from its common ancestor (D
). The changes made on both branches are applied together from D
and the resulting state of the content after the commit is stored in a new state (I
).
But consider the diagram we saw above:
There we have two branches, but no changes were made on one of them. Let’s say we want to merge the changes on experimental
(E
and H
) into master
– we’ve experimented, and the experiment was successful.
In this case, merging E
and H
into master requires no changes from H
, since there are no F
and G
changes that need to be merged together with E
and H
. They are all in one line of changes.
Such a merge only requires that the master
reference is picked up and moved from C
to H
. This is a "fast-forward" – the reference just needed moving along, and no content needed to be reconciled.
4) What’s a Rebase?
My manual page for git rebase says:
"Reapply commits on top of another base tip."
This is more understandable than previous versions of this main page, but will probably still confuse many people.
A visual example makes it much clearer.
You could merge feature1
into the master
branch, and you’d end up with a new commit (G
), which makes the tree look like this:
You can see that you’ve retained the chronology, as both branches keep their history and order of commits.
A git rebase
takes a different approach. It ‘picks up’ the changes on our branch (commit D
on feature1
in this case) and applies it to the end of the branch we are on (HEAD
is at master
).
This looks a lot neater, doesn’t it? master can now be ‘fast-forwarded’ to where feature1
is by moving master
‘s pointer along to D
.
The downside is that we’ve lost something a slight organizational benefit by doing this. It no longer reflects the order of things that happened chronologically. This is a trade-off you'll have to consider on a case-by-case basis.
5) The power of git log
The above concepts are all interesting, but how can you use these in your day-to-day work?
For this, I highly recommend getting to grips with git’s native log command. While there are many GUIs that can display history, they all have their own opinions on how things should be displayed, and moreover are not available everywhere. As a source of truth, git log is unimpeachable and transparent.
I wrote about this in more depth here, but to give yourself a flavor, try these two commands on a repo of your choice. They cover 90% of my git log usage day-to-day:
$ git log --oneline --graph
$ git log --oneline --graph --simplify-by-decoration --all
-
oneline
directs the log to only show the commit id and comment per-commit -
graph
provides a visual of the structure right in your terminal -
simplify-by-decoration
trims all minor changes from git history, allowing you to see all big developments across a project's history
Next steps
I hope these tips smooth your learning curve with Git and gave you one or more new tools for your next project. You'll also need hands-on practice to make sure you're ready to use Git professionally when the time comes.
To help you practice, I've adapted my book into an interactive Educative course, called Learn Git the Hard Way. This course starts off with fundamental concepts like repositories and commits and helps build your knowledge by exploring Git stash, pushing code, and more. By the end, you'll have all the Git skills employers are looking for and a certificate to prove it.
Happy learning!
Continue reading about Git on Educative
- Git and GitHub in a Nutshell: Definitive tutorial for beginners
- Git Tutorial: Learn Git Branching in 5 minutes
- GitHub is reimagining the technical recruiting process
Start discussion
What are some specific reasons why you want to learn git? Was this article helpful? Let us know in the comments below!
Posted on August 22, 2022
Join Our Newsletter. No Spam, Only the good stuff.
Sign up to receive the latest update from our blog.