\input texinfo @c -*-texinfo-*- @c vim: filetype=texinfo expandtab tabstop=4 shiftwidth=4 @c %**start of header (This is for running Texinfo on a region.) @setfilename gawkworkflow.info @settitle GNU Awk Development Workflow @c %**end of header (This is for running Texinfo on a region.) @dircategory Text creation and manipulation @direntry * Gawk Work Flow: (gawkworkflow). Participating in @command{gawk} development. @end direntry @dircategory Individual utilities @direntry * Gawk Work Flow: (gawkworkflow)Overview. Participating in @command{gawk} development. @end direntry @c With early 2014 texinfo.tex, restore PDF links and colors @tex \gdef\linkcolor{0.5 0.09 0.12} % Dark Red \gdef\urlcolor{0.5 0.09 0.12} % Also \global\urefurlonlylinktrue @end tex @set xref-automatic-section-title @c The following information should be updated here only! @c This sets the edition of the document, the version of gawk it @c applies to and all the info about who's publishing this edition @c These apply across the board. @set UPDATE-MONTH July, 2022 @set TITLE Participating in @command{gawk} Development @set EDITION 0.74 @iftex @set DOCUMENT booklet @set CHAPTER chapter @set APPENDIX appendix @set SECTION section @set SUBSECTION subsection @end iftex @ifinfo @set DOCUMENT Info file @set CHAPTER major node @set APPENDIX major node @set SECTION minor node @set SUBSECTION node @end ifinfo @ifhtml @set DOCUMENT Web page @set CHAPTER chapter @set APPENDIX appendix @set SECTION section @set SUBSECTION subsection @end ifhtml @ifdocbook @set DOCUMENT booklet @set CHAPTER chapter @set APPENDIX appendix @set SECTION section @set SUBSECTION subsection @end ifdocbook @ifxml @set DOCUMENT booklet @set CHAPTER chapter @set APPENDIX appendix @set SECTION section @set SUBSECTION subsection @end ifxml @ifplaintext @set DOCUMENT booklet @set CHAPTER chapter @set APPENDIX appendix @set SECTION section @set SUBSECTION subsection @end ifplaintext @ifnottex @ifnotdocbook @macro ii{text} @i{\text\} @end macro @end ifnotdocbook @end ifnottex @ifdocbook @macro ii{text} @inlineraw{docbook,\text\} @end macro @end ifdocbook @c For HTML, spell out email addresses, to avoid problems with @c address harvesters for spammers. @ifhtml @macro EMAIL{real,spelled} ``\spelled\'' @end macro @end ifhtml @ifnothtml @macro EMAIL{real,spelled} @email{\real\} @end macro @end ifnothtml @c merge the function and variable indexes into the concept index @ifinfo @synindex fn cp @synindex vr cp @end ifinfo @iftex @syncodeindex fn cp @syncodeindex vr cp @end iftex @ifxml @syncodeindex fn cp @syncodeindex vr cp @end ifxml @ifdocbook @synindex fn cp @synindex vr cp @end ifdocbook @c If "finalout" is commented out, the printed output will show @c black boxes that mark lines that are too long. Thus, it is @c unwise to comment it out when running a master in case there are @c overfulls which are deemed okay. @iftex @finalout @end iftex @copying @docbook Published by: Free Software Foundation 51 Franklin Street, Fifth Floor Boston, MA 02110-1301 USA Phone: +1-617-542-5942 Fax: +1-617-542-2652 Email: gnu@@gnu.org URL: http://www.gnu.org/ Copyright © 2017, 2018, 2019, 2020, 2022 Free Software Foundation, Inc. All Rights Reserved. @end docbook @ifnotdocbook Copyright @copyright{} 2017, 2018, 2019, 2020, 2022 Free Software Foundation, Inc. @end ifnotdocbook @sp 2 This is Edition @value{EDITION} of @cite{@value{TITLE}}. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with the Invariant Sections being ``GNU General Public License'', with the Front-Cover Texts being ``A GNU Manual'', and with the Back-Cover Texts as in (a) below. A copy of the license is included in the section entitled ``GNU Free Documentation License''. @enumerate a @item The FSF's Back-Cover Text is: ``You have the freedom to copy and modify this GNU manual.'' @end enumerate @end copying @c Comment out the "smallbook" for technical review. Saves @c considerable paper. Remember to turn it back on *before* @c starting the page-breaking work. @c 4/2002: Karl Berry recommends commenting out this and the @c `@setchapternewpage odd', and letting users use `texi2dvi -t' @c if they want to waste paper. @c @smallbook @c Uncomment this for the release. Leaving it off saves paper @c during editing and review. @c @setchapternewpage odd @c @shorttitlepage @value{TITLE} @titlepage @title @value{TITLE} @subtitle Edition @value{EDITION} @subtitle @value{UPDATE-MONTH} @author Arnold D.@: Robbins @ifnotdocbook @c Include the Distribution inside the titlepage environment so @c that headings are turned off. Headings on and off do not work. @page @vskip 0pt plus 1filll Published by: @sp 1 Free Software Foundation @* 51 Franklin Street, Fifth Floor @* Boston, MA 02110-1301 USA @* Phone: +1-617-542-5942 @* Fax: +1-617-542-2652 @* Email: @email{gnu@@gnu.org} @* URL: @uref{http://www.gnu.org/} @* @c ISBN x-xxxxxx-xx-x @* @sp 2 @insertcopying @end ifnotdocbook @end titlepage @iftex @headings off @evenheading @thispage@ @ @ @strong{@value{TITLE}} @| @| @oddheading @| @| @strong{@thischapter}@ @ @ @thispage @end iftex @ifnottex @ifnotxml @ifnotdocbook @node Top @top General Introduction @c Preface node should come right after the Top @c node, in `unnumbered' sections, then the first chapter. This file describes how to participate in software development for @uref{http://www.gnu.org/software/gawk, GNU Awk (@command{gawk})}. @insertcopying @end ifnotdocbook @end ifnotxml @end ifnottex @menu * Preface:: Some introductory remarks. * Contributing:: How to contribute to @command{gawk} development. * Using Git:: Getting started with Git. * Configuring git:: Configuring Git. * Development without commit access:: How to work without commit access. * Development with commit access:: How to work with commit access. * General practices:: How things should usually be done. * Repo Maintenance:: Tips for keeping your repo clean. * Development Stuff:: Things you need to know to be a @command{gawk} developer. * Cheat Sheet:: Git command summary. * Resources:: Some further resources. * TODO:: Stuff still to do. * Index:: The index. @detailmenu * This Manual:: How to use this manual. * Conventions:: Typographical Conventions. * Acknowledgments:: Acknowledgments. * Reviewers:: A note to reviewers. * Push Pull:: The push/pull software development model. * Repo Copies:: What it means to have a copy of a repo. * Local Branches:: How to best use local branches. * Branches are state:: Branches represent development state. * Repo State:: The different branch types in the repo. * Local State:: Managing local branches. * Remotes:: What a ``remote'' is. * Cloning:: Cloning the repo the first time. * Switching Branches:: Moving from one branch to another. * Starting A New Branch:: Starting a new branch for development. * Undoing a change:: Throwing away changes. * Updating:: Keeping in sync with the upstream repo. * Rebasing:: Rebasing A Local Branch. * Merge Conflicts:: Dealing With Merge Conflicts. * Submitting Changes:: How to submit your changes. * Removing Branches:: Getting rid of unneeded branches. * Points to remember:: Things you need to keep in mind. * Initial setup:: Getting started with commit access. * ssh clone:: Cloning using an @samp{ssh://} URL. * Developing patches:: Developing patches. * Developing new features:: Developing new features. * Developing fixes:: Developing fixes. * Coding style:: Where to read up on the coding style. * Doing paperwork:: Legal stuff in order to contribute. * Tools:: Tools to have on your system for development. * GNU Tools:: The GNU Autotools. * Compilers:: A discussion of compilers that can be used. * Debugging:: Compiling for debugging. @end detailmenu @end menu @c @summarycontents @contents @node Preface @unnumbered Preface @c I saw a comment somewhere that the preface should describe the book itself, @c and the introduction should describe what the book covers. This @value{DOCUMENT} describes how to participate in development of GNU Awk (@command{gawk}). GNU Awk is a Free Software project belonging to the Free Software Foundation's GNU project. @menu * Intended Audience:: Who should read this manual. * This Manual:: How to use this manual. * Conventions:: Typographical Conventions. * Acknowledgments:: Acknowledgments. * Reviewers:: A note to reviewers. @end menu @node Intended Audience @unnumberedsec Intended Audience This @value{DOCUMENT} is aimed at software developers who wish to participate in @command{gawk} development. You should be comfortable working with traditional Unix-style command-line tools, and with the C language and standard library facilities. You should also have some prior experience using distributed source code control systems, such as the Concurrent Versions System (CVS) or Subversion (SVN). Experience with a more modern system such as Mercurial or Git will be even more helpful. @cindex Git Project The @value{DOCUMENT} focuses on participation in the project (that is, how to work most effectively if you wish to contribute to it) and also describes how to make use of the @uref{http://git-scm.org, Git} distributed source code management system for @command{gawk} development. @node This Manual @unnumberedsec Using This Book This @value{DOCUMENT} has the following chapters and appendices: @itemize @bullet @item @ref{Contributing} describes how to start contributing to the @command{gawk} project. @item @ref{Using Git} introduces the Git distributed source code management system. @item @ref{Configuring git} describes some initial set-up you need to do before using Git seriously. @item @ref{Development without commit access} gets into the meat of the development workflow, describing how to work if you don't have commit access to the Savannah repository. @item @ref{Development with commit access} continues the discussion, covering what's different when you can commit directly to the Savannah repository. @item @ref{General practices} describes general development practices used by the @command{gawk} development team. @item @ref{Repo Maintenance} presents several different things you need to know about to keep your repository in good shape. @item @ref{Development Stuff} describes some important points you should be familiar with in order to participate in @command{gawk} development and presents some tools that may make your work easier. @item @ref{Cheat Sheet} provides a short ``cheat sheet'' summarizing all the Git commands referenced in this @value{DOCUMENT}. @item @ref{Resources} provides a few pointers to Internet resources for learning more about Git. @end itemize @node Conventions @unnumberedsec Typographical Conventions @cindex Texinfo This @value{DOCUMENT} is written in @uref{http://www.gnu.org/software/texinfo/, Texinfo}, the GNU documentation formatting language. A single Texinfo source file is used to produce both the printed and online versions of the documentation. @ifnotinfo Because of this, the typographical conventions are slightly different than in other books you may have read. @end ifnotinfo @ifinfo This @value{SECTION} briefly documents the typographical conventions used in Texinfo. @end ifinfo Examples you would type at the command line are preceded by the common shell primary and secondary prompts, @samp{$} and @samp{>}. Input that you type is shown @kbd{like this}. Output from the command is preceded by the glyph ``@print{}''. This typically represents the command's standard output. Error messages and other output on the command's standard error are preceded by the glyph ``@error{}''. For example: @example $ @kbd{echo hi on stdout} @print{} hi on stdout $ @kbd{echo hello on stderr 1>&2} @error{} hello on stderr @end example @ifnotinfo In the text, almost anything related to programming, such as command names, variable and function names, and string, numeric and regexp constants appear in @code{this font}. Code fragments appear in the same font and quoted, @samp{like this}. Things that are replaced by the user or programmer appear in @var{this font}. Options look like this: @option{-f}. File names are indicated like this: @file{/path/to/ourfile}. Some things are emphasized @emph{like this}, and if a point needs to be made strongly, it is done @strong{like this}. The first occurrence of a new term is usually its @dfn{definition} and appears in the same font as the previous occurrence of ``definition'' in this sentence. @end ifnotinfo Characters that you type at the keyboard look @kbd{like this}. In particular, there are special characters called ``control characters.'' These are characters that you type by holding down both the @kbd{CONTROL} key and another key, at the same time. For example, a @kbd{Ctrl-d} is typed by first pressing and holding the @kbd{CONTROL} key, next pressing the @kbd{d} key, and finally releasing both keys. @quotation NOTE Notes of interest look like this. @end quotation @quotation CAUTION Cautionary or warning notes look like this. @end quotation @node Acknowledgments @unnumberedsec Acknowledgments @cindex Kahrs, J@"urgen Thanks to J@"urgen Kahrs for his initial efforts to write a document like this. Although his prose has not survived, his material was helpful in preparing this @value{DOCUMENT}. @cindex Bernat, Yehezkel Thanks to Yehezkel Bernat for reviewing this document and in general for his good intentions. @strong{FIXME:} YOUR NAME HERE... @node Reviewers @unnumberedsec Notes to Reviewers Please let me know if anything is missing, or unclear. Real errors with respect Git commands and usage are very important as well. Spelling errors and typo fixes welcome, but not as important. @node Contributing @chapter How to Start Contributing @command{gawk} development is distributed. It's done using electronic mail (@dfn{email}) and via branches in the Git repository (or @dfn{repo}) on @uref{http://savannah.gnu.org, Savannah}, the GNU project's source code management site. In this @value{CHAPTER} we use some Git terminology. If you're not at all familiar with Git, then skim this @value{CHAPTER} and come back after reading the rest of the @value{DOCUMENT}. @command{gawk} is similar to many other Free Software projects. To begin contributing, simply start! Take a look at the @file{TODO} file in the distribution, see if there is something of interest to you, and ask on the @email{bug-gawk@@gnu.org} mailing list if anyone else is working on it. If not, then go for it! (@xref{Development Stuff} for a discussion of some of the technical things you'll need to do. Here we describe the process in general.) Your contribution can be almost anything that is relevant for @command{gawk}, such as code fixes, documentation fixes, and/or new features. @quotation NOTE If possible, new features should be done using @command{gawk}'s extension mechanism. If you want to add a user-visible language change to the @command{gawk} core, you're going to have to convince the maintainer and the other developers that it's really worthwhile to do so. Changes that improve performance or portability, or that fix bugs, or that enable more things in extensions, will require less convincing, of course. @end quotation As you complete a task, submit patches for review to the @email{bug-gawk@@gnu.org} mailing list, where you'll be given feedback about your work. Once your changes are acceptable, the maintainer will commit them to the Git repository. Over time, as the maintainer and development team gain confidence in your ability to contribute, you may be asked to join the private @command{gawk} developers' mailing list, and/or be granted commit access to the Git repository on Savannah. This has happened to more than one person who just ``came out of the woodwork.'' Until that happens, or if you don't want to join the list, you should continue to work with private branches and submission of patches to the mailing list. Once you have commit access, if you want to make a major change or add a major feature, where the patch(es) would be very large, it has become the practice to create a separate branch, based off of @code{master}, to host the feature. This way the maintainer can review it, and you can continue to improve it, until it's ready for integration into @code{master}. @cindex GitHub @quotation NOTE Because of the GNU project's requirements for signed paperwork for contributions, the @command{gawk} project will @strong{not} work with pull requests from @uref{http://github.com, GitHub} or any other Git-based software hosting service. You must submit patches to the mailing list, and be willing to sign paperwork for large patches (@pxref{Doing paperwork}). @end quotation The @email{bug-gawk@@gnu.org} mailing list is not private. Anyone may send mail to it, and anyone may subscribe to it. To subscribe, go to the list's @uref{https://lists.gnu.org/mailman/listinfo/bug-gawk, web page} and follow the instructions there. If you plan to be involved long-term with @command{gawk} development, then you probably should subscribe to the list. @node Using Git @chapter Using Git This chapter provides an introduction to using Git. Our point is @emph{not} to rave about how wonderful Git is, nor to go into painful detail about how it works. Rather we want to give you enough background to understand how to use Git effectively for bug fix and feature development and to interact (``play nicely'') with the development team. @menu * Push Pull:: The push/pull software development model. * Repo Copies:: What it means to have a copy of a repo. * Local Branches:: How to best use local branches. * Branches are state:: Branches represent development state. @end menu @node Push Pull @section The ``Push/Pull'' Model of Software Development Git is a powerful, distributed source code management system. However, the way it's used for @command{gawk} development purposely does not take advantage of all its features. Instead, the model is rather simple, and in many ways much like more traditional distributed systems such as the @uref{http://www.nongnu.org/cvs, Concurrent Versions System} (CVS) or @uref{http://subversion.apache.org, Subversion} (SVN). The central idea can be termed ``push/pull.'' You @emph{pull} updates down from the central repository to your local copy, and if you have commit rights, you @emph{push} your changes or updates up to the central repository. Other developers work this way; pushing their changes up to the central repository and pulling your changes down into theirs. Where Git does stand out is in its management of multiple branches of development. Git makes it very easy to set up a separate branch for use in fixing a bug or developing a feature. You can then easily keep that branch up to date with respect to the main development branch(es), and eventually merge the changes from your branch into the main branch. Almost always Git does these merges for you without problem. When there is a problem (a @dfn{merge conflict}), usually it is very easy for you to @dfn{resolve} them and then complete the merge. We talk about this in more detail later (@pxref{Merge Conflicts}). @node Repo Copies @section How Git Stores Branches and Their Copies So how does Git work?@footnote{The following description is greatly simplified.} A repository consists of a collection of @dfn{branches}. Each branch represents the history of a collection of files and directories (a file @dfn{tree}). Each combined set of changes to this collection (files and directories added or deleted, and/or file contents changed) is termed a @dfn{commit}. When you first create a local copy of a remote repository (``clone the repo''), Git copies all of the original repository's branches to your local system. The original remote repository is referred to as being @dfn{upstream}, and your local repo is @dfn{downstream} from it. Git distinguishes branches from the upstream repo by prefixing their names with @samp{origin/}. Let's draw some pictures. @ref{savannah-repo} represents the state of the repo on Savannah: @page @float Figure,savannah-repo @caption{The Savannah @command{gawk} Repository} @smallexample +======================+ | Branches | +======================+ | master | +----------------------+ | gawk-4.1-stable | +----------------------+ | gawk-4.0-stable | +----------------------+ | feature/fix-comments | +----------------------+ | ... | +----------------------+ @end smallexample @end float @cindex @command{git} command @subentry @code{git branch} After you clone the repo, on your local system you will have a single branch named @code{master} that's visible when you use @samp{git branch} to see your branches. @cindex @command{git} command @subentry @code{git clone} @example $ @kbd{git clone http://git.savannah.gnu.org/r/gawk.git} @ii{Clone the repo} $ @kbd{cd gawk} @ii{Change to local copy} $ @kbd{git branch} @ii{See branch information} @print{} * master @end example @noindent The current branch is always indicated with a leading asterisk (@samp{*}). Pictorially, the local repo looks like @ref{your-repo} (you can ignore the @samp{T} column for the moment): @float Figure,your-repo @caption{Your Local @command{gawk} Repository} @smallexample +===+======================++=============================+ | T | Local Branches || Remote Branches | +===+======================++=============================+ | X | master || origin/master | +---+----------------------++-----------------------------+ | | || origin/gawk-4.1-stable | +---+----------------------++-----------------------------+ | | || origin/gawk-4.0-stable | +---+----------------------++-----------------------------+ | | || origin/feature/fix-comments | +---+----------------------++-----------------------------+ | | || ... | +---+----------------------++-----------------------------+ @end smallexample @end float @noindent @cindex @code{origin/} branches @cindex branches @subentry @code{origin/} Note that what is simply @code{gawk-4.1-stable} in the upstream repo is now referred to as @code{origin/gawk-4.1-stable}. The @samp{origin/} branches are a snapshot of the state of the upstream repo. This is how Git allows you to see what changes you've made with respect to the upstream repo, without having to actually communicate with the upstream repo over the Internet. (When files are identical, Git is smart enough to not have two separate physical copies on your local disk.) If you're working on a simple bug fix or change, you can do so directly in your local @code{master} branch. You can then commit your changes, and if you have access rights, push them upstream to the Savannah repo. (However, there is a process to follow. Please read the rest of this @value{DOCUMENT}.) @node Local Branches @section Local Branches @cindex local branches @cindex branches @subentry local Let's talk about local branches in more detail. (The terminology used here is my own, not official Git jargon.) There are two kinds of local branches: @table @dfn @item Tracking Branches @cindex tracking branches @cindex branches @subentry tracking @cindex @command{git} command @subentry @code{git checkout} Tracking branches track branches from the upstream repository. You first create a tracking branch simply by checking out a branch from the upstream. You use the branch name without the leading @samp{origin/} prefix. For example, @samp{git checkout gawk-4.1-stable}. @cindex @command{git} command @subentry @code{git push} You can then work on this branch, making commits to it as you wish. Once things are ready to move upstream, you simply use @samp{git push}, and your changes will be pushed up to the main repo.@footnote{Assuming you have permission to do so, of course.} You should @strong{never} checkout a branch using the @samp{origin/} prefix. Things will get very confused. Always work on local tracking branches. @item Purely Local Branches A @dfn{purely local branch} exists only on your system. You may be developing some large new feature, or fixing a very difficult bug, or have a change for which paperwork has not yet been completed. In such a case, you would keep your changes on a local branch, and periodically synchronize it with @code{master} (or whichever upstream branch you started from). @end table This may seem somewhat abstract so far. We demonstrate with commands and branches in @ref{Development without commit access}, later in this @value{DOCUMENT}. Let's say you have checked out a copy of @code{gawk-4.1-stable} and have created a purely local branch named @code{better-random}. Then our picture now looks like @ref{your-repo-2}, where the @samp{T} column indicates a tracking branch. @float Figure,your-repo-2 @caption{Your Local @command{gawk} Repository With a Purely Local Branch} @smallexample +===+======================++=============================+ | T | Local Branches || Remote Branches | +===+======================++=============================+ | X | master || origin/master | +---+----------------------++-----------------------------+ | X | gawk-4.1-stable || origin/gawk-4.1-stable | +---+----------------------++-----------------------------+ | | || origin/gawk-4.0-stable | +---+----------------------++-----------------------------+ | | || origin/feature/fix-comments | +---+----------------------++-----------------------------+ | | || ... | +---+----------------------++-----------------------------+ | | better-random || | +---+----------------------++-----------------------------+ @end smallexample @end float @node Branches are state @section Branches Represent Development State Branches represent development state. At any given time, when you checkout a particular branch (or create a new one), you have a copy of the @command{gawk} source tree that you should be able to build and test. The following @value{SECTION}s describe the different branches in the @command{gawk} repository and what they are for, as well as how to use your own branches. @menu * Repo State:: The different branch types in the repo. * Local State:: Managing local branches. * Remotes:: What a ``remote'' is. @end menu @node Repo State @subsection Branches in the Savannah Repository There are several kinds of branches in the Savannah repository. @table @dfn @cindex branches @subentry dead @cindex dead branches @item Dead Branches Branches with the prefix @samp{dead-branches/} (such as @code{dead-branches/const}) hold code that was never merged into the main code base. For example, a feature which was started, but later deemed to be unwise to add. These branches keep the code available, but they are not updated. @cindex branches @subentry stable @cindex stable branches @item Stable Branches These branches are used for bug fixes to released versions of @command{gawk}. Sometimes new development (i.e., user-visible changes) also occurs on these branches, although in a perfect world they would be used only for bug fixes. These branches have names like @code{gawk-4.1-stable}, @code{gawk-4.0-stable}, and so on. Once a release has been made from @code{master}, the previous stable branch is not updated. For example, once @command{gawk} 4.1.0 was released, no more work was done on @code{gawk-4.0-stable}. @cindex branch, main @cindex main branch @cindex branch, @code{master} @cindex @code{master} branch @item The Main Branch This is the @code{master} branch. Here is where most new feature development takes place, and releases of new major versions are based off of this branch. Feature branches are typically based off this branch as well, and when the feature is deemed complete, merged back into it. @cindex branches @subentry feature @cindex feature branches @item Feature Branches Often, a proposed new feature or code improvement is quite involved. It may take some time to perfect, or the @command{gawk} development team may not be convinced that the feature should be kept. For this purpose, the team uses branches prefixed with @samp{feature/}. This prefix is used even for code that simply improves the internals and does not make a user-visible change. Having large changes on separate branches makes it easier for members of the team to review the code, and also makes it easier to keep the changes up-to-date with respect to @code{master}, since Git excels at merging commits from one branch to another. @end table @node Local State @subsection Branches in Your Local Repository @cindex branches @subentry purely local @cindex purely local branches Purely local branches are where you do your own development. You may use purely local branches because you don't have commit rights to the Savannah repo. You may also use them if you are doing some work that isn't ready for sharing with the rest of the team, or cannot be committed for some other reason. For example, for around a nine-month period, the maintainer kept a purely local branch for some contributed changes for which paperwork had not yet been completed. @node Remotes @subsection A Closer Look at Branch Naming @cindex @command{git branch} command, @option{-a} option Earlier, we said that Git maintains copies of the branches in the upstream repo, as well as manages your local branches. You can see all these branches with @samp{git branch -a}: @example $ @kbd{git branch -a} @print{} gawk-4.1-stable @print{} * master @print{} remotes/origin/HEAD -> origin/master @print{} remotes/origin/dead-branches/async-events @print{} @dots{} @print{} remotes/origin/feature/api-mpfr @print{} remotes/origin/feature/array-iface @print{} remotes/origin/feature/fix-comments @print{} @dots{} @end example You'll note that what we've referred to as @samp{origin/} branches appear in the output with an additional prefix: @samp{remotes/}. Up to this point, we've treated Git as if it allowed only a single upstream repository. But in fact, you can configure it to use more than one. All the known upstream repositories are grouped under the @samp{remotes/} prefix, with @code{remotes/origin} being the one from which you initially cloned your local repository. The ability to work with multiple upstream repositories is an advanced one; @command{gawk} development does not make use of it. The intent of this @value{SUBSECTION} is to explain the output from @samp{git branch -a}, nothing more. @node Configuring git @chapter Configuring Global Settings For Git @cindex configuration settings @cindex settings, configuration @cindex global configuration settings @cindex configuration settings @subentry global Before starting to use Git, you should configure it with some important settings that won't change as you use Git. You may configure options both globally, and on a per-repository basis. Here, we discuss only global configuration settings. @cindex @command{git} command @subentry @code{git config} You can configure Git using either @samp{git config}, or by editing the relevant files with your favorite text editor.@footnote{You are required to use either Vim or Emacs, other text editors are not allowed. Of course, reasonable developers wouldn't want to use any other editor anyway.} @cindex email address The first things to set are your email address and your real name: @cindex @code{user.name} configuration setting @cindex @code{user.email} configuration setting @cindex configuration setting @subentry @code{user.name} @cindex configuration setting @subentry @code{user.email} @example $ @kbd{git config --global user.name "J. P. Developer"} @ii{Set full name} $ @kbd{git config --global user.email jpdev@@example.com} @ii{Set email address} @end example Setting these two items are an absolute requirement. @strong{Note}: No aliases are allowed. If you can't supply your real name, you cannot contribute to the project. Other options that the @command{gawk} maintainer recommends that you use are: @cindex @code{push.default} configuration setting @cindex @code{pager.status} configuration setting @cindex configuration setting @subentry @code{push.default} @cindex configuration setting @subentry @code{pager.status} @example $ @kbd{git config --global push.default simple} @ii{Only push the current branch} $ @kbd{git config --global pager.status true} @ii{Use pager for output of} git status @end example @cindex @file{.gitconfig} file The global settings are stored in the @file{.gitconfig} file in your home directory. The file looks like this: @example [user] name = J. P. Developer email = jpdev@@example.com [push] default = simple [pager] status = true @end example The @code{push.default=simple} setting ensures that older versions of Git only push the current branch up to the Savannah repo. This is the safest way to operate, and is the default in current Git versions. There may be other settings in your configuration file as well. Use @samp{git config} to see your settings: @example $ @kbd{git config --list} @print{} user.name=J. P. Developer @print{} user.email=jpdev@@example.com @print{} push.default=simple @end example Here are the @command{gawk} maintainer's settings: @example $ @kbd{git config --global --list} @print{} user.name=Arnold D. Robbins @print{} user.email=arnold@@@dots{} @print{} credential.helper=cache --timeout=3600 @print{} push.default=simple @print{} color.ui=false @print{} core.autocrlf=input @print{} pager.status=true @print{} log.decorate=auto @end example Additional, per-project (``local'') settings are stored in each repo's @file{.git/config} file. @node Development without commit access @chapter Development Without Commit Access In this chapter we present step-by-step recipes for checking out and working with a local copy of the Savannah Git repo for @command{gawk}. The presentation is for when you do not have commit access to the Git repo, and so you cannot push your changes directly. @menu * Cloning:: Cloning the repo the first time. * Switching Branches:: Moving from one branch to another. * Starting A New Branch:: Starting a new branch for development. * Undoing a change:: Throwing away changes. * Saving Without Committing:: Moving changes aside. * Updating:: Keeping in sync with the upstream repo. * Submitting Changes:: How to submit your changes. * Removing Branches:: Getting rid of unneeded branches. * Points to remember:: Things you need to keep in mind. @end menu @node Cloning @section Cloning The Repo @cindex @command{git} command @subentry @code{git clone} Clone the Savannah repo using @samp{git clone}. You should do so using using the HTTPS protocol; HTTPS is considered to be more secure than the native Git protocol and is preferred.@footnote{The native Git protocol is supported, but not recommended.} @cindex URL @subentry for cloning repositories To choose which method, you supply a @dfn{URL} for the repo when you clone it, as follows. @example $ @kbd{git clone https://git.savannah.gnu.org/r/gawk.git} @ii{Clone the repo} @print{} ... $ @kbd{cd gawk} @ii{Start working} @end example @emph{You only need to clone the repo once.} From then on, you update its contents using other Git commands. For example, after coming back from your vacation in the Bahamas: @cindex @command{git} command @subentry @code{git pull} @example $ @kbd{cd gawk} @ii{Move to the repo} $ @kbd{make distclean} @ii{A good idea before updating} @print{} ... $ @kbd{git pull} @ii{Update it} @end example To build, you should generally follow this recipe: @example $ @kbd{./bootstrap.sh && ./configure && make -j && make check} @end example @cindex @file{bootstrap.sh} script @quotation NOTE Unless you have installed all the tools described in @ref{GNU Tools}, you @emph{must} run @command{./bootstrap.sh} every time you clone a repo, do a @samp{git pull} or checkout a different branch. (In the latter case, do @samp{make distclean} first.) Otherwise things will get messy very quickly. The @command{bootstrap.sh} script ensures that all of the file time stamps are up to date so that it's not necessary to run the various configuration tools. @end quotation @node Switching Branches @section Switching Branches So far, we've been working in the default @code{master} branch. Let's check what's happening in the @code{gawk-4.1-stable} branch: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git pull} @example $ @kbd{make distclean} @ii{Clean up} $ @kbd{git checkout gawk-4.1-stable} @ii{Checkout a different branch} @print{} ... $ @kbd{git pull} @ii{Get up to date} @print{} ... $ @kbd{./bootstrap.sh && ./configure &&} @ii{Start working} > @kbd{make -j && make check} @end example @node Starting A New Branch @section Starting A New Branch Let's say you want to work on a new feature. For example, you might decide to add Python syntax support.@footnote{Just joking. Please don't attempt this for real.} You should create a new branch on which to work. First, switch back to @code{master}: @cindex @command{git} command @subentry @code{git checkout} @example $ @kbd{make distclean} $ @kbd{git checkout master} @end example Now, create a new branch. The easiest way to do that is with the @option{-b} option to @samp{git checkout}: @example $ @kbd{git checkout -b feature/python} @print{} ... @end example @cindex @file{ChangeLog} file @cindex committing changes You now do massive amounts of work in order to add Python syntax support. As you do each defined chunk of work, you update the @file{ChangeLog} file with your changes before @dfn{committing} them to the repo. @cindex @command{git} command @subentry @code{git status} Let's say you've added a new file @file{python.c} and updated several others. Use @samp{git status} to see what's changed: @example $ @kbd{git status} @print{} ... @end example @cindex @command{git} command @subentry @code{git diff} @cindex @command{git} command @subentry @code{git difftool} @cindex @command{meld} utility Before committing the current set of changes, you can use @samp{git diff} to view the changes. You may also use @samp{git difftool}@footnote{Don't run @samp{git difftool} in the background; it works interactively.} to run an external @command{diff} command, such as @command{meld} on GNU/Linux: @example $ @kbd{git diff} @ii{Regular built-in tool for standard diffs} $ @kbd{git difftool --tool=meld} @ii{GUI diff tool} @end example @cindex @command{git} command @subentry @code{git add} When you're happy with the changes, use @samp{git add} to tell Git which of the changed and/or new files you wish to have ready to be committed: @example $ @kbd{git add ...} @end example @cindex @command{git} command @subentry @code{git status} Use @samp{git status} to see that your changes are scheduled for committing: @example $ @kbd{git status} @print{} @end example Now you can commit your changes to your branch: @cindex @command{git} command @subentry @code{git commit} @example $ @kbd{git commit} @end example @noindent @cindex @command{git} command @subentry @code{git log} Running @samp{git commit} causes Git to invoke an editor (typically from the @env{$EDITOR} environment variable) in which you can compose a commit message. Please supply a short message summarizing the commit. This message will be visible via @samp{git log}. @node Undoing a change @section Undoing A Change Should you need to undo a change that you have not yet committed (so that you can start over), you can do so on per-file basis by simply checking out the file again: @cindex @command{git} command @subentry @code{git checkout} @example $ @kbd{git checkout awkgram.y} @ii{Undo changes to} awkgram.y@ii{. There is no output} @end example @cindex @command{git} command @subentry @code{git reset} To start over completely, use @samp{git reset --hard}. Note that this will @emph{throw away} all your changes, with no chance for recovery, so be sure you really want to do it. @node Saving Without Committing @section Moving Changes Aside Sometimes, you may be in the middle of a set of changes that are not yet completed, when you need to stop what you're doing and work on something else. For example, you might be updating the documentation when a bug report comes in and you want to work on the bug. But you can't just switch branches, since you haven't finished your current changes. @cindex @command{git} command @subentry @code{git stash} @cindex @command{git} command @subentry @code{git stash pop} @cindex @command{git} command @subentry @code{git stash list} The way to work around this problem is with @samp{git stash}. This command saves your changes in a special place within Git from which they may be restored later. After executing @samp{git stash}, your current branch is restored to its original, pristine state. The workflow might go something like this: @example $ @kbd{git checkout my-local-branch} @ii{Checkout a work branch} ... @ii{Do some work} $ @kbd{git stash} @ii{Save the work aside} $ @kbd{git checkout gawk-4.1-stable} @ii{Work on a bug fix} ... @ii{Now we're done} $ @kbd{git checkout my-local-branch} @ii{Go back to our local work} $ @kbd{git stash pop} @ii{Restore the earlier changes} @end example The stash is maintained as a stack. Sets of changes are pushed onto the stack by @samp{git stash} and popped off of it with @samp{git stash pop}. You may use @samp{git stash list} to see the list of saved changes. @node Updating @section Updating and Merging As you work on your branch, you will occasionally want to bring it up to date with respect to @code{master}. This @value{SECTION} discusses updating local branches and handling merge conflicts. @menu * Rebasing:: Rebasing A Local Branch. * Merge Conflicts:: Dealing With Merge Conflicts. @end menu @node Rebasing @subsection Rebasing A Local Branch @cindex rebasing For purely local branches, bringing your branch up to date is called @dfn{rebasing}, which causes the branch to look @emph{as if} you had started from the latest version of @code{master}. The steps are as follows: @cindex @command{git} command @subentry @code{git rebase} @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git pull} @example $ @kbd{git checkout master} @ii{Checkout} master $ @kbd{git pull} @ii{Update it} $ @kbd{git checkout feature/python} @ii{Move back to new, purely local branch} $ @kbd{git rebase master} @ii{``Start over'' from current} master @end example @node Merge Conflicts @subsection Dealing With Merge Conflicts @cindex conflicts, from merging @cindex merge conflicts Sometimes, when merging from @code{master} into your branch, or from a branch into @code{master}, there will be @dfn{merge conflicts}. These are one or more areas within a file where there are conflicting sets of changes, and Git could not do the merge for you. In this case, the conflicted area will be delimited by the traditional conflict markers, @samp{<<<}, @samp{===} and @samp{>>>}. Your mission then is to edit the file and @dfn{resolve} the conflict by fixing the order of additions (such as in a @file{ChangeLog} file), or fixing the code to take new changes into account. Once you have done so, you tell Git that everything is OK using @samp{git add} and @samp{git commit}: @example $ @kbd{git checkout feature/python} @ii{Move back to new, purely local branch} $ @kbd{git rebase master} @ii{``Start over'' from current} master @print{} First, rewinding head to replay your work on top of it... @print{} Applying: Demo change. @print{} Using index info to reconstruct a base tree... @print{} M main.c @print{} Falling back to patching base and 3-way merge... @print{} Auto-merging main.c @print{} CONFLICT (content): Merge conflict in main.c @print{} error: Failed to merge in the changes. @print{} Patch failed at 0001 Demo change. @print{} Use 'git am --show-current-patch' to see the failed patch @print{} @print{} Resolve all conflicts manually, mark them as resolved with @print{} "git add/rm ", then run "git rebase --continue". @print{} You can instead skip this commit: run "git rebase --skip". @print{} To abort and get back to the state before "git rebase", run "git rebase --abort". $ @kbd{gvim main.c} @ii{Edit the file and fix the problem} $ @kbd{git add main.c} @ii{Tell Git everything is OK now @dots{}} $ @kbd{git commit} @ii{@dots{} and it's settled} $ @kbd{git rebase --continue} @ii{Continue the rebase} @end example The @command{git rebase --continue} then continues the process of rebasing the current branch that we started in @ref{Rebasing}. It's not necessary if you are using @samp{git merge} (@pxref{Points to remember}). @node Submitting Changes @section Submitting Your Changes So now your feature is complete. You've added test cases for it to the test suite@footnote{You did do this, didn't you?}, you have @file{ChangeLog} entries that describe all the changes@footnote{You remembered this, right?}, you have documented the new feature@footnote{You wouldn't neglect this, would you?}, and everything works great. You're ready to submit the changes for review, and with any luck, inclusion into @command{gawk}. @cindex review, changes you made @cindex changes, submitting for review There are two ways to submit your changes for review. @table @emph @cindex generating a single patch @cindex patch, single, generation of @item Generate a single large patch To do this, simply compare your branch to the branch off which it is based: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git diff} @example $ @kbd{git checkout feature/python} $ @kbd{git diff master > /tmp/python.diff} @end example Mail the @file{python.diff} file to the appropriate mailing list along with a description of what you've changed and why. @cindex @command{git} command @subentry @code{git format-patch} @cindex generating multiple patches @cindex patches, multiple, generation of @item Generate a set of patches that in toto comprise your changes To do this, use @samp{git format-patch}: @cindex @command{git} command @subentry @code{git checkout} @example $ @kbd{git checkout feature/python} $ @kbd{git format-patch} @end example This creates a set of patch files, one per commit that isn't on the original branch. Mail these patches, either separately, or as a set of attachments, to the appropriate mailing list along with a description of what you've changed and why. @end table Either way you choose to submit your changes, the @command{gawk} maintainer and development team will review your changes and provide feedback. If you have signed paperwork with the FSF for @command{gawk} and the maintainer approves your changes, he will apply the patch(es) and commit the changes. Which list should you send mail to? If you are just starting to contribute, use @email{bug-gawk@@gnu.org}. After making enough contributions, you may be invited to join the private @command{gawk} developers' mailing list. If you do so, then submit your changes to that list. If you make any substantial changes, you will need to assign copyright in those changes to the Free Software Foundation before the maintainer can commit those changes. @xref{Doing paperwork}, for more information. @node Removing Branches @section Removing Branches @cindex removing @subentry branches @cindex branches @subentry removing Once the maintainer has integrated your changes, you can get rid of your local branch: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git pull} @cindex @command{git} command @subentry @code{git branch} @example $ @kbd{git checkout master} @ii{Move to upstream branch} $ @kbd{git pull} @ii{Update} $ @kbd{gvim ChangeLog ...} @ii{Verify your changes are in} $ @kbd{git branch -d feature/python} @ii{Remove your local branch} @end example @node Points to remember @section Points to Remember There are some important points to remember: @itemize @bullet @item Always do a @samp{make distclean} before switching between branches. Things will get really confused if you don't. @item For upstream branches, @emph{always} work with tracking branches. @emph{Never} use @samp{git checkout origin/@var{whatever}}. Git will happily let you do something like that, but it's just plain asking for trouble. @item Make sure your tracking branches are up-to-date before doing anything with them, particularly using them as the basis for a rebase or merge. This typically means a three-step process: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git pull} @example $ @kbd{git checkout master} @ii{Get to local copy} $ @kbd{git pull} @ii{Bring it up to date} $ @kbd{git checkout feature/python} @ii{Go back to your branch} @end example @noindent You can then do the actual rebase: @cindex @command{git} command @subentry @code{git rebase} @example $ @kbd{git rebase master} @ii{Now rebase your feature off of master} @end example @item Git always treats the currently checked-out branch as the object of operations. For example, when comparing files with the regular @command{diff} command, the usage is @samp{diff @var{oldfile} @var{newfile}}. For @samp{git diff}, the current branch takes the place of @var{newfile}, thus: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git diff} @example $ @kbd{git checkout feature/python} $ @kbd{git diff master} @ii{Compare} master @ii{to current branch} @end example @noindent or if merging: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git pull} @cindex @command{git} command @subentry @code{git merge} @example $ @kbd{git checkout master} @ii{Checkout} master $ @kbd{git pull} @ii{Update tracking branch} $ @kbd{git merge feature/python} @ii{Merge changes into} master @end example @end itemize @node Development with commit access @chapter Development With Commit Access This @value{CHAPTER} describes how to do development when you @emph{do} have commit access to the @command{gawk} repo on Savannah. @menu * Initial setup:: Getting started with commit access. * ssh clone:: Cloning using an @samp{ssh://} URL. * Developing patches:: Developing patches. * Developing new features:: Developing new features. * Developing fixes:: Developing fixes. @end menu @node Initial setup @section Initial Setup Congratulations! After becoming a quality contributor to @command{gawk} development, you've been invited to join the private development list and to accept having commit access to the repo. @cindex Savannah @subentry creating an account @cindex account, Savannah, creation of @cindex @code{ssh} key The first thing to do is to create an account on Savannah, choosing a unique user name. To do so, go to the @uref{http://savannah.gnu.org, Savannah home page} and click on the ``New User'' link. The setup will include uploading of your @command{ssh} key, as per the instructions on the Savannah web page. After you've done all this, send email to the maintainer with your Savannah user name, and he will add you to the list of users who have commit access to the repo. @node ssh clone @section Cloning The Repo With An @command{ssh} URL In order to be able to commit changes to the repo, you must clone it using an @samp{ssh://} URL. Cloning the repo with @command{ssh} is similar to cloning with HTTPS, but the URL is different: @cindex @command{git} command @subentry @code{git clone} @cindex URL @subentry for @command{gawk} repository @cindex Repository, @command{gawk}, URL for @example $ @kbd{git clone ssh://yourname@@git.sv.gnu.org/srv/git/gawk.git} @print{} ... @end example Here, you should replace @samp{yourname} in the command with the user name you chose for use on Savannah. @node Developing patches @section Developing Patches The first part of developing a patch is the same as for developers without commit access: @enumerate 1 @item Develop the code and test it. @item @cindex @file{ChangeLog} file Update the @file{ChangeLog}. @item @cindex documentation files @cindex @file{gawktexi.in} documentation @cindex @file{gawk.1} manual page If necessary, update the documentation: @file{doc/gawktexi.in} and/or @file{doc/gawk.1}. @cindex @command{git} command @subentry @code{git diff} @item Use @samp{git diff > mychange.diff} to create a patch file. @item Send it to the mailing list for discussion. @item Iterate until the patch is ready to be committed. @end enumerate However, now that you have commit access, you can commit the fix and push it up to the repo yourself! Let's assume you've made a bug fix directly on @code{master}. Here's how to commit your changes: @cindex @command{git} command @subentry @code{git diff} @cindex @command{git} command @subentry @code{git add} @cindex @command{git} command @subentry @code{git commit} @cindex @command{git} command @subentry @code{git push} @example $ @kbd{git diff} @ii{Review the patch one more time} $ @kbd{git add @dots{}} @ii{Add any files for committing} $ @kbd{git commit} @ii{Commit the files, with a commit message} $ @kbd{git push} @ii{Push the files up to the repo. Ta da!} @end example The first three steps are the same described earlier (@pxref{Starting A New Branch}). The @samp{git push} is what's new, and it updates the repo on Savannah. Congratulations! As a courtesy, you should send a note to the mailing list indicating that you have pushed your change. @node Developing new features @section Developing New Features Developing a new feature can be easier once you have commit access to the repo. First, create a new branch to hold your feature: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git pull} @example $ @kbd{git checkout master} @ii{Start from} master $ @kbd{git pull} @ii{Be sure to be up to date} $ @kbd{git checkout -b feature/python} @ii{Create and switch to a new branch} @end example Now, you can develop as normal, adding new files if necessary (such as new tests), modifying code, updating the @file{ChangeLog} and documentation, and so on. You can share changes with the mailing list as diffs, as usual. However, especially for a large feature, it would be better to push your branch up to Savannah. Then, everyone else can simply pull it down to their local systems and review your changes at their leisure. To push your branch up initially: @cindex @command{git} command @subentry @code{git diff} @cindex @command{git} command @subentry @code{git add} @cindex @command{git} command @subentry @code{git commit} @cindex @command{git} command @subentry @code{git push} @example $ @kbd{git diff} @ii{Review your changes} $ @kbd{git add @dots{}} @ii{Add any files for committing} $ @kbd{git commit} @ii{Commit the files with a commit message} $ @kbd{git push -u origin feature/python} @ii{Push the branch up to the repo} @end example When you use @samp{push -u origin}, Git helpfully converts your purely local branch into a tracking branch. It becomes as if the branch had originated from the upstream repo and you checked it out locally. @emph{You only need to do @samp{git push -u origin} once.} As you continue to work on your branch, the workflow simplifies into this: @cindex @command{git} command @subentry @code{git diff} @cindex @command{git} command @subentry @code{git add} @cindex @command{git} command @subentry @code{git commit} @cindex @command{git} command @subentry @code{git push} @example $ @kbd{git diff} @ii{Review your changes} $ @kbd{git add @dots{}} @ii{Add any files for committing} $ @kbd{git commit} @ii{Commit the files} $ @kbd{git push} @ii{Push your changes to the branch upstream} @end example @node Developing fixes @section Developing Fixes If you want to make a fix on @code{master} or on the current stable branch, you work the same way, by producing and discussing a diff on the mailing list. Once it's approved, you can commit it yourself: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git pull} @cindex @command{git} command @subentry @code{git add} @cindex @command{git} command @subentry @code{git commit} @cindex @command{git} command @subentry @code{git diff} @example $ @kbd{git checkout master} @ii{Move to} master $ @kbd{git pull} @ii{Make sure we're up to date with the maintainer} $ @kbd{gvim @dots{}} @ii{Make any fixes, compile, test} $ @kbd{git diff} @ii{Review your changes} $ @kbd{git add @dots{}} @ii{Add any files for committing} $ @kbd{git commit} @ii{Commit the files with a commit message.} @end example When you're ready to push your changes: @cindex @command{git} command @subentry @code{git pull} @cindex @command{git} command @subentry @code{git push} @example $ @kbd{git pull} @ii{Download latest version; Git will merge} $ @kbd{gvim ...} @ii{Resolve any merge conflicts with} git add @ii{and} git commit $ @kbd{git push} @ii{Now you can push your changes upstream} @end example @xref{Merge Conflicts} for instructions on dealing with merge conflicts. @node General practices @chapter General Development Practices This @value{CHAPTER} discusses general practices for @command{gawk} development. The discussion here is mainly for developers with commit access to the Savannah repo. @table @dfn @cindex propagating fixes to other branches @cindex fixes, propagating to other branches @item Propagating Fixes Usually, bug fixes should be made on the current ``stable'' branch. Once a fix has been reviewed and approved, you can commit it and push it yourself. Typically, the maintainer then takes care to merge the fix to @code{master} and from there to any other branches. However, you are welcome to save him the time and do this yourself. @cindex directory ownership @cindex ownership of directories @item Directory ownership Some developers ``own'' certain parts of the tree, such as the @file{pc} and @file{vms} directories. They are allowed to commit changes to those directories without review by the mailing list, but changes that also touch the mainline code should be submitted for review. @item New feature development Unless you can convince the maintainer (and the other developers!) otherwise, you should @emph{always} start branches for new features from @code{master}, and not from the current ``stable'' branch. Use @samp{checkout -b feature/@var{feature_name}} to create the initial branch. You may then elect to keep it purely local, or to push it up to Savannah for review, even if the feature is not yet totally ``ready for prime time.'' @end table During development of a new feature, you will most likely wish to keep your feature branch up to date with respect to ongoing improvements in @code{master}. This is generally easy to do. There are two different mechanisms, and which one you use depends upon the nature of your new feature branch. @table @dfn @item As long as your branch is purely local You should use @samp{git rebase} to the keep the branch synchronized with the original branch from which it was forked: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git pull} @cindex @command{git} command @subentry @code{git rebase} @example $ @kbd{git checkout master} @ii{Move to} master $ @kbd{git pull} @ii{Bring it up to date} $ @kbd{git checkout feature/python} @ii{Move to your new feature branch} $ @kbd{git rebase master} @ii{Rebase from} master @end example @noindent The rebasing operation may require that you resolve conflicts (@pxref{Merge Conflicts}). Edit any conflicted files and resolve the problem(s). Compile and test your changes, then use @samp{git add} and @samp{git commit} to indicate resolution, and then use @samp{git rebase --continue} to continue the rebasing. Git is very good about providing short instructions on how to continue when such conflicts occur. @item Once the branch has been pushed up to Savannah You @emph{must} use @samp{git merge} to bring your feature branch up to date. That flow looks like this: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git pull} @cindex @command{git} command @subentry @code{git merge} @example $ @kbd{git checkout master} @ii{Move to} master $ @kbd{git pull} @ii{Bring it up to date} $ @kbd{git checkout feature/python} @ii{Move to your new feature branch} $ @kbd{git merge master} @ii{Merge from} master @end example @noindent Here too, you may have to resolve any merge conflicts (@pxref{Merge Conflicts}). Once that's done, you can push the changes up to Savannah. @end table When the changes on your branch are complete, usually the maintainer merges the branch to @code{master}. But there's really no magic involved, the merge is simply done in the other direction: @cindex @command{git} command @subentry @code{git checkout} @cindex @command{git} command @subentry @code{git pull} @cindex @command{git} command @subentry @code{git merge} @example $ @kbd{git checkout feature/python} @ii{Checkout feature branch} $ @kbd{git pull} @ii{Bring it up to date} $ @kbd{git checkout master} @ii{Checkout} master $ @kbd{git pull} @ii{Bring it up to date} $ @kbd{git merge feature/python} @ii{Merge from} feature/python @ii{into} master @end example If you've been keeping @samp{feature/python} in sync with @code{master}, then there should be no merge conflicts to resolve, and you can push the result to Savannah: @cindex @command{git} command @subentry @code{git push} @example $ @kbd{git push} @ii{Push up to Savannah} @end example Since @samp{feature/python} is no longer needed, it can be gotten rid of: @cindex @command{git} command @subentry @code{git branch} @cindex @command{git} command @subentry @code{git push} @example $ @kbd{git branch} @ii{Still on} master @dots{} * master $ @kbd{git branch -d feature/python} @ii{Delete feature branch} $ @kbd{git push -u origin --delete feature/python} @ii{Delete on Savannah} @end example The @samp{git push} command deletes the @code{feature/python} branch from the Savannah repo. @cindex @command{git} command @subentry @code{git fetch} @noindent Finally, you should send an email to developer's list describing what you've done so that everyone else can delete their copies of the branch and do a @samp{git fetch --prune} (@pxref{Repo Maintenance}). To update the other remaining development branches with the latest changes on @code{master}, use the @samp{helpers/update-branches.sh} script in the repo. @node Repo Maintenance @chapter Keeping Your Repo Organized There are a few commands you should know about to help keep your local repo clean. @table @emph @cindex removing @subentry old branches @cindex old branches, removing @cindex branches @subentry removing @item Removing old branches Developers add branches to the Savannah repo and when development on them is done, they get merged into @code{master}. Then the branches on Savannah are deleted (as shown in @ref{General practices}). However, your local copies of those branches (labelled with the @samp{origin/} prefix) remain in your local repo. If you don't need them, then you can clean up your repo as follows. First, remove any related tracking branch you may have: @cindex @command{git} command @subentry @code{git pull} @cindex @command{git} command @subentry @code{git branch} @example $ @kbd{git pull} @ii{Get up to date} $ @kbd{git branch -d feature/merged-feature} @ii{Remove tracking branch} @end example Then, ask Git to clean things up for you: @cindex @command{git} command @subentry @code{git fetch} @example $ @kbd{git fetch --prune} @ii{Remove unneeded branches} @end example @cindex removing @subentry cruft @cindex cruft, removing @item Removing cruft As Git works, occasional ``cruft'' collects in the repository. Git does occasionally clean this out on its own, but if you're concerned about disk usage, you can do so yourself using @samp{git gc} (short for ``garbage collect''). For example: @cindex @command{git} command @subentry @code{git gc} @example $ @kbd{du -s .} @ii{Check disk usage} @print{} 99188 . @ii{Almost 10 megabytes} $ @kbd{git gc} @ii{Collect garbage} @print{} Counting objects: 32114, done. @print{} Delta compression using up to 4 threads. @print{} Compressing objects: 100% (6370/6370), done. @print{} Writing objects: 100% (32114/32114), done. @print{} Total 32114 (delta 25655), reused 31525 (delta 25231) $ @kbd{du -s .} @ii{Check disk usage again} @print{} 75168 . @ii{Down to 7 megabytes} @end example @cindex renaming branches @cindex branches @subentry renaming @item Renaming branches Occasionally you may want to rename a branch.@footnote{This discussion is adopted from @uref{https://multiplestates.wordpress.com/2015/02/05/rename-a-local-and-remote-branch-in-git, here}.} If your branch is local and you are on it, use: @example $ @kbd{git branch -m feature/@var{new-name}} @end example @noindent Otherwise, use: @example $ @kbd{git branch -m feature/@var{old-name} feature/@var{new-name}} @end example You then need to fix the upstream repo. This command does so, using an older syntax to simultaneously delete the old name and push the new name. You should be on the new branch: @example $ @kbd{git push origin :feature/@var{old-name} feature/@var{new-name}} @end example @quotation NOTE It is the leading @samp{:} in the first branch name that causes Git to delete the old name in the upstream repo. Don't omit it! @end quotation Finally, reset the upstream branch for the local branch with the new name: @example $ @kbd{git push -u origin feature/@var{new-name}} @end example You should also update the mailing list to let the other developers know what's happening. @end table @node Development Stuff @chapter Development Stuff This @value{CHAPTER} discusses other things you need to know and/or do if you're going to participate seriously in @command{gawk} development. @menu * Coding style:: Where to read up on the coding style. * Doing paperwork:: Legal stuff in order to contribute. * Tools:: Tools to have on your system for development. * Debugging:: Compiling for debugging. @end menu @node Coding style @section Coding Style @cindex coding style You should read the discussion about adding code in the @command{gawk} documentation. @ifnothtml @xref{Additions, Additions, Making Additions to @command{gawk}, gawk, GAWK: Effective awk Programming}, for a discussion of the general procedure. In particular, pay attention to the coding style guidelines in @ref{Adding Code, Adding Code, Adding New Features, gawk, GAWK: Effective awk Programming}.@footnote{Changes that don't follow the coding style guidelines won't be accepted. Period.} These two sections may also be found online, at @uref{https://www.gnu.org/software/gawk/manual/html_node/Additions.html#Additions}, and @uref{https://www.gnu.org/software/gawk/manual/html_node/Adding-Code.html#Adding-Code}, respectively. @end ifnothtml @ifhtml See @uref{https://www.gnu.org/software/gawk/manual/html_node/Additions.html#Additions, the section @cite{Making Additions to @command{gawk}}}, in the online documentation for a discussion of the general procedure. In particular, pay attention to the coding style guidelines in @uref{https://www.gnu.org/software/gawk/manual/html_node/Adding-Code.html#Adding-Code, the section @cite{Adding New Features}}, also in the online documentation. @end ifhtml @node Doing paperwork @section Assigning Copyrights to the FSF @cindex assigning copyright @cindex copyright, assignment For any change of more than just a few lines, you will need to assign copyright in (that is, ownership of) those changes to the Free Software Foundation. This is generally an easy thing to do. In particular, you can choose to use a version of the copyright assignment which assigns all your current @emph{and future} changes to @command{gawk} to the FSF. This means that you only need to do the paperwork once, and from then on all your changes will automatically belong to the FSF. The maintainer recommends doing this. The maintainer will help you with this process once you have a contribution that warrants it. @node Tools @section Software Tools You Will Need @cindex software tools This @value{SECTION} discusses additional tools that you may need to install on your system in order to be in sync with what the @command{gawk} maintainer uses. It also discusses different C compiler options for use during code development, and how to compile @command{gawk} for debugging. @menu * GNU Tools:: The GNU Autotools. * Compilers:: A discussion of compilers that can be used. @end menu @node GNU Tools @subsection GNU Tools @cindex GNU software tools @cindex autotools If you expect to work with the configuration files and/or the @file{Makefile} files, you will need to install a number of other GNU tools. In general, you should be using the latest versions of the tools, or least the same ones that the maintainer himself uses. This helps minimize the differences that the maintainer has to resolve when merging changes, and in general avoids confusion and hassle. Similarly, you should install the latest GNU documentation tools as well. The tools are described in the following list: @table @command @cindex @command{autoconf} @cindex GNU @command{autoconf} @cindex @file{configure.ac} file @item autoconf GNU Autoconf processes the @file{configure.ac} files in order to generate the @file{configure} shell script and @file{config.h.in} input file. See @uref{https://www.gnu.org/software/autoconf/autoconf.html, the Autoconf home page} for more information. @cindex @command{automake} @cindex GNU @command{automake} @cindex @file{Makefile.am} file @item automake GNU Automake processes the @file{configure.ac} and @file{Makefile.am} files to produce @file{Makefile.in} files. See @uref{https://www.gnu.org/software/automake, the Automake home page} for more information. @cindex @command{gettext} @cindex GNU @command{gettext} @cindex @file{gawk.pot} file @item gettext GNU Gettext processes the @command{gawk} source code to produce the original @file{po/gawk.pot} message template file. Normally you should not need need to do this; the maintainer usually manages this task. See @uref{https://www.gnu.org/software/gettext, the Gettext home page} for more information. @cindex @command{libtool} @cindex GNU @command{libtool} @cindex extensions, @command{gawk} @item libtool GNU Libtool works with Autoconf and Automake to produce portable shared libraries. It is used for the extensions that ship with @command{gawk}, whose code is in the @file{extensions} directory. See @uref{https://www.gnu.org/software/libtool, the Libtool home page} for more information. @cindex @command{makeinfo} @cindex GNU @command{makeinfo} @cindex @command{Texinfo} @cindex GNU Texinfo @item makeinfo The @command{makeinfo} command is used to build the Info versions of the documentation. You need to have the same version as the maintainer uses, so that when you make a change to the documentation, the corresponding change to the generated Info file will be minimal. @command{makeinfo} is part of GNU Texinfo. See @uref{https://www.gnu.org/software/texinfo, the Texinfo home page} for more information. @end table @node Compilers @subsection Compilers @cindex compilers @cindex GCC, the GNU Compiler Collection The default compiler for @command{gawk} development is GCC, the @uref{https://gcc.gnu.org, GNU Compiler Collection}. The default version of GCC is whatever is on the maintainer's personal GNU/Linux system, although he does try to build the latest released version if that is newer than what's on his system, and then occasionally test @command{gawk} with it. @cindex @command{clang} compiler He also attempts to test occasionally with @uref{https://clang.llvm.org/, @command{clang}}. However, he uses whatever is the default for his GNU/Linux system, and does @emph{not} make an effort to build the current version for testing. Both GCC and @command{clang} are highly optimizing compilers that produce good code, but are very slow. There are two other compilers that are faster, but that may not produce quite as good code. However, they are both reasonable for doing development. @table @emph @cindex @command{tcc} compiler @cindex Tiny C compiler @item The Tiny C Compiler, @command{tcc} This compiler is @emph{very} fast, but it produces only mediocre code. It is capable of compiling @command{gawk}, and it does so well enough that @samp{make check} runs without errors. However, in the past the quality has varied, and the maintainer has had problems with it. He recommends using it for regular development, where fast compiles are important, but rebuilding with GCC before doing any commits, in case @command{tcc} has missed something.@footnote{This bit the maintainer once.} See @uref{http://www.tinycc.org, the project's home page} for some information. More information can be found in the project's @uref{http://repo.or.cz/tinycc.git, Git repository}. The maintainer builds from the @code{mob} branch for his work, but after updating it you should check that this branch still works to compile @command{gawk} before installing it. @cindex @command{pcc} compiler @cindex Portable C compiler @item The (Revived) Portable C Compiler This is an updated version of the venerable Unix Portable C Compiler, PCC. It accepts ANSI C syntax and supports both older and modern architectures. It produces better code than @command{tcc} but is slower, although still much faster than GCC and @command{clang}. See @uref{http://pcc.ludd.ltu.se, the project's home page} for more information. See @uref{http://pcc.ludd.ltu.se/supported-platforms} for instructions about obtaining the code using CVS and building it. @cindex @command{pcc} compiler @subentry Git mirror An alternative location for the source is the @command{gawk} maintainer's @uref{https://github.com/arnoldrobbins/pcc-revived, Git mirror} of the code. If you're using Ubuntu GNU/Linux 18.04 or later, you need to use the @code{ubuntu-18} branch from this Git mirror. @end table @node Debugging @section Compiling For Debugging @cindex debugging, compiling for @cindex compiling for debugging If you wish to compile for debugging, you should use GCC. After running @command{configure} but before running @command{make}, edit the @file{Makefile} and remove the @option{-O2} flag from the definition of @code{CFLAGS}. Optionally, do the same for @file{support/Makefile} and/or @file{extensions/Makefile}. Then run @command{make}. @cindex @file{.developing} file You can enable additional debugging code by creating a file named @file{.developing} in the @command{gawk} source code directory @emph{before} running @command{configure}. Doing so enables additional conditionally-compiled debugging code within @command{gawk}, and adds additional warning and debugging options if compiling with GCC. It also disables optimization. @node Cheat Sheet @appendix Git Command Cheat Sheet This @value{APPENDIX} provides an alphabetical list of the Git commands cited in this @value{DOCUMENT}, along with brief descriptions of what the commands do. @cindex @command{git} command @subentry @code{git help} @cindex @option{--help} option for @command{git} @cindex @command{git} command @subentry @option{--help} option Note that you may always use either @samp{git help @var{command}} or @samp{git @var{command} --help} to get short, man-page style help on how to use any given Git command. @table @code @item git add Add a file to the list of files to be committed. @item git branch View existing branches, or delete a branch. The most useful options are @option{-a} and @option{-d}. @item git checkout Checkout an existing branch, create a new branch, or checkout a file to reset it. Use the @option{-b} option to create and checkout a new branch in one operation. @item git clone Clone (make a new copy of) an existing repository. You generally only need to do this once. @item git commit Commit changes to files which have been staged for committing with @samp{git add}. This makes your changes permanent, @emph{in your local repository only}. To publish your changes to an upstream repo, you must use @samp{git push}. @item git config Display and/or change global and/or local configuration settings. @item git diff Show a unified-format diff of what's changed in the current directory as of the last commit. It helps to have Git configured to use its builtin pager for reviewing diffs (@pxref{Configuring git}). @item git difftool Use a ``tool'' (usually a GUI-based program) to view differences, instead of the standard textual diff as you'd get from @samp{git diff}. @item git fetch Update your local copy of the upstream's branches. That is, update the various @samp{origin/} branches. This leaves your local tracking branches unchanged. With the @option{--prune} option, this removes any copies of stale @samp{origin/} branches. @item git format-patch Create a series of patch files, one per commit not on the original branch from which you started. @item git gc Run a ``garbage collection'' pass in the current repository. This can often reduce the space used in a large repo. For @command{gawk} it does not make that much difference. @item git help Print a man-page--style usage summary for a command. @item git log Show the current branch's commit log. This includes who made the commit, the date, and the commit message. Commits are shown from newest to oldest. @item git merge Merge changes from the named branch into the current one. @item git pull When in your local tracking branch @code{@var{xxx}}, run @samp{git fetch}, and then merge from @code{origin/@var{xxx}} into @code{@var{xxx}}. @item git push Push commits from your local tracking branch @code{@var{xxx}} through @code{origin/@var{xxx}} and on to branch @code{@var{xxx}} in the upstream repo. Use @samp{git push -u origin --delete @var{xxx}} to delete an upstream branch. (Do so carefully!) @item git rebase Rebase the changes in the current purely local branch to look as if they had been made relative to the latest commit in the current upstream branch (typically @code{master}). This is how you keep your local, in-progress changes up-to-date with respect to the original branch from which they were started. @item git reset @cindex @command{git} command @subentry @code{git reset} @subentry @option{--hard} option Restore the original state of the repo, especially with the @option{--hard} option. Read up on this command, and use it carefully. @item git stash @cindex @command{git} command @subentry @code{git stash} Save your current changes in a special place within Git. They can be restored with @samp{git stash pop}, even on a different branch. Use @samp{git stash list} to see the list of stashed changes. @item git status Show the status of files that are scheduled to be committed, and those that have been modified but not yet scheduled for committing. Use @samp{git add} to schedule a file for committing. This command also lists untracked files. @end table @node Resources @appendix Git Resources @cindex @cite{Pro Git} book There are many Git resources available on the Internet. Start at the @uref{http://git-scm.org, Git Project home page}. In particular, the @uref{https://git-scm.com/book/en/v2, @cite{Pro Git} book} is available online. @cindex Savannah @subentry using Git guide See also @uref{http://savannah.gnu.org/maintenance/UsingGit, the Savannah quick introduction to Git}. A nice article on how Git works is @uref{http://jwiegley.github.io/git-from-the-bottom-up/, @cite{Git From The Bottom Up}}, by John Wiegley. @node TODO @appendix Stuff Still To Do In This Document @itemize @bullet @item Fill out all examples with full output @end itemize @ifnotdocbook @node Index @unnumbered Index @end ifnotdocbook @printindex cp @bye