The extract system provides an interface between the revision control system (currently Subversion) and the build system. Where appropriate, it extracts code from the repository and other user-defined locations to a directory tree suitable for feeding into the build system. In this chapter, we shall use many examples to explain how to use the extract system. At the end of this chapter, you will be able to extract code from the local file system as well as from different branches of different repository URLs. You will also learn how to mirror code to a remote machine. Finally, you will be given an introduction on how to specify configurations for the build system via the extract configuration file. (For further information on the build system, please see the next chapter The Build System.) The last section of the chapter tells you what you can do in the case when Subversion is not available.
To invoke the extract system, simply issue the command:
fcm extract
By default, the extract system searches for an extract configuration file "ext.cfg" in "$PWD" and then "$PWD/cfg". If an extract configuration file is not found in these directories, the command fails with an error. If an extract configuration file is found, the system will use the configuration specified in the file to perform the current extraction.
If the destination of the extraction does not exist, the system performs a new full extraction to the destination. If a previous extraction already exists at the destination, the system performs an incremental extraction, updating any modifications if necessary. If a full (fresh) extraction is required for whatever reason, you can invoke the extract system using the "-f" option, (i.e. the command becomes "fcm extract -f"). For further information on the extract command, please see FCM Command Reference > fcm extract.
The extract configuration file is the main user interface of the extract system. It is a line based text file. For a complete set of extract configuration file declarations, please refer to the Annex: Declarations in FCM extract configuration file.
A simple example of a basic extract configuration file is given below:
Extract configuration example 1 - extract from a local path |
---|
cfg::type ext # line 1 cfg::version 1.0 # line 2 # line 3 dest::rootdir $PWD # line 4 # line 5 repos::var::user $HOME/var # line 6 # line 7 expsrc::var::user code # line 8 |
The above demonstrates how to use the extract system to extract code from a local user directory. Here is an explanation of what each line does:
Invoking the extract system using the above configuration file will "extract" all sub-directories under "$HOME/var/code" to "$PWD/src/var/code". Note: the extract system ignores all hidden files, (i.e. directories and files beginning with a "."). It will write a build configuration file to "$PWD/cfg/bld.cfg". The configuration used for this extraction will be written to the configuration file at "$PWD/cfg/ext.cfg".
Note - incremental extraction |
---|
Suppose you have already performed an extraction using the above configuration file. At a later time, you have made some changes to some of the files in the source directory. Re-running the extract system on the same configuration will trigger an incremental extraction. In an incremental extraction, the system will update only those files that are modified. In exact words, the system checks the modification time of each file in the source and destination. If a source file is newer than its corresponding destination file, it checks whether the content differs. The destination is only updated if its content differs from the source. |
The next example demonstrates how to extract from a Subversion repository URL:
Extract configuration example 2 - extract from a Subversion URL |
---|
cfg::type ext # line 1 cfg::version 1.0 # line 2 # line 3 dest::rootdir $PWD # line 4 # line 5 repos::var::trunk svn://server/var/trunk # line 6 version::var::trunk 1234 # line 7 # line 8 expsrc::var::trunk code # line 9 |
Invoking the extract system using the above configuration file will extract all sub-directories under "svn://server/var/trunk/code" to "$PWD/src/var/code". It will write a build configuration file to "$PWD/cfg/bld.cfg". The configuration used for this extraction will be written to the configuration file at "$PWD/cfg/ext.cfg".
Note - declaration of source directories for extraction |
---|
EXPSRC or SRC?
So far, we have only declared source directories using the "EXPSRC" statement, which stands for "expandable source directory". A source directory declared using this statement will trigger the system to search recursively for any sub-directories under the declared one. Any sub-directories containing regular source files will be included in the extraction. Empty directories, hidden directories and directories containing only hidden files are ignored. If you do not want the system to search for sub-directories underneath your declared source directory, you can declare your source directory using the "SRC" statement. The "SRC" statement is essentially the same as "EXPSRC" except that it does not trigger the automatic recursive search for source directories. In fact, the system implements the "EXPSRC" statement by expanding it into a list of "SRC" statements. Package and sub-package The second field of a repository, revision or source directory declaration label is the name of the container package. It is a name selected by the user to identify the system or project he/she is working on. (Therefore, it is often sensible to choose an identifier that matches the name of the project or system.) The package name provides a unique namespace for a file container. Source directories are automatically arranged into sub-packages, using the names of the sub-directories as the names of the sub-packages. For example, the declaration at line 9 in example 2 will put the source directory in the "var::code" sub-package automatically. The double colon "::" and the double underscore "__" (internal only) are delimiters for package names in the extract system. Please avoid using "::" and "__" for naming your files and directories. You can declare a sub-package name explicitly in your source directory statement. For example, the following two lines are equivalent: src::var::trunk code/VarMod_Surface src::var::code::VarMod_Surface::trunk code/VarMod_Surface Explicit sub-package declaration should not be used normally, as it requires a lot more typing (although there are some situations where it can be useful). |
Note - the expanded extract configuration file |
---|
At the end of a successful extraction, the configuration used by the
current extraction is written in "cfg/ext.cfg" under the extract
destination root. This file is an "expanded" version of the original,
with changes in the following declarations:
With this file, it should be possible for later extraction to re-create the current configuration even if the contents of the repository have changed. (This applies only to code stored in the repository.) |
The next example demonstrates how to extract from a repository and mirror the code to a remote machine. It is essentially the same as example 2, except that it has three new lines to describe how the system can mirror the extracted code to a remote machine.
Extract configuration example 3 - mirror code to remote machine |
---|
cfg::type ext cfg::version 1.0 dest::rootdir $PWD rdest::machine tx01 # line 6 rdest::logname frva # line 7 rdest::rootdir /scratch/frva/extract/example3 # line 8 repos::var::trunk svn://server/var/trunk version::var::trunk 1234 expsrc::var::trunk code |
Here is an explanation of what each line does:
Invoking the extract system on the above configuration will trigger an extraction similar to that given in example 2, but it will also attempt to mirror the contents at "$PWD/src/var/code" to "/scratch/frva/extract/example3/src" on the remote machine. It will also mirror the expanded extract configuration file "$PWD/cfg/ext.cfg" to "/scratch/frva/extract/example3/cfg/ext.cfg" and "$PWD/cfg/bld.cfg" to "/scratch/frva/extract/example3/cfg/bld.cfg". It is also worth noting that the content of the build configuration file will be slightly different, since it will include directory names appropriate for the remote system.
Note - mirroring command |
---|
The extract system currently supports "rdist" and "rsync" as its
mirroring tool. The default is "rsync". To use "rdist" instead of
"rsync", add the following line to your "$HOME/.fcm" file:
set::tool::mirror rdist N.B. If you are going to mirror code to another machine, you need to ensure that your account on the remote machine is set up correctly to accept commands from the local machine. In our current settings of both "rdist" and "rsync", all you need to do is set up your "$HOME/.rhosts" file on the remote machine. For example, if you are "fred" working on the local machine "eld001", you will need to have the following entry in your "$HOME/.rhosts" on the remote machine: eld001 fred |
So far, we have only extracted from a single location. The extract system is not much use if that is the only thing it can do. In fact, the extract system supports extraction of multiple source directories from multiple branches in multiple repositories. The following configuration file is an example of how to extract from multiple repositories:
Extract configuration example 4 - extract from multiple repositories |
---|
cfg::type ext cfg::version 1.0 dest::rootdir $PWD repos::var::trunk fcm:var_tr # line 6 repos::ops::trunk fcm:ops_tr # line 7 repos::gen::trunk fcm:gen_tr # line 8 version::gen::trunk 2468 # line 10 expsrc::var::trunk code # line 12 expsrc::var::trunk scripts # line 13 expsrc::ops::trunk code # line 14 src::gen::trunk code/GenMod_Constants # line 15 src::gen::trunk code/GenMod_Control # line 16 src::gen::trunk code/GenMod_FortranIO # line 17 src::gen::trunk code/GenMod_GetEnv # line 18 src::gen::trunk code/GenMod_ModelIO # line 19 src::gen::trunk code/GenMod_ObsInfo # line 20 src::gen::trunk code/GenMod_Platform # line 21 src::gen::trunk code/GenMod_Reporting # line 22 src::gen::trunk code/GenMod_Trace # line 23 src::gen::trunk code/GenMod_UMConstants # line 24 src::gen::trunk code/GenMod_Utilities # line 25 |
Here is an explanation of what each line does:
We shall end up with a directory tree such as:
$PWD | |--- cfg | | | |--- bld.cfg | |--- ext.cfg | |--- src | |--- gen | | | |--- code | | | |--- GenMod_Constants | |--- GenMod_Control | |--- GenMod_FortranIO | |--- GenMod_GetEnv | |--- GenMod_ModelIO | |--- GenMod_ObsInfo | |--- GenMod_Platform | |--- GenMod_Reporting | |--- GenMod_Trace | |--- GenMod_UMConstants | |--- GenMod_Utilities | |--- ops | | | |--- code | | | |--- ... | |--- var | |--- code | | | |--- ... | |--- scripts | |--- ...
Note - revision number |
---|
As seen in the above example, if a revision number is not specified for
a repository URL, it defaults to the "HEAD" revision. The revision
number can also be declared in other ways:
|
We have so far dealt with extraction from a single branch in any package. The extract system can be used to "combine" changes from different branches of a package. An example is given below:
Extract configuration example 5 - extract from multiple branches |
---|
cfg::type ext cfg::version 1.0 dest::rootdir $PWD repos::var::trunk fcm:var_tr repos::ops::trunk fcm:ops_tr repos::gen::trunk fcm:gen_tr version::gen::trunk 2468 expsrc::var::trunk code expsrc::var::trunk scripts expsrc::ops::trunk code src::gen::trunk code/GenMod_Constants src::gen::trunk code/GenMod_Control src::gen::trunk code/GenMod_FortranIO src::gen::trunk code/GenMod_GetEnv src::gen::trunk code/GenMod_ModelIO src::gen::trunk code/GenMod_ObsInfo src::gen::trunk code/GenMod_Platform src::gen::trunk code/GenMod_Reporting src::gen::trunk code/GenMod_Trace src::gen::trunk code/GenMod_UMConstants src::gen::trunk code/GenMod_Utilities repos::var::new_stuff fcm:var_br/frva/r1234_new_stuff # line 27 repos::var::bug_fix fcm:var_br/frva/r1516_bug_fix # line 28 repos::ops::good_stuff fcm:ops_br/opsrc/r3188_good_stuff # line 29 |
The configuration file in example 5 is similar to that of example 4 except for the last three lines. Here is an explanation of what they do:
When we invoke the extract system, it will attempt to extract from the first declared branch of a package, if the last commit revision of the source directory is the same in all the branches. However, if the last commit revision of the source directory differs for different branches, the system will attempt to obtain an extract priority list for each source directory, using the following logic:
For the "var" package in the above example, let us assume that we have three source directory packages X, Y and Z under "code", and their commit revisions under "trunk" are 100. Let's say we have committed some changes to X and Z in the "new_stuff" branch at revision 102, and other changes to Y and Z in the "bug_fix" branch at revision 104, the extract priority lists for X, Y and Z will look like:
Once we have an extract priority list for a source directory, we can begin the extraction of source files in the source directory. The source directory of the base branch is extracted first, followed by that in the subsequent branches. If a source file in a subsequent branch has the same content as the that in the base branch, it is discarded. Otherwise, the following logic determines the branch to use:
override 1
Once the system has established which source files to use, it determines whether the destination is out of date or not. The destination is out of date if the source file does not exist or if its content differs from the version of the source file we are using. The system only updates the destination if it is considered to be out of date.
The extract system can also combine changes from branches in the Subversion repository and the local file system. The limitation is that there can only be one branch from the local file system. (By convention, the branch is named "user".)
It is also worth bearing in mind that the "user" branch always takes precedence over branches residing in Subversion repositories. Hence, source directories from a "user" branch are always placed at the end of the extract priority list.
Extracting from a mixture of Subversion repository and local file system is demonstrated in the next example.
Extract configuration example 6 - extract from multiple branches + user paths |
---|
cfg::type ext cfg::version 1.0 dest::rootdir $PWD repos::var::trunk fcm:var_tr repos::ops::trunk fcm:ops_tr repos::gen::trunk fcm:gen_tr version::gen::trunk 2468 expsrc::var::trunk code expsrc::var::trunk scripts expsrc::ops::trunk code src::gen::trunk code/GenMod_Constants src::gen::trunk code/GenMod_Control src::gen::trunk code/GenMod_FortranIO src::gen::trunk code/GenMod_GetEnv src::gen::trunk code/GenMod_ModelIO src::gen::trunk code/GenMod_ObsInfo src::gen::trunk code/GenMod_Platform src::gen::trunk code/GenMod_Reporting src::gen::trunk code/GenMod_Trace src::gen::trunk code/GenMod_UMConstants src::gen::trunk code/GenMod_Utilities repos::var::new_stuff fcm:var_br/frva/r1234_new_stuff repos::var::bug_fix fcm:var_br/frva/r1516_bug_fix repos::ops::good_stuff fcm:ops_br/opsrc/r3188_good_stuff repos::var::user $HOME/var/src # line 31 repos::gen::user $HOME/gen/src # line 32 |
Example 6 is similar to example 5 except that it is also extracting from local directories. Here is an explanation of the lines:
Note - the INC declaration |
---|
You have probably realised that the above examples have many repeated
lines. To avoid having repeated lines in multiple extract configuration
files, you can use INC declarations to "include" other extract
configuration files. For example, if the configuration file of example 5
is stored in the file "$HOME/var/example5/ext.cfg", line 1 to 29 of
example 6 can be replaced with an INC declaration. Example 6 can then be
written as:
inc $HOME/var/example5/ext.cfg repos::var::user $HOME/var/src repos::gen::user $HOME/gen/src Note: the INC declaration supports the special "environment variable" $HERE. If this variable is already set in the environment, it acts as a normal environment variable. However, if it is not set, it will be expanded into the container directory of the current extract configuration file. This feature is particularly useful if you are including a hierarchy of extract configurations from files in the same container directory in a repository. |
All the examples above dealt with standalone extraction, that is, the current extraction is independent of any other extraction. If a previous extraction exists in another location, the extract system can "USE" this previous extraction in your current extraction. This works like a normal incremental extraction, except that your extraction will only contain the changes you have specified (compared with the USEd extraction) instead of the full source directory tree. This type of incremental extraction is useful in several ways. For instance:
The following example is based on example 4 and 6. The assumption is that an extraction has already been performed at the directory "~frva/var/vn22.0" based on the configuration file in example 4.
Extract configuration example 7 - incremental extract based on a previous extraction |
---|
cfg::type ext cfg::version 1.0 dest::rootdir $PWD use ~frva/var/vn22.0 # line 6 repos::var::new_stuff fcm:var_br/frva/r1234_new_stuff # line 8 repos::var::bug_fix fcm:var_br/frva/r1516_bug_fix # line 9 repos::ops::good_stuff fcm:ops_br/opsrc/r3188_good_stuff # line 10 repos::var::user $HOME/var/src # line 12 repos::gen::user $HOME/gen/src # line 13 |
Running the extract system using the above configuration will trigger an incremental extraction, as if you are running an incremental extraction having modified the configuration file in example 4 to that of example 6. The only difference is that the original extraction using the example 4 configuration will be left untouched at "~frva/var/vn22.0", and the new extraction will contain only the changes in the branches declared from line 8 to 13.
If you are setting up an extraction to be reused, you do not have to perform a build. If you don't you will still gain the benefit of incremental file extraction, but you will be performing a full build of the code.
Configuration settings for feeding into the build system can be declared through the extract configuration file using the "BLD::" prefix. Any line in an extract configuration containing a label with such a prefix will be considered a build system variable. At the end of a successful extraction, the system strips out the "BLD::" prefix before writing these variables to the build configuration file. Some example entries are given between line 17 and 22 in the following configuration file:
Extract configuration example 8 - setting build configuration |
---|
cfg::type ext cfg::version 1.0 dest::rootdir $PWD repos::var::trunk fcm:var_tr repos::ops::trunk fcm:ops_tr repos::gen::trunk fcm:gen_tr version::gen::trunk 2468 expsrc::var::trunk code expsrc::var::trunk scripts expsrc::ops::trunk code src::gen::trunk code/GenMod_Constants src::gen::trunk code/GenMod_Control src::gen::trunk code/GenMod_FortranIO src::gen::trunk code/GenMod_GetEnv src::gen::trunk code/GenMod_ModelIO src::gen::trunk code/GenMod_ObsInfo src::gen::trunk code/GenMod_Platform src::gen::trunk code/GenMod_Reporting src::gen::trunk code/GenMod_Trace src::gen::trunk code/GenMod_UMConstants src::gen::trunk code/GenMod_Utilities bld::target VarProg_AnalysePF.exe # line 27 bld::tool::fc sxmpif90 # line 29 bld::tool::cc sxmpic++ # line 30 bld::tool::ld sxmpif90 # line 31 |
The above example is essentially the same as example 4, apart from the additional build configuration. The following is a simple explanation of what the lines represent: (For detail of the build system, please see the next chapter on The Build System.)
Note - user variable |
---|
When you start using the extract system to define compiler flags for
the build system, you may end up having to make a lot of long and
repetitive declarations. In such case, you may want to define variables to
replace the repetitive parts of the declarations. In the extract system,
you can define a local variable by making a declaration with a label that
begins with a percent sign "%". For example:
# Declare a variable %fred %fred -Cdebug -eC -Wf,-init heap=nan stack=nan bld::tool::fflags %fred # bld::tool::fflags -Cdebug -eC -Wf,-init heap=nan stack=nan bld::tool::fflags::foo %fred -f0 # bld::tool::fflags::foo -Cdebug -eC -Wf,-init heap=nan stack=nan -f0 bld::tool::fflags::bar -w %fred # bld::tool::fflags::bar -w -Cdebug -eC -Wf,-init heap=nan stack=nan |
The amount of diagnostic messages generated by the extract system is normally set to a level suitable for normal everyday operation. This is the default diagnostic verbose level 1. If you want a minimum amount of diagnostic messages, you should set the verbose level to 0. If you want more diagnostic messages, you can set the verbose level to 2 or 3. You can modify the verbose level in two ways. The first way is to set the environment variable FCM_VERBOSE to the desired verbose level. The second way is to invoke the extract system with the "-v <level>" option. (If set, the command line option overrides the environment variable.)
The following is a list of diagnostic output at each verbose level:
Verbose level | Possible output |
---|---|
0 |
|
1 |
|
2 |
|
3 |
|
The extract system can still be used if Subversion is not available. Clearly, you can only use local repositories. However, you can still do incremental extraction, mirror an extraction to another machine, or combine code from multiple local repositories.
If you are using Subversion but your server is down then clearly there is little you can do. However, if you already have an extraction then you can re-run fcm extract as long as the extract configuration file only refers to fixed revisions. If this is not the case then you can always use the expanded extract configuration file which can be found in "cfg/ext.cfg" under the extract destination root. This means that you can continue to makes changes to local code and do incremental extractions even whilst your Subversion server is down.