1 | <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> |
---|
2 | |
---|
3 | <html> |
---|
4 | <head> |
---|
5 | <title>FCM Detailed Design: Extract System</title> |
---|
6 | <meta name="author" content="FCM development team"> |
---|
7 | <meta name="descriptions" content="FCM Detailed Design: Extract System"> |
---|
8 | <meta name="keywords" content="FCM, design"> |
---|
9 | <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> |
---|
10 | <link rel="stylesheet" type="text/css" href="style.css"> |
---|
11 | </head> |
---|
12 | |
---|
13 | <body> |
---|
14 | <address> |
---|
15 | <a href="index.html">FCM Detailed Design</a> > Extract System |
---|
16 | </address> |
---|
17 | |
---|
18 | <h1>Extract System</h1> |
---|
19 | |
---|
20 | <p>In this chapter, we shall discuss in detail the design of the extract |
---|
21 | system. For information of how to use the extract system, please see: <a |
---|
22 | href="../user_guide/extract.html">FCM System User Guide > The Extract |
---|
23 | System</a>.</p> |
---|
24 | |
---|
25 | <p>The extract system extracts source directories from different branches of |
---|
26 | Subversion repositories, combining them with source directories from the |
---|
27 | local file system to give a source directory tree suitable for feeding into |
---|
28 | the build system. The system is written in a set of Perl modules. The extract |
---|
29 | system uses a similar interface to the build system. It shares the same |
---|
30 | command line interface and many other utilities with the code management |
---|
31 | system and the build system.</p> |
---|
32 | |
---|
33 | <h2><a name="io">Input and Output</a></h2> |
---|
34 | |
---|
35 | <p>The extract system should provide the following outputs:</p> |
---|
36 | |
---|
37 | <ul class="pad"> |
---|
38 | <li>a directory tree with the extracted source code suitable for feeding |
---|
39 | into the build system.</li> |
---|
40 | |
---|
41 | <li>a configuration file for the build system.</li> |
---|
42 | |
---|
43 | <li>an expanded version of the current configuration file, so that the |
---|
44 | current extraction can be re-used.</li> |
---|
45 | </ul> |
---|
46 | |
---|
47 | <p>The following inputs are required by the extract system:</p> |
---|
48 | |
---|
49 | <ul class="pad"> |
---|
50 | <li>the location of the destination.</li> |
---|
51 | |
---|
52 | <li>the location of the source, e.g a repository URL or a local file |
---|
53 | system path.</li> |
---|
54 | |
---|
55 | <li>the revision of the source, if applicable.</li> |
---|
56 | |
---|
57 | <li>extra configuration settings to be exported to the build system.</li> |
---|
58 | |
---|
59 | <li>the location of previous extractions, if the current extract is to be |
---|
60 | based on them.</li> |
---|
61 | </ul> |
---|
62 | |
---|
63 | <h2><a name="component">Components</a></h2> |
---|
64 | |
---|
65 | <p>The extract system uses the following commands, modules and tools:</p> |
---|
66 | |
---|
67 | <table class="pad" summary="build system components" border="1"> |
---|
68 | <tr> |
---|
69 | <th>Name</th> |
---|
70 | |
---|
71 | <th>Category</th> |
---|
72 | |
---|
73 | <th>Description</th> |
---|
74 | </tr> |
---|
75 | |
---|
76 | <tr> |
---|
77 | <th>fcm</th> |
---|
78 | |
---|
79 | <td>Perl executable</td> |
---|
80 | |
---|
81 | <td>Top level command line interface of the FCM system.</td> |
---|
82 | </tr> |
---|
83 | |
---|
84 | <tr> |
---|
85 | <th>Fcm::CfgFile</th> |
---|
86 | |
---|
87 | <td>Perl module</td> |
---|
88 | |
---|
89 | <td>A class for reading from and writing to configuration files.</td> |
---|
90 | </tr> |
---|
91 | |
---|
92 | <tr> |
---|
93 | <th>Fcm::Config</th> |
---|
94 | |
---|
95 | <td>Perl module</td> |
---|
96 | |
---|
97 | <td>A class that contains the configuration settings shared by all |
---|
98 | FCM components.</td> |
---|
99 | </tr> |
---|
100 | |
---|
101 | <tr> |
---|
102 | <th>Fcm::Extract</th> |
---|
103 | |
---|
104 | <td>Perl module</td> |
---|
105 | |
---|
106 | <td>Main class that controls the running of the extract system.</td> |
---|
107 | </tr> |
---|
108 | |
---|
109 | <tr> |
---|
110 | <th>Fcm::ReposBranch</th> |
---|
111 | |
---|
112 | <td>Perl module</td> |
---|
113 | |
---|
114 | <td>A class that stores and processes information of a repository |
---|
115 | branch.</td> |
---|
116 | </tr> |
---|
117 | |
---|
118 | <tr> |
---|
119 | <th>Fcm::SrcDirLayer</th> |
---|
120 | |
---|
121 | <td>Perl module</td> |
---|
122 | |
---|
123 | <td>A class that stores and processes information of a "layer" in the |
---|
124 | extraction sequence of a source directory.</td> |
---|
125 | </tr> |
---|
126 | |
---|
127 | <tr> |
---|
128 | <th>Fcm::Util</th> |
---|
129 | |
---|
130 | <td>Perl module</td> |
---|
131 | |
---|
132 | <td>A collection of utilities shared by all FCM components.</td> |
---|
133 | </tr> |
---|
134 | |
---|
135 | <tr> |
---|
136 | <th>svn</th> |
---|
137 | |
---|
138 | <td>Subversion client</td> |
---|
139 | |
---|
140 | <td>The following sub-commands are used: "info", "list", "export" and |
---|
141 | "cat".</td> |
---|
142 | </tr> |
---|
143 | |
---|
144 | <tr> |
---|
145 | <th>ksh</th> |
---|
146 | |
---|
147 | <td>Unix shell</td> |
---|
148 | |
---|
149 | <td>The following shell commands are used: "cp", "rm" and "mkdir".</td> |
---|
150 | </tr> |
---|
151 | |
---|
152 | <tr> |
---|
153 | <th>rdist</th> |
---|
154 | |
---|
155 | <td>Unix utility</td> |
---|
156 | |
---|
157 | <td>A remote distribution tool for mirror the extracted source |
---|
158 | directory to a remote host.</td> |
---|
159 | </tr> |
---|
160 | |
---|
161 | <tr> |
---|
162 | <th>rsync</th> |
---|
163 | |
---|
164 | <td>Unix utility</td> |
---|
165 | |
---|
166 | <td>A remote synchronisation tool for mirror the extracted source |
---|
167 | directory to a remote host.</td> |
---|
168 | </tr> |
---|
169 | |
---|
170 | <tr> |
---|
171 | <th>remsh</th> |
---|
172 | |
---|
173 | <td>Unix command</td> |
---|
174 | |
---|
175 | <td>A command to invoke a shell on a remote host.</td> |
---|
176 | </tr> |
---|
177 | </table> |
---|
178 | |
---|
179 | <h2><a name="task">Task</a></h2> |
---|
180 | |
---|
181 | <p>To do its job, the extract system executes the following tasks in |
---|
182 | order:</p> |
---|
183 | |
---|
184 | <ul class="pad"> |
---|
185 | <li>parse/decipher the extract configuration file.</li> |
---|
186 | |
---|
187 | <li>establish the extraction sequence when dealing with extraction of |
---|
188 | multiple branches of the same package.</li> |
---|
189 | |
---|
190 | <li>extract or copy individual files from the source to the |
---|
191 | destination.</li> |
---|
192 | |
---|
193 | <li>generate of an expanded extraction configuration file to allow another |
---|
194 | extraction to be based on the current one.</li> |
---|
195 | |
---|
196 | <li>generate of a build configuration file.</li> |
---|
197 | |
---|
198 | <li>mirror extracted source code and configuration files to a remote build |
---|
199 | machine.</li> |
---|
200 | </ul> |
---|
201 | |
---|
202 | <h3><a name="task_cfg">The extract configuration</a></h3> |
---|
203 | |
---|
204 | <p>When we invoke the FCM command, it creates a new instance of Fcm::Config, |
---|
205 | which reads, processes and stores information from the central and user |
---|
206 | configuration file. Configuration settings in Fcm::Config are then |
---|
207 | accessible by all other modules used by the extract system.</p> |
---|
208 | |
---|
209 | <p>When we invoke the extract command, it creates a new instance of |
---|
210 | Fcm::Extract, which automatically creates a new instance of the |
---|
211 | Fcm::CfgFile. If an argument is specified in the command line, the argument |
---|
212 | is used as the "basis". Otherwise, the current working directory is taken as |
---|
213 | the basis. If the basis is a directory, Fcm::CfgFile will attempt to locate |
---|
214 | a file called "ext.cfg" under this directory. If such a file is not found, |
---|
215 | it will attempt to locate it under "cfg/ext.cfg". If the basis is a regular |
---|
216 | file, the file itself is used.</p> |
---|
217 | |
---|
218 | <p>Once a file is located, Fcm::CfgFile will attempt to parse it. This is |
---|
219 | done by reading and processing each line of the configuration file into |
---|
220 | separate label, value and comment fields. If an INC declaration is |
---|
221 | encountered, a new instance of Fcm::CfgFile is created to read the included |
---|
222 | file as specified. The included lines are then added to the current array. |
---|
223 | Each line is then pushed into an array that can be fetched as a property of |
---|
224 | Fcm::CfgFile. Internally, each line is recorded as a reference to a hash |
---|
225 | table with the following keys:</p> |
---|
226 | |
---|
227 | <ul class="pad"> |
---|
228 | <li>LABEL: the label of a declaration.</li> |
---|
229 | |
---|
230 | <li>VALUE: the value of a declaration.</li> |
---|
231 | |
---|
232 | <li>COMMENT: the comment following a declaration or the comment in a |
---|
233 | comment line.</li> |
---|
234 | |
---|
235 | <li>NUMBER: the line number of the current line in the source file.</li> |
---|
236 | |
---|
237 | <li>SRC: the name of the source file.</li> |
---|
238 | </ul> |
---|
239 | |
---|
240 | <p>The information given by each line is "deciphered" by Fcm::Extract. The |
---|
241 | information is processed in the following ways:</p> |
---|
242 | |
---|
243 | <ul class="pad"> |
---|
244 | <li>The configuration file type and version declarations are stored as |
---|
245 | properties of the Fcm::CfgFile instance. Fcm::Extract uses the |
---|
246 | information to ensure that it is reading an extract configuration |
---|
247 | file.</li> |
---|
248 | |
---|
249 | <li>The destination directory declarations are stored in a hash table, |
---|
250 | which is a property of the Fcm::Extract instance.</li> |
---|
251 | |
---|
252 | <li>The remote destionation machine, logname and directory declarations |
---|
253 | are stored in a hash table, which is another property of the Fcm::Extract |
---|
254 | instance.</li> |
---|
255 | |
---|
256 | <li>Build configuration declarations are stored in a hash table, which is |
---|
257 | yet another property of the Fcm::Extract instance.</li> |
---|
258 | |
---|
259 | <li>For each declaration of a repository branch, a new instance of |
---|
260 | Fcm::ReposBranch is created, if it does not already exist. (A |
---|
261 | Fcm::ReposBranch instance is identified by a "tag" property, which is the |
---|
262 | combination of its package name and its branch name.) The REPOS, VERSION, |
---|
263 | SRC and EXPSRC declarations set the "repos", "version", "dir" and "expdir" |
---|
264 | properties of the Fcm::ReposBranch instance.</li> |
---|
265 | |
---|
266 | <li>The override mode flag is stored as a property of the Fcm::Extract |
---|
267 | instance.</li> |
---|
268 | |
---|
269 | <li>For each declaration to USE a previous extract, a new instance of |
---|
270 | Fcm::Extract is created, with its "extracted" property set to true. The |
---|
271 | instance Fcm::Extract for the previous extraction creates a new instance of |
---|
272 | Fcm::CfgFile for its configuration file. The configuration of the previous |
---|
273 | extract is read and processed similarly to what was described above. The |
---|
274 | current instance of Fcm::Extract will then attempt to inherit the settings |
---|
275 | of the previous extraction where appropriate. The instances of the |
---|
276 | previous extractions are stored in an array, which can be fetched as a |
---|
277 | property of the current Fcm::Extract instance.</li> |
---|
278 | |
---|
279 | <li>The INC declaration is ignored, as it is already processed by |
---|
280 | Fcm::CfgFile.</li> |
---|
281 | |
---|
282 | <li>The MIRROR declaration changes the setting (TOOL, MIRROR) in the |
---|
283 | Fcm::Config instance.</li> |
---|
284 | </ul> |
---|
285 | |
---|
286 | <p>If a full extraction is required, Fcm::Extract will attempt to remove any |
---|
287 | sub-directories created by previous extractions in the same location. |
---|
288 | Destination directories are (re-)created as they are required.</p> |
---|
289 | |
---|
290 | <p>For each repository branch, if the REPOS declaration is a file system |
---|
291 | path, the VERSION declaration will be set automatically to the word "USER". |
---|
292 | If the REPOS declaration matches a FCM URL keyword pattern, it is expanded |
---|
293 | to the full URL. If REPOS is not in the local file system and the VERSION |
---|
294 | declaration is not a number, the system will attempt to convert the keyword |
---|
295 | back to a number. If the keyword is "HEAD", the system will use <tt>"svn |
---|
296 | info"</tt> to determine the revision number. Otherwise, it will attempt to |
---|
297 | match the keyword with a pre-defined FCM revision keyword. If there are any |
---|
298 | expanded source directory (EXPSRC) declarations, the system will use |
---|
299 | <tt>"svn ls -R"</tt> to search recursively for all normal source directories |
---|
300 | containing regular files. These directories are then added to the "dir" |
---|
301 | property of the Fcm::ReposBranch instance.</p> |
---|
302 | |
---|
303 | <h3><a name="task_seq">The extraction sequence</a></h3> |
---|
304 | |
---|
305 | <p>In the next step, the extract system converts the information given in |
---|
306 | the list of repository branches into a list of source directory sub-package. |
---|
307 | Each source directory sub-package will have a destination and a "stack" of |
---|
308 | extraction sequence. The sequence is basically a list for locating the |
---|
309 | source directories in the repository branches. The order of the sequence is |
---|
310 | based on the order in which a repository branch is declared. The logic has |
---|
311 | already been discussed in the user guide.</p> |
---|
312 | |
---|
313 | <p>The sequence is implemented by a list of Fcm::SrcDirLayer instances. For |
---|
314 | each Fcm::SrcDirLayer instance in an extraction sequence of a source |
---|
315 | directory, the system will attempt to find out its "last commit" revision, |
---|
316 | using the <tt>"svn info"</tt> command on the particular revision on the |
---|
317 | given URL. This information is normally cached in a file called ".config" in |
---|
318 | the cache sub-directory of the extraction destination root. For an |
---|
319 | incremental extraction, the system will consult the cache to obtain the list |
---|
320 | of "last commit" revisions for the source directories, instead of having to |
---|
321 | go through a large number of <tt>"svn info"</tt> commands again. The cache |
---|
322 | file is read/written using a temporary instance of Fcm::CfgFile. The label |
---|
323 | in each line consists of the package name of the sub-package, its URL and a |
---|
324 | revision number. The corresponding value is the "last commit" revision at |
---|
325 | the given revision number.</p> |
---|
326 | |
---|
327 | <h3><a name="task_ext">The extraction</a></h3> |
---|
328 | |
---|
329 | <p>With the extraction sequence in place for each source directory, the |
---|
330 | extraction itself can now take place. There are two steps in this |
---|
331 | process.</p> |
---|
332 | |
---|
333 | <p>For each "layer" in the extraction sequence of each source directory, if |
---|
334 | the "layer" contains a repository URL, the system extracts from that URL the |
---|
335 | source directory and place the resulting source files in a cache. From the |
---|
336 | cache sub-directory of the destination root, the cache for each source |
---|
337 | directory is placed under a relative path that reflects the sub-package name |
---|
338 | of the source directory. Underneath this path is a list of directories |
---|
339 | with names reflecting the name of the branch and the "last commit" |
---|
340 | revision, (separated by double underscore "__"). These are where the cache of |
---|
341 | the source files for the "layers" of the source directory are placed.</p> |
---|
342 | |
---|
343 | <p>It is also worth noting that source files from the local file system are |
---|
344 | not cached. They will be taken directly from their locations.</p> |
---|
345 | |
---|
346 | <p>Once we have the cached "layers" (or branches) of the source directories, |
---|
347 | the system will select the source files from the correct cache before |
---|
348 | updating the destinations. The logic of which has already been discussed in |
---|
349 | the user guide.</p> |
---|
350 | |
---|
351 | <p>At the end of this stage, we should have a directory tree in the |
---|
352 | destination source sub-directory, with the relative paths to the extracted |
---|
353 | files reflecting the sub-package names of those files.</p> |
---|
354 | |
---|
355 | <h3><a name="task_gen">The extract/build configuration generator</a></h3> |
---|
356 | |
---|
357 | <p>If extraction completes without any error, the system will attempt to |
---|
358 | write an expanded extract configuration file, where all revision keywords |
---|
359 | are expanded into numbers, and all source directory packages are declared. |
---|
360 | Subsequent dependent extractions will be able to re-use this configuration |
---|
361 | without having to invoke the Subversion client for repository and revision |
---|
362 | information.</p> |
---|
363 | |
---|
364 | <p>The system will also attempt to produce a build configuration file for |
---|
365 | feeding to the build system. The following "conversions" are performed:</p> |
---|
366 | |
---|
367 | <ul> |
---|
368 | <li>The destination root becomes the build root.</li> |
---|
369 | |
---|
370 | <li>The destination directories are all are declared as the source |
---|
371 | directories.</li> |
---|
372 | |
---|
373 | <li>All BLD declarations are unchanged, except that the BLD prefixes are |
---|
374 | removed from the label.</li> |
---|
375 | |
---|
376 | <li>All USE extract configuration statements are converted to USE build |
---|
377 | configuration statements.</li> |
---|
378 | </ul> |
---|
379 | |
---|
380 | <h3><a name="task_mir">The mirror interface</a></h3> |
---|
381 | |
---|
382 | <p>The system uses "rdist" or "rsync" to mirror the extracted source code |
---|
383 | and the generated configuration files to a remote machine.</p> |
---|
384 | |
---|
385 | <script type="text/javascript" src="maintain.js"> |
---|
386 | </script> |
---|
387 | </body> |
---|
388 | </html> |
---|