GAUDI User Guide

Chapter 2
The framework architecture

2.1 Overview

In this chapter we briefly revisit some of the issues addressed in the Architecture Design Document which are of particular interest to physicists wishing to use the framework. We also define a few of the terms used throughout the rest of the document, and end with some practical guidelines on software packaging.

A more complete view of the architecture, along with a discussion of the main design choices and the reasons for them, may be found in reference [1].

2.2 Why architecture?

The basic "requirement" of the physicists in the collaboration is a set of programs for doing event simulation, reconstruction, visualisation, etc. and a set of tools which facilitate the writing of analysis programs. Additionally a physicist wants something that is easy to use and (though he or she may claim otherwise) is extremely flexible. The purpose of the Gaudi application framework is to provide software which fulfils these requirements, but which additionally addresses a larger set of requirements, including the use of some of the software online.

If the software is to be easy to use it must require a limited amount of learning on the part of the user. In particular, once learned, there should be no need to re-learn just because technology has moved on. (You do not need to re-take your licence every time you buy a new car.) Thus one of the principal design goals was to insulate users (physicist developers and physicist analysts) from irrelevant details, such as which software libraries we use for data I/O or for graphics. We have done this by developing an architecture. An architecture consists of the specification of a number of components and their interactions with each other. A component is a "block" of software which has a well specified interface and functionality. An interface is a collection of methods along with a statement of what each method actually does, i.e. its functionality.

We may summarise the main benefits we gain from this approach:

Flexibility

This approach gives flexibility because components may be plugged together in different ways to perform different tasks.

Simplicity
Software for using, for example, an object data base is in general fairly complex and time consuming to learn. Most of the detail is of little interest to someone who just wants to read data or store results. A "data access" component would have an interface which provided to the user only the required functionality. Additionally the interface would be the same independently of the underlying storage technology.
Robustness
As stated above a component can hide the underlying technology. As well as offering simplicity, this has the additional advantage that the underlying technology may be changed without the user even needing to know.

It is intended that almost all software written by physicists in the collaboration, whether for event generation, reconstruction or analysis, will be in the form of specialisations of a few specific components. Here, specialisation means taking a standard component and adding to its functionality while keeping the interface the same. Within the application framework this is done by deriving new classes from one of the base classes:

  • DataObject
  • Algorithm
  • Converter

In the rest of this chapter we will briefly consider the first two of these components and in particular the subject of the "separation" of data and algorithm. They will be covered in more depth in chapters 5 and 6. The third base class, Converter, exists more for technical necessity than anything else and will be discussed in chapter 13. Following this we give a brief outline of the main components that a physicist developer will come into contact with.
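The specialisation pattern described above can be sketched as follows. The base class shown here is a much-simplified stand-in (the real Gaudi Algorithm base class uses StatusCode return values and takes additional constructor arguments), and the concrete class name is invented for illustration:

```cpp
#include <string>

// Stand-in for the framework's Algorithm base class, shown only to
// illustrate the specialisation pattern; the real base class lives in
// the Gaudi headers and uses StatusCode rather than bool.
class Algorithm {
public:
  explicit Algorithm(const std::string& name) : m_name(name) {}
  virtual ~Algorithm() = default;
  // The interface that every specialisation keeps unchanged:
  virtual bool initialize() { return true; }
  virtual bool execute() = 0;
  virtual bool finalize() { return true; }
  const std::string& name() const { return m_name; }
private:
  std::string m_name;
};

// A specialisation: added functionality, same interface.
class EventCounter : public Algorithm {
public:
  using Algorithm::Algorithm;
  bool execute() override { ++m_seen; return true; }
  int eventsSeen() const { return m_seen; }
private:
  int m_seen = 0;
};
```

Because EventCounter keeps the base interface unchanged, the framework can drive it through initialize()/execute()/finalize() without knowing anything about the added functionality.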

2.3 Data versus code

Broadly speaking, tasks such as physics analysis and event reconstruction consist of the manipulation of mathematical or physical quantities: points, vectors, matrices, hits, momenta, etc., by algorithms which are generally specified in terms of equations and natural language. The mapping of this type of task into a programming language such as FORTRAN is very natural, since there is a very clear distinction between "data" and "code". Data consists of variables such as:

      integer n
      real p(3)

and code which may consist of a simple statement or a set of statements collected together into a function or procedure:

      real function innerProduct(p1, p2)
      real p1(3), p2(3)
      innerProduct = p1(1)*p2(1) + p1(2)*p2(2) + p1(3)*p2(3)
      end

Thus the physical and mathematical quantities map to data and the algorithms map to a collection of functions.

A priori, we see no reason why moving to a language which supports the idea of objects, such as C++, should change the way we think of doing physics analysis. Thus the idea of having essentially mathematical objects such as vectors and points, distinct from the more complex beasts which manipulate them, such as fitting algorithms, is still valid. This is the reason why the Gaudi application framework makes a clear distinction between "data" objects and "algorithm" objects.

Anything which has as its origin a concept such as hit, point, vector, trajectory, i.e. a clear "quantity-like" entity should be implemented by deriving a class from the DataObject base class.

On the other hand anything which is essentially a "procedure", i.e. a set of rules for performing transformations on more data-like objects, or for creating new data-like objects should be designed as a class derived from the Algorithm base class.

Furthermore, you should not have objects derived from DataObject performing long, complex algorithmic procedures. The intention is that these objects are "small".

Tracks which fit themselves are of course possible: you could have a constructor which took a list of hits as a parameter; but they are silly. Every track object would now have to contain all of the parameters used to perform the track fit, making it far from a simple object. Track-fitting is an algorithmic procedure; a track is probably best represented by a point and a vector, or perhaps a set of points and vectors. They are different.

2.4 Main components

The principal functionality of an algorithm is to take input data, manipulate it and produce new output data. Figure 1 shows how a concrete algorithm object interacts with the rest of the application framework to achieve this.

Figure 1 The main components of the framework as seen by an algorithm object.


The figure shows the four main services that algorithm objects use:

  • The event data service, which gives access to the event data store
  • The detector data service, which gives access to the detector data store
  • The histogram service
  • The message service

The particle property service is an example of additional services that are available to an algorithm. The job options service (see Chapter 11) is used by the Algorithm base class, but is not usually explicitly seen by a concrete algorithm.

Each of these services is provided by a component, and is used via an interface. The interface used by algorithm objects is shown in the figure; for example, for both the event data and detector data stores it is the IDataProviderSvc interface. In general a component implements more than one interface. For example, the event data store implements another interface, IDataManager, which is used by the application manager to clear the store before a new event is read in.

An algorithm's access to data, whether the data is coming from or going to a persistent store or whether it is coming from or going to another algorithm is always via one of the data store components. The IDataProviderSvc interface allows algorithms to access data in the store and to add new data to the store. It is discussed further in chapter 6 where we consider the data store components in more detail.

The histogram service is another type of data store intended for the storage of histograms and other "statistical" objects, i.e. data objects with a lifetime of longer than a single event. Access is via the IHistogramSvc which is an extension to the IDataProviderSvc interface, and is discussed in chapter 9. The n-tuple service is similar, with access via the INtupleSvc extension to the IDataProviderSvc interface, as discussed in Chapter 10.

In general an algorithm will be configurable: it will require certain parameters, such as cut-offs, upper limits on the number of iterations, convergence criteria, etc., to be initialised before the algorithm may be executed. These parameters may be specified at run time via the job options mechanism, which is implemented by the job options service. Though it is not explicitly shown in the figure, this component makes use of the IProperty interface, which is implemented by the Algorithm base class.

During its execution an algorithm may wish to make reports on its progress or on errors that occur. All communication with the outside world should go through the message service component via the IMessageSvc interface. Use of this interface is discussed in Chapter 11.

As mentioned above, by virtue of its derivation from the Algorithm base class, any concrete algorithm class implements the IAlgorithm and IProperty interfaces; the only methods that must be explicitly implemented by the concrete algorithm are initialize(), execute() and finalize(). IAlgorithm is used by the application manager to control top-level algorithms. IProperty is usually used only by the job options service.
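The shape of a concrete algorithm can be sketched with simplified stand-ins (bool in place of the framework's StatusCode, and a toy property map in place of the real IProperty/job options machinery; all names here are hypothetical):

```cpp
#include <map>
#include <string>

// Much-simplified stand-in for the Algorithm base class; the real
// IAlgorithm/IProperty interfaces belong to Gaudi and differ in detail.
class Algorithm {
public:
  virtual ~Algorithm() = default;
  virtual bool initialize() = 0;   // the three methods that a concrete
  virtual bool execute()    = 0;   // algorithm must implement itself
  virtual bool finalize()   = 0;
  // Toy stand-in for IProperty: the job options service would call this.
  void setProperty(const std::string& name, double value) { m_props[name] = value; }
protected:
  double property(const std::string& name, double def) const {
    auto it = m_props.find(name);
    return it == m_props.end() ? def : it->second;
  }
private:
  std::map<std::string, double> m_props;
};

class SelectTracks : public Algorithm {
public:
  bool initialize() override { m_ptCut = property("PtCut", 0.5); return true; }
  bool execute()    override { return true; }  // would apply m_ptCut to tracks
  bool finalize()   override { return true; }
  double ptCut() const { return m_ptCut; }
private:
  double m_ptCut = 0.0;
};
```

Note that the cut-off is read in initialize(), after the framework (here, a caller) has had a chance to set it, mirroring how job options are applied before an algorithm is initialised.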

The figure also shows that a concrete algorithm may make use of additional objects internally to aid it in its function. These private objects do not need to inherit from any particular base class so long as they are only used internally. These objects are under the complete control of the algorithm object itself and so care is required to avoid memory leaks etc.

We have used the terms "interface" and "implements" quite freely above. Let us be more explicit about what we mean. We use the term interface to describe a pure virtual C++ class, i.e. a class with no data members, and no implementation of the methods that it declares. For example:


class PureAbstractClass {
  virtual void method1() = 0;
  virtual void method2() = 0;
};

is a pure abstract class or interface. We say that a class implements such an interface if it is derived from it, for example:


class ConcreteComponent : public PureAbstractClass {
  void method1() { }
  void method2() { }
};

A component which implements more than one interface does so via multiple inheritance; however, since the interfaces are pure abstract classes, the usual problems associated with multiple inheritance do not occur. These interfaces are identified by a unique number which is available via a global constant of the form IID_InterfaceType, for example IID_IDataProviderSvc. Using these it is possible to enquire what interfaces a particular component implements (as shown, for example, by the use of queryInterface() in the finalize() method of the SimpleAnalysis example).
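The idea can be sketched in self-contained C++; the interface IDs, the names and the exact queryInterface() signature below are simplified stand-ins, not the real Gaudi declarations:

```cpp
// Illustration only: Gaudi's real IID_... constants and queryInterface()
// signature differ in detail; this shows the idea.
using InterfaceID = unsigned int;
const InterfaceID IID_IReader = 1;
const InterfaceID IID_IWriter = 2;

// Two pure abstract interfaces: no data members, no implementation.
struct IReader { virtual ~IReader() = default; virtual int  read()       = 0; };
struct IWriter { virtual ~IWriter() = default; virtual void write(int v) = 0; };

// One component implementing both interfaces via multiple inheritance;
// this is safe because the bases carry no data and no implementation.
class Store : public IReader, public IWriter {
public:
  int  read() override { return m_value; }
  void write(int v) override { m_value = v; }
  // Ask the component whether it implements a given interface.
  bool queryInterface(InterfaceID iid, void** iface) {
    if (iid == IID_IReader) { *iface = static_cast<IReader*>(this); return true; }
    if (iid == IID_IWriter) { *iface = static_cast<IWriter*>(this); return true; }
    *iface = nullptr;
    return false;
  }
private:
  int m_value = 0;
};
```

A client holding only an interface ID can discover at run time whether the component supports that interface, without knowing its concrete type.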

Within the framework every component, e.g. services and algorithms, has two qualities:

2.5 Package structure

For large software systems, such as ours, it is clearly important to decompose the system into hierarchies of smaller and more manageable entities. This decomposition can have important consequences for implementation-related issues, such as compile time, link dependencies, configuration management, etc. For this purpose we introduce the concept of a package: the grouping of related components into a cohesive physical entity. A package is also the minimal unit of software release.

We have decomposed the LHCb data processing software into the packages shown in Figure 2. At the lowest level we find Gaudi, which is the framework itself and does not depend on any external packages. At the second level are the packages containing standard framework components (GaudiSvc, GaudiTools, GaudiAlg, Auditors), as well as the detector description tools (DetDesc) and the LHCb-specific event data model (LHCbEvent). These packages depend on the framework and on widely available foundation libraries such as CLHEP and HTL. At the same level we have specific implementations of the histogram persistency service based on HBOOK (HbookCnv) and ROOT (RootHistCnv). At the next level come packages implementing event and detector data persistency services and converters. In this release we have an LHCb-specific one based on SICB and ZEBRA (SicbCnv) and a generic one (DbCnv) which currently understands ROOT and ODBC-compliant databases and is being tested with Objectivity/DB. Finally, at the top level we find the applications.

Figure 2 Current package structure of the LHCb software


2.5.1 Package Layout

Figure 3 shows the recommended layout for Gaudi packages (and for other LHCb software packages, such as sub-detector software). Note that the binaries directories are not in CVS, they are created by CMT when building a package.

Figure 3 Recommended layout of LHCb software packages


2.5.2 Packaging Guidelines for sub-detectors

Packaging is an important architectural issue for the Gaudi framework, but also for the sub-detector software packages based on Gaudi. Typically, sub-detector packages consist of:

The packaging should be such as to minimise the dependencies between packages, and must absolutely avoid cyclic dependencies. The granularity should not be too small or too big. Care should be taken to identify the external interfaces of packages: if the same interfaces are shared by many packages, they should be promoted to a more basic package that the others would then depend on. Finally, it is a good idea to discuss your packaging with the librarian and/or architect.

Note that sub-detectors are expected to organise their own project releases, for internal group consumption and integration tests, using the $LHCBDEV development area (described in Section 3.7.1). Once the code is ready for public, collaboration-wide use, it should be released (by the librarian) in the public release area ($LHCBSOFT).

2.5.3 Libraries in Gaudi

In the Gaudi framework development environment we can distinguish two different types of libraries: component libraries and linker libraries. These libraries are used for different purposes and built in different ways. Here we describe their characteristics.

2.5.3.1 Component libraries

These are libraries containing one or more software components implementing a number of abstract interfaces, as is the case for Gaudi libraries containing Algorithms, Services, Tools, etc. These libraries export only a single symbol, which is used internally by the framework to discover what the library contains. They are not needed when linking the application; they are only used if explicitly loaded at run time. This is controlled by the property ApplicationMgr.DLLs, which is the list of component libraries that the framework should load dynamically. Consequently, changes in the implementation of these libraries never require an application to be re-linked.
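For illustration, a job options fragment directing the framework to load two component libraries dynamically might look like the following (the library names here are examples, taken from the standard packages mentioned in Section 2.5; the exact syntax of job options files is described in Chapter 11):

```
ApplicationMgr.DLLs += { "GaudiAlg", "GaudiSvc" };
```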

When trying to load a component library (e.g. Comp), the framework looks for it in the following sequence:

  • Try to find an environment variable with the name of the library suffixed by "Shr" (e.g. ${CompShr}). If the environment variable exists, it should translate to the absolute location of the library without the platform-dependent file type suffix (e.g. ${CompShr} = "$LHCBSOFT/packA/v1/i386_linux22/libComp").
  • Try to locate the file libComp.so (on Linux) or libCompShr.dll (on NT) using LD_LIBRARY_PATH on Linux or PATH on NT.
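The lookup sequence above can be sketched for the Linux case using POSIX dlopen(). This is an illustrative reconstruction, not the framework's actual code; on NT the equivalent calls would be GetEnvironmentVariable and LoadLibrary:

```cpp
#include <cstdlib>   // std::getenv
#include <string>
#include <dlfcn.h>   // dlopen (POSIX)

// Illustrative sketch of the lookup sequence described above:
// 1. Try the environment variable "<name>Shr", which should hold the
//    library path without the platform-dependent suffix.
// 2. Fall back to lib<name>.so; dlopen() searches LD_LIBRARY_PATH itself
//    when given a bare file name.
void* loadComponentLibrary(const std::string& name) {
  if (const char* base = std::getenv((name + "Shr").c_str())) {
    if (void* handle = dlopen((std::string(base) + ".so").c_str(), RTLD_NOW))
      return handle;
  }
  return dlopen(("lib" + name + ".so").c_str(), RTLD_NOW);  // nullptr if not found
}
```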

2.5.3.2 Linker libraries

These are libraries containing implementation classes: for example, libraries containing the code of a number of base classes, or of specific classes without abstract interfaces. In contrast to component libraries, these libraries export all their symbols and are needed during the linking phase when building the application. They can be linked to the application "statically" or "dynamically", each requiring a different file format. In the first case the code is physically added to the executable file, so changes in these libraries require the application to be re-linked, even if the changes do not affect the interfaces. In the second case, the linker only adds to the executable the minimal information required for loading the library and resolving its symbols at run time. Locating and loading the proper shareable library at run time is done exclusively using LD_LIBRARY_PATH on Linux and PATH on NT.

2.5.3.3 Building the libraries

Using CMT, we have simplified the way these two types of libraries are built. The proper linker flags are predefined in the macros $(componentshr_linkopts) and $(libraryshr_linkopts) for component and linker libraries respectively. The macro xxxxx_shlibflags needs to be defined as shown in the private part of the requirements file below.

Listing 1 Defining CMT macros for building component and linker libraries

package Package
version v999
use ...

library CompLib Components/*.cpp
library LinkLib Library/*.cpp

include_path ${PACKAGEROOT}

macro PackageDir "$(PACKAGEROOT)/$(BINDIR)"
macro Package_linkopts "$(PackageDir)/libLinkLib.so" \
    VisualC "$(PackageDir)/libLinkLib.lib"
macro Package_stamps "$(PackageDir)/LinkLib.stamp"

private

macro CompLib_shlibflags "$(componentshr_linkopts) $(use_linkopts)"
macro LinkLib_shlibflags "$(libraryshr_linkopts) $(use_linkopts)" \
    VisualC "$(libraryshr_linkopts) $(use_linkopts) /def:$(PackageDir)/libLinkLib.def"

2.5.3.4 Linking FORTRAN code

Any library containing FORTRAN code (more specifically, code that references COMMON blocks) must be linked statically. This is because COMMON blocks are, by definition, static entities. When mixing C++ code with FORTRAN, it is recommended to build separate libraries for the C++ and the FORTRAN, and to write the code in such a way that communication between the C++ and FORTRAN worlds is done exclusively via wrappers, as is done for example in the SicbCnv package (see in particular the file SicbCnv/TopLevel/SicbFortran.cpp). In this way it is possible to build shareable libraries for the C++ code, even if it calls FORTRAN code internally.