doc/html/requirements.html

0001 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML//EN">
0002 <BODY bgcolor="FFFFFF">
0003 <title>
0004           CMS MessageLogger: Open Issues Concerning Features Wanted by CMS
0005 </title>
0006
0007 <center>
0008 <h1> <img src="header-public.gif" align="center"> </h1>
0009
0010 <font color=red>
0011 <h1>CMS MessageLogger Service
0012 <br>
0013 Open Issues Concerning Features Wanted by CMS</h1>
0014 </font>
0015 </center>
0016 <ul>
0017 <li> <a href=#multipleID> Multiple Message Categories </a>
0018 <li> <a href=#log4cplus> Destination Reporting to log4cplus </a>
0019     <ul> <font color=blue> Implemented </font> </ul>
0020 <li> <a href=#probe> Probing for whether a message will be reported </a>
0021     <ul> <font color=blue> Interface Implemented </font> </ul>
0022 <li> <a href=#statistics> Statistics Destination </a>
0023 <li> <a href=#filtering> Other Filtering Options </a>
0024 <li> <a href=#context> Context (e.g., event number) in Messages </a>
0025 <li> <a href=#endl> Support For Use Of endl in Messages </a>
0026 <li> <a href=#messageobj> Multi-statement Building of Message Objects </a>
0027 <li> <a href=#control> Post-configuration Control of Logging Behavior </a>
0028 </ul>
0029
0030 <hr>
0031 <a name=#multipleID></a>
0032 <h2>Multiple Message Categories </h2>
0033
0034 <h3> The requirement </h3>
0035 A message can be issued  with multiple categoriess, as in
0036 <pre>
0037   edm::LogWarning(tracking&amp;overflow) << "some more text";
0038 </pre>
0039
0040 This is to be treated as both a category=tracking message,
0041 and an category=overflow
0042 message.
0043
0044 <h3> Questions </h3>
0045
0046 What does it mean to be two types of message.  The easiest thing to do would
0047 be to issue two distinct messages but assumedly that is not the most desirable
0048 behavior.  The questions include:
0049 <ul>
0050 <li> How should the output message look?  (We implied that just including the
0051 combined categories as the ID would be OK.)
0052 <li> Which destinations should react to this message?  (Probably destinations
0053 that would react to either of the combined categories.)
0054 <li> What should this do to the message counts for each category
0055  for each destination?
0056 <li> How is this treated for statistics?  (Probably one message of each type;
0057 but the first-two, last-one contexts might want to behave specially.)
0058 </ul>
0059
0060 <h3> Proposed treatment </h3>
0061 When a message is issued
0062 <pre>
0063   edm::LogWarning(tracking&amp;overflow) << "some more text";
0064 </pre>
0065 message counts for both "tracking" and "overflow" ID's are incremented.
0066 If a given destination would react to <em>either</em> a "tracking" message
0067 or an "overflow" message, it will react to this message, but only a single
0068 copy of the message output will appear.
0069 <p>
0070 In that message output, the category will look like tracking&amp;overflow,
0071 and even if
0072 the combined length is longer than the normally permitted length the full
0073 combined categoriess will appear.
0074 <p>
0075 Statistics destinations will note the appearance of one "tracking" message
0076 and one "overflow" message; thus the total of counts by message ID will no
0077 longer match the total of counts by severity.  The context (of the first
0078 two and last one of each type of message) kept by the statistics destination
0079 will react as it would to a single message of each of the categories.  Thus it
0080 is possible to have the same context (event number or whatever) noted twice
0081 (in two different categories) caused by this one error message.
0082
0083 <h3> Work consequences </h3>
0084
0085 The "issue the message twice" attitude would require no work on ErrorLogger
0086 internals, and routine work in the LogXYZ() functions. However, that is not
0087 the desired behavior.
0088 <p>
0089 The above proposal will require modifying the nature of a message ID within
0090 ErrorLogger.  At some points it is a potentially compound ID; at others (for
0091 example, in the limit maps and statistics maps) it is exactly as before.
0092 This is non-trivial, as the messageID penetrates much of the package, but
0093 as long as the behavior to shoot for is well-defined, this appears to be a
0094 solvable task.
0095
0096 <h3> Decision and Plan </h3>
0097
0098 The proposed treatment described above is agreed upon.
0099
0100 In order that the user be able to use the multiple-category syntax as soon as
0101 possible, we will temporarily implement the double-message treatment.
0102
0103 <hr>
0104 <a name=#log4cplus></a>
0105 <h2>Destination Reporting to log4cplus  </h2>
0106
0107 <h3> The requirement </h3>
0108
0109 We need a destination which, rather than sending to a file or ostream,
0110 delivers the header and text of the message to the CMS log4cplus facility.
0111
0112 <h3> Questions </h3>
0113
0114 What do we do to use log4cplus, and what options does it have:
0115 <ul>
0116 <li> Should we have a default destination of log4cplus, so that the user gets it
0117 even without mention in the .cfg file?
0118 <li> Is it mature enough to just use, or do we check out the package (and
0119 if so, can we make needed tweaks)?
0120 <li> What choices (if any) should we make if there are options in how to use
0121 log4cplus?
0122 <li> Are there any options which we should leave to the user via the .cfg file?
0123 </ul>
0124
0125
0126 <h3> Proposed treatment </h3>
0127
0128 <ul>
0129 <li>
0130 We create a ELlog4cplus destination class which reports to log4cplus.
0131 This is similar to ELoutput, but probably formats the header and remainder
0132 of the message separately.
0133 <li>
0134 We attach an ELlog4cplus destination always.
0135 <li>
0136 We provide a special named destination parameter "log4cplus" by which a user
0137 can control filtering of messages to that destination.
0138 <li>
0139 We are told by CMS which log4cplus options to use, and we do not provide
0140 further flexibility from the .cfg file for the user changing those options.
0141 </ul>
0142
0143 <h3> Work consequences </h3>
0144
0145 There are three potentially significant areas of work:
0146 <ol>
0147 <li> Understanding how to use log4cplus may be easy (especially if the
0148 product is mature and well documented) or arbitrarily hard.  We can give no
0149 estimate on how long this will take until we try it out.
0150 <li> Creating the ELlog4cplus destination will be some non-trivial but
0151 fixed amount of semi-routine work, since it is done with some knowledge of
0152 how ELoutput works.
0153 <li> Providing the parameters to control flexibility will take about half a day
0154 per parameter, once we know what sort of .cfg control (if any) is needed by CMS.
0155 </ol>
0156
0157 <h3> Decision and Plan </h3>
0158
0159 Instead of the proposed treatment, we have created a
0160 <font color = red>MLlog4cplus</font> service which ensures that
0161 an ELlog4cplus destination is attached to the logger.  This has the
0162 advantage of cutting any dependencies of MessageLogger on log4cplus (which is
0163 in xdaq).
0164
0165 The log4cplus capability is as of 12/23/05 in place and tested.  It is up to
0166 the user to deal with log4cplus, in particular, to assign whatever appenders
0167 are needed for the job.
0168
0169 Two temporary conditions are left:
0170 <ol>
0171 <li>
0172 We currently assign a file_appender writing to log4cplus.output whenever
0173 the MLlog4cplus service is specified in the .cfg file; it is
0174 probable that CMS will want to change this default action.
0175 <li>
0176 We have not yet done the work to allow the user to control, via the .cfg file,
0177 MessageLogger filtering of the ELlog4cplus destination.
0178 </ol>
0179
0180 <hr>
0181 <a name=probe></a>
0182 <h2>Probing for whether a message will be reported </a>
0183 </ul>  </h2>
0184
0185 <h3> The requirement </h3>
0186
0187 User code may sometimes generate messages which under many circumstances would
0188 be ignored by all destinations.  It would be desirable to be able to quickly
0189 probe whether any destination would respond to a given message severity
0190 and category, so that the work of preparing and formating the message items
0191 can be skipped if appropriate.
0192 <p>
0193
0194 There are two potential modes of usage for this capability.  The naive mode is
0195 to do the probe each time a message is to be prepared.  Another mode is to
0196 assume that the result of this probe will remain static, and to cache either
0197 the first result or the first negative result.  In that mode, a message
0198 which would be ignored will cost only one conditional on the cached boolean.
0199
0200 <h3> Questions </h3>
0201
0202 Should we also try to automate this checking, to avoid the cost of the
0203 operator &lt;&lt; when the user appends items to the line to log a message?
0204 <p>
0205 Since the cost of a probe will not be trivial, should we cache results and
0206 consider the probe of each message type to be a one-shot affair?
0207 That is,
0208 if you have learned whether this type of message is reported,
0209 should we assume that this answer will not change.
0210 This is not strictly true because of limits, but it may be a useful shortcut.
0211
0212 <h3> Proposed treatment </h3>
0213
0214 Create 4 new functions:
0215 <font color=blue>
0216 <pre>
0217   ProbeLogError   (const std::string & category);
0218   ProbeLogWarning (const std::string & category);
0219   ProbeLogInfo    (const std::string & category);
0220   ProbeLogDebug   (const std::string & category);
0221 </pre>
0222 </font>
0223
0224 These would return true if any destination would respond to the
0225 corresponding message.
0226 <p>
0227 The first time a message type is probed, we must issue some sort of special
0228 two-phase command to the MessageLoggerScribe, and it must cause an actual check
0229 useing ErrorLogger code.
0230 Once a message type has been probed in this way, the answer will be cached.
0231 We should implement this cache via a map
0232 (in the local thread, to avoid the complications of locking against new
0233 entries by other threads) of all messages already probed.
0234 When the same message type is probed again, the map is consulted.
0235 <p>
0236 In order that we eventually get the efficiency of rapidly ignoring
0237 which were previously reported but
0238 have reached their limit, when a true result is found in the map,
0239 one time in about twenty we should re-check.
0240 <p>
0241 We can
0242 automate the check of the cached map to occur
0243 whenever messages are issued.  (The user still has the option of probing
0244 to avoid work to prepare the items to be added to the message.)
0245 The rules would be:
0246 <ul>
0247 <li>
0248 If a message has never been probed, the LogXYZ commands always send that
0249 message along to the ErrorLogger.
0250 <li>
0251 If a message has ever been probed and found not to be reportable,
0252 then the result of a probe (automatic or explicit) will be that the
0253 message is not reportable.  For unreportable messages,
0254 operator &lt;&lt to the results of the LogXYZ commands become no-ops.
0255 <li>
0256 Every 11-th time the cache of results is used and delivers true,
0257 that message is re-probed in case it is now being ignored.  This interval
0258 increases via an exponential backoff.
0259 <li>
0260 If a message has been probed and found to be reportable (either by the cache
0261 or by a new actual probe),
0262 then LogXYZ will send that message along to the ErrorLogger.
0263 </ul>
0264 However, we can check the local map:
0265 if a message has already been probed and found to be ignored,
0266 then the  operator &lt;&lt can become a no-op.
0267 <p>
0268 This presents another mode of usage, one which we can recommend:
0269 The user probes a message category
0270 just once, and relies on the system to obviate extra work by automatically
0271 checking when appropriate.
0272 <p>
0273 We do have to warn about the effect of this on message statistics:
0274 Messaages which are sent to the logger but ignored make it into the statistics,
0275 but messages which are never even sent do not.  (Assumedly, the user would
0276 not care about the count of messages in categories which are completely
0277 ignored.)
0278
0279 <h3> Quick temporary treatment </h3>
0280
0281 We can quickly implement the propbe functions as returning true, thus
0282 allowing user code to start using this mechanism (with no efficiency gain
0283 until we implement the actual probing) immediately.
0284
0285 <font color=red><b>
0286 This treatment is now in place.
0287 </b></font>
0288
0289 <h3> Work consequences </h3>
0290
0291 The ErrorLogger currently has no means of probing for responding destinations.
0292 This will need to be created.  The difficulty should be moderate (time estimate
0293 of 1 day).
0294 <p>
0295 The MessageLoggerScribe will need a new opcode to do the probe.  The probe
0296 data structure would be a pointer to one integer (not a bool, because the
0297 probing end has to know when the result is now valid).  The probe routine
0298 (on the client side) will need to either sleep on that value changing, or
0299 otherwise wait till the true/false value is established.  This is not as
0300 easy as the other part, but probably will take another 1 day.
0301 <p>
0302 The establishment of the cache will be easy,
0303 but the re-writes of LogWarning etc. to take
0304 advantage of known non-response results will be subtle.
0305 In particular, the LogDebug macro may be very subtle if we don't want
0306 to take the cost of forming and passing __FILE__ and __LINE__ when we don't
0307 have to.
0308 <p>
0309 The total time estimate for this feature is three solid days.
0310
0311 <hr>
0312 <a name=#statistics></a>
0313 <h2>Statistics Destination </h2>
0314
0315 <h3> The requirement </h3>
0316
0317 The ErrorLogger has a nice ELstatistics destination for summarizing
0318 the messages.
0319 <p>
0320 We need a way for MessageLogger service users to to specify a
0321 statistics destination.
0322
0323 <h3> Questions </h3>
0324
0325 <ul>
0326 <li> Should we let the .cfg file specify where statistics are to be sent,
0327 or have some default destination?
0328 <li> What should trigger the reporting of statistics?  (For example, it could
0329 be part of the end-of-job or end-of-run activity of the MessageLogger
0330 service.)
0331 <li> How should we obtain the "short context" to report the contexts of the
0332 first two and last one of each type of message -- or should we punt that
0333 feature?
0334 </ul>
0335
0336
0337 <h3> Proposed treatment </h3>
0338
0339 <ul>
0340 <li>
0341 We treat the statistics destinations analogously to the output destinations,
0342 allowing the user to provide a list.  This allows the user to control those
0343 features of the statistics destinations (including threshold and output control)
0344 which are needed.
0345 <li>
0346 By default, we create a single statistics destination sending its output to
0347 the first of the output destinations (or to the log4cplus destination).
0348 <li>
0349 The issue of context is solvable (we can obtain the event number, for example)
0350 but it is an open issue as to how we would do this.  For a start, we could
0351 disable the noting of contexts.
0352 </ul>
0353
0354 <h3> Work consequences </h3>
0355
0356 Other than the context issue, we can probably set up statistics in a day.
0357
0358
0359 <hr>
0360 <a name=#filtering></a>
0361 <h2>Other Filtering Options  </h2>
0362
0363 <h3> The requirement </h3>
0364 The ErrorLogger package supplies
0365 other ways to specify limits and thresholds on which messages a destination will
0366 report.  For instance, one can set a limit for all message IDs <em>except</em>
0367 a specified type.
0368 The MessageLogger service might benefit from enabling some of these further
0369 options via the configuration file.
0370 <p>
0371 <em>This requirement may or may not finally be requested.</em>
0372
0373
0374 <h3> Questions </h3>
0375
0376 <ul>
0377 <li> Do we want exclusion limits?
0378 <li> Do we want rapid-discard thresholds?
0379 <li> Other flexibility (see the
0380 <a href=
0381 "www.fnal.gov/docs/working-groups/fpcltf/Pkg/ErrorLogger/doc/html/0ErrorLogger.html">
0382 ZOOM ErrorLogger</a> documentation for possibilities)?
0383 </ul>
0384
0385 <h3> Proposed treatment </h3>
0386 Additional parameters in the .cfg file, either destination-specific lines
0387 within the destination PSet, or general lines within the service=MessageLogger
0388 PSet.
0389
0390 <h3> Work consequences </h3>
0391
0392 Requires no modifications to the ErrorLogger package.
0393 However, each degree of flexibility will require code in MessageLoggerScribe
0394 to understand the parameter and issue appropriate calls to the ErrorLogger.
0395 Also, each feature requires CMS MessageLogger service documentation (otherwise
0396 it is next to useless).  The total time to enable a feature is about half a day.
0397
0398
0399 <hr>
0400 <a name=#context></a>
0401 <h2>Context (e.g., event number) in Messages </a>
0402 </ul>  </h2>
0403
0404 <h3> The requirement </h3>
0405
0406 The ErrorLogger package supplies
0407 ways to automatically append context information to messages.  The intent
0408 is to indicate event/run numbers without the message issuer having to think
0409 about it.  Perhaps the MessageLogger service should support this.
0410 <p>
0411 <em>This requirement may or may not finally be requested.</em>
0412
0413
0414 <h3> Questions </h3>
0415
0416 What should we use as the context?
0417 <ul>
0418 <li> How do we obtain the context (error number or whatever)?
0419 <li> Is there a useful abbreviated form?
0420 <li> Does getting the context take too much time?
0421 </ul>
0422
0423 <h3> Proposed treatment </h3>
0424
0425 We find out how to get context from an existing EDM service, and write a
0426 context supplier (in the ErrorLogger sense) to do so).  We propose that the
0427 format of the context not be user configurable.
0428
0429 <h3> Work consequences </h3>
0430
0431 Requires no modifications to the ErrorLogger package.
0432 Writing the context supplier is likely to take less than a day, assuming the
0433 info is available from the EDM.
0434
0435
0436 <hr>
0437 <a name=endl></a>
0438 <h2>Support For Use Of endl in Messages </a>
0439 </ul>  </h2>
0440
0441 <h3> The requirement </h3>
0442
0443 Users are used to using std::endl as a line terminator.
0444 The use of \n is already supported for this, but it might be
0445 desirable to also support endl.
0446 <p>
0447 <em>This requirement may or may not finally be requested.</em>
0448
0449
0450 <h3> Questions </h3>
0451
0452 Is endl any different in effect than \n?
0453
0454 <h3> Proposed treatment </h3>
0455
0456 The ErrorLogger package implementers originally attempted use std::endl
0457 as a message terminator.
0458 This led to technical difficulties we could not, at that time, surmount
0459 (thus the introduction of the errmsg manipulator.
0460 <p>
0461 Jim Kowalkowski and Marc Paterno claim they know how to do this without
0462 those difficulties.  If so, we will adapt that technique, treating endl as
0463 a "force line termination" directive (as opposed to a message terminator).
0464 <p>
0465 If this is not easy, we should not do it, as the gain is slight.
0466 <h3> Work consequences </h3>
0467
0468 Requires learning how to handle endl, and implementing that in ELoutput
0469 and other destinations.  The danger is that the problem issues may be subtle, in
0470 which case it is best to abandon the idea.
0471
0472 <hr>
0473 <a name=messageobj></a>
0474 <h2>Multi-statement Building of Message Objects </a>
0475 </ul>  </h2>
0476
0477 <h3> The requirement </h3>
0478
0479 In the ErrorLogger package, the user can gradually build an ErrorObj containing
0480 th message, and later dispatch it ot the logger.  A similar capability
0481 might be desirable in the MessageLogger service.
0482
0483 <p>
0484 <em>This requirement may or may not finally be requested.</em>
0485
0486
0487 <h3> Questions </h3>
0488 <ul>
0489 <li>
0490 What is the best way do instantiate the object?
0491 <li>
0492 How should the message be fired off to the log?
0493 </ul>
0494 <h3> Proposed treatment </h3>
0495
0496 The ErrorObj in the ErrorLogger package already has its severity and ID imbedded
0497 upon construction, and we propose to do the same here.
0498 We would provide 3 classes (the Debug form has complications we choose not to
0499 deal with:  InfoMessage, WarningMessage, and ErrorMessage.  The ctor would take
0500 the message ID.
0501 <p>
0502 The user community would probably vote for some sort of send() member function
0503 or special endmsg manipulator to indicate that the message is to be dispatched,
0504 but this leaves around a dangerously already-sent message object.  The correct
0505 idiom is to dispatch the message when the object is destructed.  The user code
0506 should look like:
0507 <pre>
0508   if ( problem_is_detected ) {
0509     WarningMessage warn ("thistypeoftrouble");
0510     warn << "information is";
0511     while (there_is_more_information) {
0512       warn << get_a_piece_of_information();
0513     }
0514   } // here warn goes out of scope and the message is dispatched
0515 </pre>
0516
0517 <h3> Work consequences </h3>
0518
0519 This would be straightforward since no ErrorLogger code need be modified.
0520 Including documentation it should take a couple of days.
0521
0522
0523 <hr>
0524 <a name=control></a>
0525 <h2>Post-configuration Control of Logging Behavior </a>
0526 </ul>  </h2>
0527
0528 <h3> The requirement </h3>
0529
0530 The current service allows control of the behavior of destinations (filtering
0531 and thresholds) and the overall logger, only via job-start configuration
0532 parameters found in the .cfg file.
0533 We could also support modification of these choices under programmatic control.
0534 <p>
0535 <em>This requirement may or may not finally be requested.</em>
0536
0537
0538 <h3> Questions </h3>
0539 <ul>
0540 <li>
0541 What degree of control should we support?
0542 <li>
0543 What is the syntax of the code controlling logging behavior?
0544 </ul>
0545
0546 <h3> Proposed treatment </h3>
0547
0548 We immediately step onto a slippery slope (more like, step off a steep cliff)
0549 when we open up the idea of runtime control of these options.
0550 Developing an interface for this control would take weeks at the least, and
0551 there would inevitably be arguments over all the equally-good ways of expressing
0552 the desired behavior.
0553 <p>
0554 The only sensible answers are to provide no post-configuration control at all,
0555 or to provide everything in the ErrorLogger package.  In the latter case, the
0556 capability is provided by having the service provide points or references to
0557 the actual ErrorLogger package objects, such as ELadministrator and the various
0558 ELdestControl handles.  The user would need to look in the ErrorLogger package
0559 documentation for how to use these.
0560
0561 <h3> Work consequences </h3>
0562
0563 If we can stick to our guns about just providing handles to raw ErrorLogger
0564 objects, this should not be too hard.  There may be a bit of work in developing
0565 an interface to get the ELdestControl for a given destination, but we
0566 probably already have the structures in place for that.
0567 <p>
0568 If we need to develop a separate CMS-approved interface for post-configuration
0569 adjustment of behavior, this will be a hopeless and endless task.
0570
0571
0572 <p><center>
0573 <img src="bar.gif"></center>
0574
0575 <p><center>
0576 <a href="http://www.uscms.org/SoftwareComputing/index.html">
0577 USCMS Software and Computing Home Page </a> -
0578 <a href="MessageLogger.html">CMS MessageLogger Service Page</a>
0579 </center>
0580
0581 <p>
0582       <hr>
0583       <address><a href="mailto:mf@fnal.gov">Mark Fischler</a></address>
0584 <!-- hhmts start -->
0585 Last modified: December 1, 2005
0586 <!-- hhmts end -->
0587 </body>