Frame Option bytes sent by server are too long and are rejected. Work-around please

acutetech · March 30, 2022, 7:30am

A device using AS923 region rules and DwellTime = 1 can receive only 11 bytes at DR2. I observe the TTN server sending 14 bytes of Frame Options. This packet is rejected by my device as violating the packet length rule, and consequently the device fails to operate.

Details follow: When my device has operated without a gateway, it has tracked the data rate/spreading factor down to DR2 and stored this in flash memory. AS923 region, DwellTime=1, DR2 has an 11 byte application payload limit. When the device boots again with a gateway connected to TTN V3 it joins properly (using DR2) . However its next uplink message receives a downlink message response from the server with 26 bytes: 12 bytes of header/MIC and 14 bytes of Frame Options (Fopts) MAC commands.

The device firmware detects the 14 bytes as being illegal (greater than 11 bytes) and so rejects the downlink message. The device cannot function.

The 14-byte Fopts field is as follows (raw data and parsed):
Raw data: 06 07 07 a0 af 8c 50 03 51 ff 00 01 09 35
06 = DevStatusReq (no payload)
07 = NewChannelReq (5 byte payload follows)
03 = LinkADRReq (4 byte payload follows)
09 = TxParamSetupReq (1 byte payload follows - including DwellTime = 1)

I suspect this is a bug - am I right?:

(a) The server should be aware of the 11 byte limit and not attempt to send 14 bytes.
(b) In any event, there is no need for the server to insist on DwellTime = 1 as there is no need for this restriction according to New Zealand regulations. (I posted on this here, but received no replies: Dwell Time and TxParamSetupReq for AS923 in New Zealand)

Is this a known problem? is there a work-around I can apply at the server level or device application level, without changing the LoRaWAN firmware?

Related post: How to set dwelltime to 0 at AS923 in lopy4?

Jeff-UK · March 30, 2022, 9:24am

The folk who have access to the servers, who develop the code, and configure the settings are the TTI core team rather than the Community forum contributors, moderators or volunteers, though some TTI’ers do visit not all threads get read or have the needed staff cover so best option is post to GIT if susected bug or post to the Things Slack boards - #support channel?

htdvisser · March 30, 2022, 1:06pm

Nice detailed description of the problem, thanks for that. Could you also let us know what frequency you configured for your end device (in the Console, go to your end device, then General Settings and expand the Network layer section)?

By the way: the LoRaWAN specification does not require end device to reject such downlinks:

The end-device SHALL only enforce the maximum Downlink MAC Payload Size defined for
DownlinkDwellTime = 0 (no dwell time enforced) regardless of the actual setting. This
prevents the end-device from discarding valid downlink messages which comply with the
regulatory requirements which may be unknown to the device (for example, when the device
is joining the network).

adrianmares · March 30, 2022, 1:32pm

Thanks for the report !

In addition to the frequency plan used, could you also tell us the MAC version and Regional Parameters version used during testing ?

Both older and newer RP versions are ambigous with respect to the settings that the end device will boot up with. To make things worse, there is no right answer to begin with, since the DownlinkDwellTime also changes the offset used for the RX1 data rate, so even being conservative with respect to the payload size will cause the RX1 window data rate to possibly be wrong (if the device is expecting DownlinkDwellTime=0).

The server insists on doing a TxParamSetup due to this matter - if the server does not ‘clarify’ on the DownlinkDwellTime, it is possible that RX1 transmissions won’t be possible.

If the boot time settings expected by the stack do not match the ones used by the end device (remember that the standard is ambigous here and for AS923 the boot settings are not provided), they may be provided using the --mac-settings.downlink-dwell-time and --mac-settings.uplink-dwell-time CLI options.

acutetech · March 31, 2022, 1:24am

Thanks all for your prompt replies. Additional info follows.

The console says settings for the end device are: “Asia 920-923MHz”, LoRaWAN spec 1.0.3", “RP001 regional Parameters 1.0.3 Rev A”

I am using the STM32WLE5 chip and an IDE provided by ST. A file st_readme.txt says:
“Implements LoRa Mac from Semtech/StackForce develop branch (26-May-2020 commits, version 4.4.4)”

I think the problematic code is in LoRaMac.c, ProcessRadioRxDone():

        case FRAME_TYPE_DATA_UNCONFIRMED_DOWN:
            // Check if the received payload size is valid
            getPhy.UplinkDwellTime = MacCtx.NvmCtx->MacParams.DownlinkDwellTime;
            getPhy.Datarate = MacCtx.McpsIndication.RxDatarate;
            getPhy.Attribute = PHY_MAX_PAYLOAD;

            // Get the maximum payload length
            if( MacCtx.NvmCtx->RepeaterSupport == true )
            {
                getPhy.Attribute = PHY_MAX_PAYLOAD_REPEATER;
            }

            phyParam = RegionGetPhyParam( MacCtx.NvmCtx->Region, &getPhy );

            if( ( MAX( 0, ( int16_t )( ( int16_t ) size - ( int16_t ) LORAMAC_FRAME_PAYLOAD_OVERHEAD_SIZE ) ) > ( int16_t )phyParam.Value ) ||
                ( size < LORAMAC_FRAME_PAYLOAD_MIN_SIZE ) )
            {
                MacCtx.McpsIndication.Status = LORAMAC_EVENT_INFO_STATUS_ERROR;

                PrepareRxDoneAbort( );
                return;
            }

(I have checked the latest LoRamac-node on Github and it seems unchanged.).

I think that at this time MacCtx.NvmCtx->MacParams.DownlinkDwellTime has been set to the default AS923_DEFAULT_DOWNLINK_DWELL_TIME which is 1. The RegionGetPhyParam() call returns a value based on both UplinkDwellTime and RxDatarate, which is 11 in this case.

If I understand Hylker, the code should be changed to:
getPhy.UplinkDwellTime = 0;

If so, then do you agree this is a bug in LoRaMac-node code base? If so is surprising that others have not found this before.

I may be able to fix this in my devices, but it would be nicer if there was a server-side fix, say split the MAC commands into two sets so there is never > 11 bytes.

Also - the server’s TxParamSetupReq MAC parameters are setting UplinkDwellTime and DownlinkDwellTime to 1. Are there any circumstances in which the server will set these to 0? The spec says “Used by the network server to set the maximum allowed dwell time and Max EIRP of end-device, based on local regulations” so it would seem that it is incumbent on the server code to determine my region and send 0.

Further confusion: the console has many options for the “Frequency Plan” setting, but these are not described using the spec’s terminology (e.g. AS923), nor can I see documentation that describes the differences. Is there a better AS923 setting for me than “Asia 920-923MHz”?

cslorabox · March 31, 2022, 3:00am

It seems like this issue in LoRaMAC-node has been subtly reported before, and then Semtech maintainers went around and did an unhelpful closing of issues based solely on time and not status…

github.com/Lora-net/LoRaMac-node

AS923 Region Incorrect Downlink Implementation

opened 04:30AM - 02 Oct 18 UTC

closed 02:07PM - 12 Feb 19 UTC

BenAtPip

Hi All, I'm currently working on an application using LRWAN from the ST-Cube …package which I understand uses this project for the Radio stack. I understand that this project deviates from the releases ST makes, but the activity on these posts and some of fixes I have seen have been applicable to the version we are implementing (a version of 1.2.0 which is being patched) My current issue is that while I can successfully send uplinks with the device I only sporadically receive downlinks from the backend. This is observed in Class A device type, with both ABP and OTAA, and with either ADR set to 0 or 1. The device typically receives it's first downlink from the LoRaWAN network, and responds to the MAC commands from the downlink. Following the first downlink however the device services downlinks without an apparent pattern. My observation is that either there is a receive window issue which prevents the device from receiving subsequent frames. My question is if anyone else has been experiencing these issues with either the ST code releases or this project, in either AS923 or other regions. I can provide further details on the device configuration and fixes implemented based on issues raised on this project as necessary.

and a second time here, again carelessly closed:

github.com/Lora-net/LoRaMac-node

LoRa Stack 4.4.0 and AS923 Dwell Problem

opened 09:43AM - 01 Mar 18 UTC

closed 01:29PM - 02 Jul 18 UTC

ARAradtec

question

The default Dwell uplink and downlink in AS923 is on. and since there is a max p…ayload check now in the stack Rx Done function, commands received in the RX windows from servers that were configured with dwell off, are longer then the allowed dwell on max payload and are discarded, including the TX Parameters Setup request command (that sets the dwell off ). (i am using multitech AEP MTCDT-H5-210A Firmware 1.4.3). setting the define **AS923_DEFAULT_DOWNLINK_DWELL_TIME** to 0 (off) solves the problem, and maybe it is a better default value for the end unit, because it allows the server to set the downlink dwell value for new joined units according to its configuration, and prevent the need to preconfigure the end unit dwell Values ,to match the server or the network dwell configuration.

The code has since been refactored a bit so the details of how the (unnecessary? mistaken?) enforcement is being performed have changed.

Interestingly, the problem was previously recognized and fixed in the radio chip receive setup code, but not in subsequent rx done processing:

github.com/Lora-net/LoRaMac-node

JoinAccept message not received when AS923_DEFAULT_DOWNLINK_DWELL_TIME is set to 1

opened 02:52AM - 31 May 17 UTC

closed 12:18PM - 07 Jul 17 UTC

anthonyblx

I have noticed that RegionAS923RxConfig() function will calculate the maxPayload… based on the DownlinkDwellTime. If the DownlinkDwellTime is 1 then maxPayload = MaxPayloadOfDatarateDwell1DownAS923[dr]; Which is a problem when we receive the JoinAccept message on the RxWindow1. As a result, Radio.SetMaxPayloadLength( modem, maxPayload + LORA_MAC_FRMPAYLOAD_OVERHEAD ); will set the maximum payload to 24 (maxPayload = 11, LORA_MAC_FRMPAYLOAD_OVERHEAD = 13). This is smaller than the JoinAccept which is 34 bytes long. Wouldn't it be better to ignore the DownlinkDwellTime when we are waiting on JoinAccept? Regards, Anthony

mluis1 · March 31, 2022, 7:59am

This is a quite complex subject and several discussion have been held concerning this.

The regional parameters specifications RP1 versions did not specify the default dwell time values for AU915 and AS923 regions.

Starting at RP2 versions the default dwell time for these regions is specified in order to avoid as much as possible potential issues.

On LoRaMac-node project when we have added support for AS923 regions we decided that the end-device should enforce the most restrictive limitations in order to ensure that it would always comply with countries national regulations. It was maybe not the best decision however we had to make one as it was not specified.

An end-device is only responsible to ensure that the uplink dwell time restrictions are respected. For downlinks it is the Network Server responsibility to ensure the respect of the downlink dwell time.
This is the reason why starting at RP2 specifications an uplink-dwell-time=1 and downlink-dwell-time=0 has been specified.

It has to be noted that potential issues may still happen in case a network server changes the Rx1DrOffset sent under the JoinAccept message. In case of AU915 and AS923 regions a network server shouldn’t do it as the first Rx window could become unusable.

In order to solve this issue I would recommend to update the LoRaMac stack to the latest version 4.6.0 which implements LoRaWAN 1.0.4 + RP2-1.0.1 specifications.
Version v4.4.4 is now quite old (May 26, 2020) and a lot of fixes have been done since. Please refer to the CHANGELOG.md file for further details.

github.com

Lora-net/LoRaMac-node/blob/master/CHANGELOG.md

# Changelog

All notable changes to this project will be documented in this file.

The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## LoRaWAN pre-certification results

Please refer to [Releases pre-certification-results](https://github.com/Lora-net/LoRaMac-node/wiki/releases-pre-certification-results) document for further information.

## [Unreleased]

## [4.6.0] - 2022-01-11

### General

- Release based on "LoRaWAN specification 1.0.4" and "LoRaWAN specification 1.1.0 + FCntDwn ERRATA" with "LoRaWAN Regional Parameters 2-1.0.1"
- GitHub reported issues corrections.

This file has been truncated. show original

Under v4.4.4 version the best way to solve the issue is to modify the AS923_DEFAULT_DOWNLINK_DWELL_TIME definition from 1 to 0.

github.com

Lora-net/LoRaMac-node/blob/ba17382bd5109513937afad07f068a781a503ef6/src/mac/region/RegionAS923.h#L128-L131

      
        
            /*!

             * Default downlink dwell time configuration

             */

            #define AS923_DEFAULT_DOWNLINK_DWELL_TIME           1

There were some recent discussions on this subject between my self and TTN and the outcome can be seen at following TheThingsNetwork/lorawan-stack issue: Maximum downlink payload size exceeded when dwell time is activated · Issue #4971 · TheThingsNetwork/lorawan-stack · GitHub

The following issues/PR are also related to the same subject:

github.com/TheThingsNetwork/lorawan-stack

Record if UplinkDwellTime is enabled on boot

opened 03:40PM - 21 May 19 UTC

closed 04:28PM - 04 Feb 22 UTC

rvolosatovs

c/network server s/in progress

#### Summary Record if `UplinkDwellTime` is enabled after device boot in band structure. Refs #709 #### Why do we need this? Network Server can avoid sending a redundant `TxParamSetupReq`, in cases, where `UplinkDownlinkTime` is set on boot and required by the frequency plan. #### What is already there? What do you see now?  Not this #### What is missing? What do you want to see? A field indicating whether the `UplinkDwellTime` is enabled after device boot, if such information is recorded in the spec. E.g. in `1.0.3` AU915: ![2019-05-21-17:18:12-screenshot](https://user-images.githubusercontent.com/12877905/58109788-88427380-7bee-11e9-8980-fcf74b75306d.png) Network Server should use this value to construct the MAC state of device on reset. #### How do you propose to implement this? `UplinkDwellTimeOnBoot *bool` on `band.Band` (nicer naming is welcome) Update `newMACState` in NS #### Can you do this yourself and submit a Pull Request? yes

github.com/TheThingsNetwork/lorawan-stack

Dwell time boot configuration

TheThingsNetwork:v3.18 ← TheThingsNetwork:feature/4971-dwell-time-config

opened 08:23PM - 24 Jan 22 UTC

adriansmares

+1298 -344

#### Summary  Closes https://github.com/TheThingsNetwork/lorawan-stack/issues/4971 Closes https://github.com/TheThingsNetwork/lorawan-stack/issues/725 #### Changes - Add the boot dwell time configuration options to `MACSettings` - Use the boot dwell time configuration in order to determine the current parameters of the end device after join/reset - Desired parameters are unchanged, only a bit of styling added - Add the boot dwell time configuration to specific versions of the ~AS923 and~ AU915 bands - ~I've added these only to the ones which have a LoRaMAC node release~ #### Testing Unit testing. ##### Regressions I'm a bit uncomfortable about the AS923 having default uplink dwell time enabled at boot time. Per spec, the device is not forced to have this dwell time enabled, so we may reject uplinks which have dwell time disabled at boot time. Are we sure we want to have these boot time options available in that band ? #### Checklist - [x] Scope: The referenced issue is addressed, there are no unrelated changes. - [x] Compatibility: The changes are backwards compatible with existing API, storage, configuration and CLI, according to the compatibility commitments in `README.md` for the chosen target branch. - [ ] Documentation: Relevant documentation is added or updated. - [ ] Changelog: Significant features, behavior changes, deprecations and fixes are added to `CHANGELOG.md`. - [x] Commits: Commit messages follow guidelines in `CONTRIBUTING.md`, there are no fixup commits left.

cslorabox · March 31, 2022, 3:37pm

That is correct thinking.

But having the end device enforce downlink payload length limits based on dwell time is mistaken and counterproductive - it does nothing to ensure compliance, because the end device does not control what the network transmits.

The reason the downlink dwell times needs to be accurately known is to determine the minimum downlink data rate in order to correctly set the radio to receive what the network might transmit.

But the mistaken code enforcing downlink packet size limits needs to be removed.

acutetech · March 31, 2022, 10:00pm

Thanks all.

As I understand it, mluis suggests two fixes, both involving changing the device code. This is reasonable for new devices but difficult to implement for devices in the field.

mluis references this Maximum downlink payload size exceeded when dwell time is activated · Issue #4971 · TheThingsNetwork/lorawan-stack · GitHub in which the summary is “Maximum downlink payload size is exceeded when the end device has dwell time enabled, but the Network Server’s frequency plan does not.” This kind of implies the problem does not exist if the Network Server’s frequency plan does have dwell time enabled. However, I think the problem exists for both settings “Asia 920-923 MHz” and “Asia 920-923 MHz (used by TTN Australia)” - in both cases the server sends 14 bytes of Fopts, which the device rejects. (The difference is that the TxParamSetupReq MAC clears both DwellTime bits for the Australian frequency plan).

It should be possible for the NS to foresee this problem and so split the Fopts in two - it would be benign to defer the NewChannelReq MAC for a later downlink, for example.

Or am I missing something?

descartes · March 31, 2022, 10:47pm

And is somewhat perverse - the “damage” is done, rejecting the MAC commands is likely to result in the NS rescheduling the very same downlink, thereby exacerbating the situation.

Hopefully the TTI team can implement something to split the sequence over a number of downlinks to accommodate existing devices perhaps by the crude method of not allowing NewChannelReq to be sent with LinkADRReq.

I will now go and dig through some recent firmware development to see if I have devices that may trip up over this corner case!

acutetech · April 1, 2022, 1:18am

Hi Nick

The NS does indeed send the same MAC commands in every downlink, and the device keeps on rejecting them.

I would have thought that it should be pretty easy to replicate, as most devices using the LoRaMac-node code base should behave the same way. I think what you need to do is to boot a device in the absence of a gateway. That way the device will retry using successively lower data rates, until it reaches DR2. I think this is saved in NVM. The next time it boots with a gateway present it will join the network OK (using DR2) but then you hit the problem reported here. Ironically, the LinkADRReq MAC gives the device permission to use a higher DR rate and so longer packets, but…

cslorabox · April 1, 2022, 1:25am

In a directly cabled off-air compliance test, yes.

But to experience it in the real world one would have to be in a region where a dwell limit applies in some settings such as various AS923 countries rather than one where it never applies like EU868 or applies all of the time like US915 (which compensates with 500 KHz downlink BW anyway)

acutetech · April 3, 2022, 11:19pm

Just to confirm: I made this change and the problem goes away:

I am still of the opinion that a server-side fix (NS does not try to send > 11 bytes of MAC commands) would be useful also.

descartes · April 4, 2022, 9:00am

Please raise this as an issue on GitHub - it won’t be picked up from the forum.

acutetech · April 5, 2022, 7:31am

Issue submitted here: Network Server should be aware of maximum downlink payload sizes and split MAC messages if appropriate · Issue #5370 · TheThingsNetwork/lorawan-stack · GitHub