notifications-api

mirror of https://github.com/GSA/notifications-api.git synced 2026-01-15 15:11:01 -05:00

Author	SHA1	Message	Date
sakisv	9faa3d34e1	Fix tests Specifically, no longer test for a p1 zendesk when sending an alert and drop misleading "p1" from test name when cancelling an alert. We're no longer creating a P1 from the code, but we _do_ create a zendesk ticket when sending out an alert. When cancelling, what we want to test is that we don't create a second ticket when the alert is cancelled.	2021-09-10 10:14:28 +03:00
Ben Thorner	bf0bf4e31c	Favour new "areas" format for PagerDuty alerts Broadcasts created via the API [1] and the Admin app [2] should both now have this field set. It's also more informative to show this, and broadcasts created via the API don't have IDs anyway. There's a small risk that an old broadcast that gets approved won't have this data, but it's for information only and we intend to backfill all old broadcasts in the near future. [1]: `023a06d5fb` [2]: `7dbe3afa19`	2021-08-27 14:22:12 +01:00
Ben Thorner	a7d92b9058	Replace / remove redundant uses of "areas" In one case ("areas=['manchester']") the format was even invalid, but in general the original value of the column is pretty much irrelevant for tests that involve updating it (it's highly unlikely the column would default to the same value as the test data).	2021-08-27 13:31:49 +01:00
Ben Thorner	312a895822	Merge pull request #3294 from alphagov/auto-expire-alerts-178926353 Auto expire old broadcast messages	2021-07-22 09:53:41 +01:00
Ben Thorner	5e9d8e5fa0	Auto expire old broadcast messages Since the expiry is sent as part of the message payload, we don't need to invoke the CBC proxies (and indeed there's no way to do so for an expired alert). In future we plan to extend this task so it triggers the regeneration of content on gov.uk/alerts. It's worth noting that 'finishes_at' can theoretically be None, in which case it's unclear when the alert should expire. While alerts from the Admin app should always have an expiry [1], we have many in the DB that don't, so it's worth checking for this scenario. [1]: `078ac10c8d/app/models/broadcast_message.py (L255)`	2021-07-21 13:05:11 +01:00
Ben Thorner	08f48379b4	Move ID generation into link test method Unlike the other IDs which are stored in the DB, this isn't relevant for the Celery task as it invokes a link test. Moving it into the proxy client will also enable us to generate a second ID in the next commits, where we start doing a link test for the failover lambda.	2021-07-19 16:00:55 +01:00
Ben Thorner	b6774bf0f7	Generate Vodafone link test sequence nos in proxy Previously the Celery task to trigger a link test had to know about the special case of a sequence number for Vodafone. Since we're about to change the client to perform multiple tests it makes sense to give it the knowledge of how to generate number itself. Note that we have to import the db inline to avoid a circular import, since this module is itself imported by app/__init__.py. Other invocations of the Vodafone client use stored sequence numbers from the DB, which are called "message numbers" in that context. Since the two use cases are very different (even the names are different!), having them in two places shouldn't cause any confusion.	2021-07-19 15:43:36 +01:00
Leo Hemsted	2ad9a3a380	retry service callbacks on 429 if we're served a 429, put the item on the retry queue and retry the same as if the service returned a 5xx. 429 is commonly returned for rate limit exceeding, and retrying on a delay is a typical response to that.	2021-07-13 16:09:17 +01:00
Pea Tyczynska	c28e9451d4	Bump moto version to try solve dependencies version conflict Also update mock import statements in some test files as they stopped working with this dependency update.	2021-07-08 15:37:19 +01:00
Rebecca Law	18dd9050a4	- make sure when processing a job that we check the total_sent + job.notification_count against the service.message_limit.	2021-06-28 13:07:48 +01:00
Rebecca Law	fd7486d751	- Merge daily limit functions into one, refactor call for daily limit check from process_job - refactor tests to standardise test names - refactor some tests to be more clear - remove unnecessary tests - include missing test	2021-06-24 11:05:22 +01:00
Rebecca Law	35b20ba363	Correct the daily limits cache. Last year we had an issue with the daily limit cache and the query that was populating it. As a result we have not been checking the daily limit properly. This PR should correct all that. The daily limit cache is not being incremented in app.notifications.process_notifications.persist_notification, this method is and should always be the only method used to create a notification. We increment the daily limit cache is redis is enabled (and it is always enabled for production) and the key type for the notification is team or normal. We check if the daily limit is exceed in many places: - app.celery.tasks.process_job - app.v2.notifications.post_notifications.post_notification - app.v2.notifications.post_notifications.post_precompiled_letter_notification - app.service.send_notification.send_one_off_notification - app.service.send_notification.send_pdf_letter_notification If the daily limits cache is not found, set the cache to 0 with an expiry of 24 hours. The daily limit cache key is service_id-yyy-mm-dd-count, so each day a new cache is created. The best thing about this PR is that the app.service_dao.fetch_todays_total_message_count query has been removed. This query was not performant and had been wrong for ages.	2021-06-22 16:15:36 +01:00
David McDonald	be035664c4	Add operator channel to broadcast settings route Looks identical to the government channel in terms of the interface	2021-06-09 13:49:06 +01:00
Rebecca Law	1bf5ce08b2	Add a error log for alert tasks. Many of the team members do not look at emails from zendesk, adding a current_app.logger.error message for things we care about to give developers a better chance of seeing them. I have purposely not added an erro log for `check_for_services_with_high_failure_rates_or_sending_to_tv_numbers` because it's not something we need to look at immediately.	2021-05-26 11:06:21 +01:00
Katie Smith	829b646931	Allow "government" in broadcast_channel schema This will allow admin to pass through a value of "government" for the broadcast_channel. We don't have any logic around the value of service.broadcast_channel, so no updates are needed to the tasks etc.	2021-05-11 16:56:56 +01:00
Katie Smith	4624328c36	Make service_broadcast_settings.provider non-nullable We set all existing null values to "all", then make the column non-nullable. Admin is already passing through the value of "all".	2021-05-10 15:59:22 +01:00
Katie Smith	1767535def	Allow service.allowed_broadcast_provider to be "all" We want to replace the value `None` for service.allowed_broadcast_provider with the value of "all". As a first step, we need to allow both values. Once notifications-admin has been changed to pass through "all" and all the data in the database has been updated, we can update the code to stop supporting both values.	2021-05-06 15:32:02 +01:00
Katie Smith	8365c749e4	Change letter zip file names for Insolvency Service letters DVLA would like to be able to identify letters sent by the Insolvency Service, so we are changing the zipfile name. They need all zipfile names to have the same structure, so we can't just add a marker to files sent by that service - we have to change all filenames. The new format is like this: `{NOTIFY}.{DATE}.{SEQUENCE_ID}.{UNIQUE_ID}.{SERVICE_ID}.{ORG_NAME}.{EXTENSION}`	2021-05-06 09:18:44 +01:00
Ben Thorner	23f4ae32df	Merge pull request #3214 from alphagov/check-broadcast-suspended Enforce service suspension for broadcasts	2021-04-28 15:01:11 +01:00
Ben Thorner	99bc29418e	Move request_id injection into send_task override This applies the same change we made in other apps [1][2]. Adding the override here is special, though, because it means the others will now get triggered, since this app is the start of the chain of tasks for a request. We will also retain existing request_id tracing for tasks within this app, since "apply_async" calls the "send_task" method internally, which is the one we're overriding. [1]: `6f3c118a1e` [2]: `2e08b7aa95`	2021-04-27 10:35:21 +01:00
Ben Thorner	a2af8b052a	Split up authorisation vs. sequencing checks While both of these are integrity errors (since we should never reach this point in the code + data), this just means the original method comment is still relevant to what immediately follows it.	2021-04-19 17:13:15 +01:00
Ben Thorner	936c9ebdfe	Test sanity checks by calling top-level task Since the checks are only performed in one place we can easily take extra care to ensure this in the tests, noting that we don't need to do any additional setup, except if no exception is raised - I've left these tests as-is, to avoid doing more setup. Note that we still check the happy path for when a provider message is already sending - just in a different test [1]. [1]: `3d71815956/tests/app/celery/test_broadcast_message_tasks.py (L263)`	2021-04-19 17:13:14 +01:00
Ben Thorner	ee52e3e2c9	Mirror integrity checks from the API It makes sense to have these checks [1] here, since in future we may add other ways of creating a broadcast event and omit them. [1]: `3d71815956/app/broadcast_message/rest.py (L198)`	2021-04-19 17:13:13 +01:00
Ben Thorner	0070473f31	Check for suspension before sending a broadcast This mirrors the check we do for jobs, which are also a high-impact task [1]. While this shouldn't be possible, just like other checks we're adding it here to be doubly certain. [1]: `3d71815956/app/celery/tasks.py (L74)`	2021-04-19 17:13:12 +01:00
Ben Thorner	b2398fcaf4	Rename CBCProxyFatalException We only actually use this when the data we're working with is in an unexpected state, which is unrelated to the CBC Proxy. Using this name also means we can re-use this exception in the next commits. Note that we may still care if a broadcast message has expired, since it's not expected that someone would send one in this condition.	2021-04-19 17:13:05 +01:00
Rebecca Law	34a378a60e	Update the Zendesk ticket content for `check_if_letters_still_in_created` The message to Zendesk includes a list of notification ids, this isn't really necessary and is included in the run book. Creation of the Zendesk ticket can fail if the message is too long, removing the list of ids can prevent that from happening.	2021-04-19 10:47:25 +01:00
Ben Thorner	be02573147	Fix apply_async not working with positional kwargs Celery's apply_async function accepts 'kwargs' as (get ready to be confused) either a positional argument, or a keyword argument: Positional: apply_async(['args'], {'kw': 'args'}) Keyword: apply_async(args=['args'], kwargs={'kw': 'args'}) We rely on the positional form in at least one place [1]. This fixes the overload of apply_async to cope with both forms, and continue to pass through any other (confusion time again) keyword args to super(), such as queue="queue". Note that we've also decided to stop accepting other positional args, since this is unnecessarily confusing, and we don't currently rely on it in our code. This stops it creeping in in future. [1]: `fde927e00e/app/job/rest.py (L186)`	2021-04-15 17:21:21 +01:00
Ben Thorner	ec6d87cd0f	Simplify argument passing in apply_async This avoids the need to keep in-sync with any future changes to the signature, and reduces the amount of irrelevant code to read.	2021-04-13 15:12:45 +01:00
David McDonald	2e6d761691	Merge pull request #3204 from alphagov/broadcast-envars Broadcast envars	2021-04-12 17:25:15 +01:00
David McDonald	295162c81d	Move CBC proxy enable check This change will make our development environments closer to production even if they aren't hooked up to the CBC proxy lambda functions. Now in development, we will create the broadcast event and create tasks for each broadcast provider event. We will still not create actual broadcast provider message rows in the DB and talk to the CBC proxies. This should be helpful in development to catch any issues we introduce to do with sending broadcast messaging. In time we may wish to have some fake CBC proxies in the AWS tools account that we can interact with to make it even more realistic.	2021-04-12 17:05:41 +01:00
Ben Thorner	3e507eea55	Merge pull request #3201 from alphagov/revamp-celery-stats Migrate towards new metrics for Celery tasks	2021-04-12 15:04:37 +01:00
Ben Thorner	37f91e0214	Add tests for apply_async injecting request_id	2021-04-12 14:50:55 +01:00
Ben Thorner	df6e27d8fd	Add test for extracting request_id in __call__ Tasks will fail if we leave the kwarg in, so I think it's quite important that we test this works. We don't cover this in any other test because we call the task functions directly, so the request_id kwarg doesn't get injected beforehand.	2021-04-12 14:50:53 +01:00
Ben Thorner	8954cec5a1	Add tests for celery task superclass This requires upgrading freezegun, as time.monotonic wasn't frozen by v1.0. Note that we need to explicitly specify the base class for the task in the test, the reason for which is quite subtle: - Normally, by using the 'notify_api' fixture, the base class is set to NotifyTask automatically by running app.create_app [1]. - However, when run alongside other tests, the imports of files with other celery tasks cause the base class to be instantiated and cached as the default Celery one. This means none of our tests actually use our custom superclass when testing tasks. Because we can't run 'apply_async' directly (since this would require an actual Celery broker), we need to manually push/pop the request Context that's normally done as part of sending a task. Note also that we use a UUID as the name for a task, since these are global. We want to avoid the task polluting other tests in future, as well as make it clear the task is being reused. [1]: `dea5828d0e/app/__init__.py (L113)`	2021-04-12 14:50:02 +01:00
Leo Hemsted	4a5b1c23bd	only send zendesk P1 for alerts we don't need to be re-notified when someone clicks cancel	2021-04-08 12:22:18 +01:00
Leo Hemsted	9bd8c0239c	look for 'live', not 'production' config['NOTIFY_ENVIRONMENT'] is hardcoded to `'live'` in the Live config class. The values as seen on the environment which we send real messages from: ``` >>> json.loads(os.environ['VCAP_APPLICATION'])['space_name'] # what cloudfoundry sets 'production' >>> os.environ['NOTIFY_ENVIRONMENT'] # we set this from cloudfoundry 'production' >>> current_app.config['NOTIFY_ENVIRONMENT'] # hardcoded in the Live config 'live' >>> current_app.config['NOTIFICATION_QUEUE_PREFIX'] # pulled from env var of same name 'live' >>> current_app.config['ENV'] # this is an unrelated flask variable 'production' ```	2021-04-08 12:17:22 +01:00
Leo Hemsted	df393e36c5	send a p1 when a broadcast goes out on production it's important to keep tabs on when these things leave our system. Sending a zendesk ticket that triggers a P1 is probably our simplest way of notifying the team when this happens (it's what we do with out of hours emergencies on the admin app too). We don't have any direct pagerduty integrations from the api app, but we already have the zendesk client hooked up. After broadcasts go live, we may want to change this to a P2 (but even then, there's arguments for keeping it P1 to start with I think). Don't cause a P1 if it goes out on staging as that might be MNOs testing.	2021-04-06 11:32:19 +01:00
David McDonald	6d410daae4	Remove the emergency alerts canary See https://github.com/alphagov/notifications-broadcasts-infra/pull/197 for why we no longer need this and we get to delete some code!	2021-03-26 18:31:53 +00:00
Rebecca Law	057c4e4568	Quick fix to ensure that billing doesn't fail if the crown is not set for the service. The letters rates for cronw and non crown are the same. It would be nice to remove the need for crown but for now this is a quick fix.	2021-03-25 08:42:46 +00:00
Pea Tyczynska	52c529ab3a	Use personalisation to set client_reference for letters which were sent through Notify interface only. This is done to avoid performance dip from additional operation for other notification types.	2021-03-24 14:55:10 +00:00
Ben Thorner	b2b14f39a3	Merge pull request #3183 from alphagov/remove-crown-letter-filename Remove non/crown indicator in letter filenames	2021-03-24 13:06:58 +00:00
Katie Smith	27b3cece7d	Send template id and version with delivery status callback This adds the `template_id` and `template_version` fields to the data sent to services from the `send_delivery_status_to_service` task. We need to account for the task not being passed these fields at first since there might be tasks retrying which don't have that data. Once all tasks have been called with the new fields we can then update the code to assume they are always there. Since we only send delivery status callbacks for SMS and emails, I've removed the tests where we call that task with letters.	2021-03-24 10:55:45 +00:00
Ben Thorner	8219b3c032	Remove non/crown indicator in letter filenames This is not required by DVLA and since [1] we no longer care about the end of letter filenames when collating them, so removing it is safe to do. Note that the name of the ZIP files of collated letters is based on a hash of the filenames, which needed updating in tests. Before merging this we need to do a test run in Staging, so DVLA can check that a mixture of the old / new filenames won't cause issues. [1]: https://github.com/alphagov/notifications-api/pull/3172	2021-03-18 13:05:12 +00:00
Katie Smith	3b78f863d5	Check for incomplete pending jobs We have a scheduled task that was checking for jobs still in progress. We saw a case where a scheduled job was stuck in a `pending` status as a result of an app shutting down. This changes the `check_job_status` task so that it also checks for scheduled jobs which are still pending after 30 minutes.	2021-03-18 08:24:36 +00:00
Ben Thorner	b43a367d5f	Relax lookup of letter PDFs in S3 buckets Previously we generated the filename we expected a letter PDF to be stored at in S3, and used that to retrieve it. However, the generated filename can change over the course of a notification's lifetime e.g. if the service changes from crown ('.C.') to non-crown ('.N.'). The prefix of the filename is stable: it's based on properties of the notification - reference and creation - that don't change. This commit changes the way we interact with letter PDFs in S3: - Uploading uses the original method to generate the full file name. The method is renamed to 'generate_' to distinguish it from the new one. - Downloading uses a new 'find_' method to get the filename using just its prefix, which makes it agnostic to changes in the filename suffix. Making this change helps to decouple our code from the requirements DVLA have on the filenames. While it means more traffic to S3, we rely on S3 in any case to download the files. From experience, we know S3 is highly reliable and performant, so don't anticipate any issues. In the tests we favour using moto to mock S3, so that the behaviour is realistic. There are a couple of places where we just mock the method, since what it returns isn't important for the test. Note that, since the new method requires a notification object, we need to change a query in one place, the columns of which were only selected to appease the original method to generate a filename.	2021-03-15 13:55:44 +00:00
David McDonald	41d95378ea	Remove everything for the performance platform We no longer will send them any stats so therefore don't need the code - the code to work out the nightly stats - the performance platform client - any configuration for the client - any nightly tasks that kick off the sending off the stats We will require a change in cronitor as we no longer will have this task run meaning we need to delete the cronitor check.	2021-03-15 12:04:53 +00:00
David McDonald	8325431462	Move saving of processing time into separate task We current do this as part of send-daily-performance-platform-stats but now this moves it into its own separate task. This is for two reasons - we will shortly get rid of the send-daily-performance-platform-stats task as we no longer will need to send anything to performance platform - even if we did decide to keep the task send-daily-performance-platform-stats and remove the specific bits that relate to the performance platform, it's probably nicer to rewrite the new task from scratch to make sure it's all clear and easy to understand	2021-03-15 11:44:01 +00:00
Ben Thorner	a91fde2fda	Run auto-correct on app/ and tests/	2021-03-12 11:45:45 +00:00
Rebecca Law	acfb759cb9	Change DVLA_EMAIL_ADDRESS to a list	2021-02-26 11:21:16 +00:00
David McDonald	82e5a1804b	Merge pull request #3155 from alphagov/migrate-broadcast-settings Backfill services_broadcast_settings table	2021-02-25 12:16:36 +00:00

1 2 3 4 5 ...

835 Commits