notifications-api

mirror of https://github.com/GSA/notifications-api.git synced 2026-01-14 06:31:12 -05:00

Author	SHA1	Message	Date
Richard Baker	e10f45b3a7	Cast Celery worker_max_tasks_per_child to int or None We use this config option when running workers that process non-memory-safe tasks to restart the worker after n tasks. Celery 5 requires this to be passed as an int or None. Signed-off-by: Richard Baker <richard.baker@digital.cabinet-office.gov.uk>	2021-11-05 11:09:09 +00:00
Ben Thorner	3ecbdbb260	Temporarily disable task argument checking This was added in Celery 4 [1]. and appears to be incompatible with our approach of injecting "request_id" into task arguments (example exception below). Although our other apps are on Celery 5 our logs don't show any similar issues, probably because all their tasks are invoked without request IDs. In the longterm we should decide if we want to enable argument checking and fix the tracing approach, or stop tracing request IDs in Celery tasks. [1]: https://docs.celeryproject.org/en/stable/userguide/tasks.html#argument-checking 2021-11-01T11:37:36 delivery delivery ERROR None "RETRY: Email notification f69a9305-686f-42eb-a2ee-61bc2ba1f5f3 failed" [in /Users/benthorner/Documents/Projects/api/app/celery/provider_tasks.py:68] Traceback (most recent call last): File "/Users/benthorner/Documents/Projects/api/app/celery/provider_tasks.py", line 53, in deliver_email raise TypeError("test retry") TypeError: test retry [2021-11-01 11:37:36,385: ERROR/ForkPoolWorker-1] RETRY: Email notification f69a9305-686f-42eb-a2ee-61bc2ba1f5f3 failed Traceback (most recent call last): File "/Users/benthorner/Documents/Projects/api/app/celery/provider_tasks.py", line 53, in deliver_email raise TypeError("test retry") TypeError: test retry [2021-11-01 11:37:36,394: WARNING/ForkPoolWorker-1] Task deliver_email[449cd221-173c-4e18-83ac-229e88c029a5] reject requeue=False: deliver_email() got an unexpected keyword argument 'request_id' Traceback (most recent call last): File "/Users/benthorner/Documents/Projects/api/app/celery/provider_tasks.py", line 53, in deliver_email raise TypeError("test retry") TypeError: test retry During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/Users/benthorner/.pyenv/versions/notifications-api/lib/python3.6/site-packages/celery/app/task.py", line 731, in retry S.apply_async() File "/Users/benthorner/.pyenv/versions/notifications-api/lib/python3.6/site-packages/celery/canvas.py", line 219, in apply_async return _apply(args, kwargs, *options) File "/Users/benthorner/.pyenv/versions/notifications-api/lib/python3.6/site-packages/celery/app/task.py", line 537, in apply_async check_arguments((args or ()), *(kwargs or {})) TypeError: deliver_email() got an unexpected keyword argument 'request_id' During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/Users/benthorner/.pyenv/versions/notifications-api/lib/python3.6/site-packages/celery/app/trace.py", line 450, in trace_task R = retval = fun(args, *kwargs) File "/Users/benthorner/Documents/Projects/api/app/celery/celery.py", line 74, in __call__ return super().__call__(args, *kwargs) File "/Users/benthorner/.pyenv/versions/notifications-api/lib/python3.6/site-packages/celery/app/trace.py", line 731, in __protected_call__ return self.run(args, **kwargs) File "/Users/benthorner/Documents/Projects/api/app/celery/provider_tasks.py", line 71, in deliver_email self.retry(queue=QueueNames.RETRY) File "/Users/benthorner/.pyenv/versions/notifications-api/lib/python3.6/site-packages/celery/app/task.py", line 733, in retry raise Reject(exc, requeue=False) celery.exceptions.Reject: (TypeError("deliver_email() got an unexpected keyword argument 'request_id'",), False)	2021-11-01 11:39:57 +00:00
Ben Thorner	29c92a9e54	Try removing boto package again	2021-11-01 09:54:10 +00:00
Ben Thorner	d0550533a7	Remove redundant polling_interval setting This appeared without explanation in [1], but it's the same as the default value [2] so we don't need to specify it - doing so gives the impression we made a decision, but that's not clear here. [1]: https://github.com/alphagov/notifications-api/pull/2142/files#diff-84f1a9419471e289c6b6e2b0209b329e20df6cef81d1f7f0a193ddc2fc6ad69dR153 [2]: https://docs.celeryproject.org/en/stable/getting-started/backends-and-brokers/sqs.html#polling-interval	2021-11-01 09:54:07 +00:00
Ben Thorner	44b3b42aba	Rewrite config to fix deprecation warnings The new format was introduced in Celery 4 [1] and is due for removal in Celery 6 [2], hence the warnings e.g. [2021-10-26 14:31:57,588: WARNING/MainProcess] /Users/benthorner/.pyenv/versions/notifications-api/lib/python3.6/site-packages/celery/app/utils.py:206: CDeprecationWarning: The 'CELERY_TIMEZONE' setting is deprecated and scheduled for removal in version 6.0.0. Use the timezone instead alternative=f'Use the {_TO_NEW_KEY[setting]} instead') This rewrites the config to match our other apps [3][4]. Some of the settings have been removed entirely: - "CELERY_ENABLE_UTC = True" - this has been enabled by default since Celery 3 [5]. - "CELERY_ACCEPT_CONTENT = ['json']", "CELERY_TASK_SERIALIZER = 'json'" - these are the default settings since Celery 4 [6][7]. Finally, this removes a redundant (and broken) bit of development config - NOTIFICATION_QUEUE_PREFIX - that should be set in environment.sh [8]. [1]: https://docs.celeryproject.org/en/stable/history/whatsnew-4.0.html#lowercase-setting-names [2]: https://docs.celeryproject.org/en/stable/history/whatsnew-5.0.html#step-2-update-your-configuration-with-the-new-setting-names [3]: `252ad01d39/app/config.py (L27)` [4]: `03df0d9252/app/__init__.py (L33)` [5]: https://docs.celeryproject.org/en/stable/userguide/configuration.html#std-setting-enable_utc [6]: https://docs.celeryproject.org/en/stable/userguide/configuration.html#std-setting-task_serializer [7]: https://docs.celeryproject.org/en/stable/userguide/configuration.html#std-setting-accept_content [8]: `2edbdec4ee/README.md (environmentsh)`	2021-11-01 09:54:05 +00:00
Leo Hemsted	19394ab9dd	construct celery queues once in the base config previously, we were confusing things by appending to CELERY_QUEUES in both dev and test configs - these are executed at import time, so the list contained all queues twice, regardless of what config you're actually using. Fortunately, the -Q command that we supply the workers with overrides this config option, so other environments weren't affected. Given that, we can tidy up this code by just declaring it in the base config every time	2021-11-01 09:54:04 +00:00
Ben Thorner	c2fe1b04bb	Fix test checking for nested exception Previously this type of exception was raised at the top level and the task did not retry [1]. Since Celery 4+ the behaviour changed so that a Retry exception will be raised unless we explicitly say we want to raise the original one [2]. It's unclear if we actually want to retry this task for any type of exception, but it's out-of-scope for this PR to decide on this, so here we just reraise the exception to make it compatible with the new version of Celery and the existing test. [1]: https://github.com/alphagov/notifications-api/pull/2832/files#diff-926badba91648d56a973e16bd92da3345b23bc60dc89360119b1df08de52723fL77 [2]: `32b52ca875 (diff-db604dd7cb51e386710260ff2eba378aac19ba11eec97904bbf097b68caeada6L625)`	2021-11-01 09:54:03 +00:00
Ben Thorner	4125ed3f10	Merge pull request #3352 from alphagov/remove-raw-request-stats-180016688 Revert "add raw request timings to provider send functions"	2021-10-29 15:04:25 +01:00
Ben Thorner	d1586a8f81	CC DVLA in tickets about outstanding letters Previously we sent them emails about this manually. We also tried a Zendesk macro/trigger approach, but using a CC means: - We can control the behaviour ourselves (Zendesk triggers can only be edited by admins outside our team). - We keep the DVLA notification approach consistent and in one place, so notifications always go to the same people. - Any further (public) updates to the ticket will also trigger a notification to DVLA (previous trigger only notified on creation).	2021-10-29 11:46:29 +01:00
Ben Thorner	3eeba0266b	Revert "add raw request timings to provider send functions" This reverts commit `f2f2509c9b`. Raw request stats were added to investigate a hunch about a performance issue we were seeing [1], but turned out not to be relevant. We don't use them anymore so we can tidy up. [1]: https://github.com/alphagov/notifications-api/pull/2858	2021-10-28 11:12:18 +01:00
David McDonald	5a51ab6131	Bug fix: update normalised_to, not just `to` after letter sanitise When a precompiled letter is sent to us, we set the `to` field as 'Provided as PDF' in `1c1023a877/app/v2/notifications/post_notifications.py (L100-L104)` This then also sets `normalised_to` as `providedaspdf`. However, when template preview sanitises the letter, pulls out the address and gives it to the API, we were only setting `to` to be the new address and had forgotten to also amend `normalised_to` to be the normalised version. This meant that for all these letters we accidentally left `normalised_to` as `providedaspdf`. The impact of this was that we can not then search for these letters in the admin user interface as they rely on the `normalised_to` field containing the recipient address. This commit fixes that bug by also setting the `normalised_to` field	2021-10-27 11:56:25 +01:00
Chris Hill-Scott	54bcf618da	Store the `event` field from CAP XML broadcasts We don’t store everything that comes in the CAP XML when someone creates a broadcast via the API. One thing we do store is `<identifier>` (in a column called `reference`) which is a unique (to the external system) identifier for the broadcast. We show this in the front end instead of the template name, because broadcasts created from the API don’t use templates. However this ID isn’t very friendly – the Environment Agency just supply a UUID. The Environment Agency also populate the `<event>` field with some human readable text, for example: > 013 Issue Severe Flood Warning EA (013 is an area code which will be meaningful to the Flood Warning Service team) We should show this in the UI instead of the reference. The first step towards this is storing it in the database and returning it in the REST endpoints. Later we can have the admin app prefer `cap_event` over `reference`, where `cap_event` is present. We can’t backfill this data because we don’t keep a copy of the original XML. Seems like `<event>` is a mandatory property of `<info>`, so we don’t need to worry about the field being missing (`<info>` is optional in CAP but we require it because it contains stuff like the areas which we need in order to send out the broadcast`). *** https://www.pivotaltracker.com/story/show/176927060	2021-10-26 11:12:27 +01:00
Ben Thorner	d703251b13	Merge pull request #3348 from alphagov/better-callback-stats-180016688 Include status in stats about delivery times	2021-10-22 11:59:24 +01:00
Ben Thorner	f974108934	Include status in stats about delivery times Previously these metrics weren't very useful because they could be skewed by long timings for failed notifications, which can take up to 72 hours to deliver. I'm intentionally not trying to have a dual running period (with the old and new names) because: - We don't use the current stats for anything (checking Grafana). - The current stats get turned into a "bucket" metric in Prometheus [1][2], which isn't very useful because it can only tell us the mean time to deliver, but we're actually interested in percentiles. Switching to a new naming is an opportunity to fix the raw data and the way it's aggregated, using the same kind of "summary" metric that we now use for stats about our Celery tasks [3]. [1]: `c330a8ac8a/paas/statsd/statsd-mapping.yml (L82)` [2]: https://prometheus.io/docs/practices/histograms/#quantiles [3]: https://github.com/alphagov/notifications-aws/pull/890	2021-10-20 17:22:59 +01:00
Leo Hemsted	0b8c6ef263	Merge pull request #3339 from alphagov/letter-runbook-link tweak zendesk message for no ack files alert	2021-10-20 15:23:33 +01:00
Pea Tyczynska	1b6f9505da	Call `publish-govuk-alerts` task when alert expires The `auto-expire-broadcast-messages` task checks for expired broadcasts at five minute intervals. This change now calls the `publish-govuk-alerts` task in govuk-alerts if there are expired broadcasts so that the site is updated. Co-authored-by: Katie Smith <katie.smith@digital.cabinet-office.gov.uk>	2021-10-18 08:41:25 +01:00
Katie Smith	04bfd6bfdb	Trigger task to publish alerts when sending or cancelling alert When we send or cancel a broadcast message, we now trigger a task in govuk-alerts repo that polls our API for alerts and publishes a fresh list of alerts. Co-authored-by: Pea Tyczynska <pea.tyczynska@digital.cabinet-office.gov.uk>	2021-10-18 08:41:24 +01:00
Ben Thorner	7d631960eb	Fix incorrect ordering in command wrapper Previously this was causing the wrapper function to become a command before it started mirroring the original (functools.wraps), which meant any previous option decorators were "lost".* We didn't notice the problem in the original PR [1] because the new command under test has its option decorators after the command decorator, in contrast with all other (now broken) commands. The original wrapper applied the functools decorator first [2], so this change just reinstates that ordering. *This is a hand-wavey explanation as I haven't looked into how functools.wraps interacts with option decorators. [1]: `922fd2f333`# [2]: `922fd2f333 (diff-c4e75c8613e916687a97191a7a79110dfb47e96ef7df96f7ba25dd94ba64943dL101)`	2021-10-08 14:21:59 +01:00
Leo Hemsted	b8c4e19072	tweak zendesk message for no ack files alert include a link to a runbook entry. also the list of acknowledgement files can be very long, so make that the last thing, and use new lines to space out the message.	2021-10-08 13:45:02 +01:00
Chris Hill-Scott	544bfbf569	Add separate config item for failed login count It’s confusing that changing `MAX_VERIFY_CODE_COUNT` also limits the number of failed login attempts that a user of text messages 2FA can make. This makes the parameters independent, and adds a test to make sure any future changes which affect the limit of failed login attempts are covered.	2021-10-04 10:45:07 +01:00
Chris Hill-Scott	786893d920	Reduce max concurrent 2 factor codes I was doing some analysis and saw that in the last 24 hours the most codes that anyone had was in a 15 minute window was 3. So I think we can safely reduce this to 5 to get a bit more security with enough headroom to not have any negative impact to the user.	2021-10-04 10:45:06 +01:00
Chris Hill-Scott	19ad11e383	Don’t repeat digits in security codes People with dyslexia and dyscalculia find it difficult to transpose codes which have consecutive, repeated digits[1]. This commits enhances the algorithm for generating codes to not repeat the previous digit in a code. This reduces the key space for our codes from 100,000 possibilities to 65,610 possibilities. 1. https://twitter.com/annaecook/status/1442567679710150662	2021-09-30 10:24:17 +01:00
Katie Smith	58597653df	Update how "sending to TV numbers" Zendesk tickets are created	2021-09-29 11:26:20 +01:00
Katie Smith	0c0c7f4478	Update how "letters still created status" Zendesk tickets are created	2021-09-29 11:23:28 +01:00
Katie Smith	2f66e38fb9	Update how "missing ackfile for letters" Zendesk tickets are created	2021-09-29 11:10:50 +01:00
Katie Smith	64c0a3fb9d	Update how 'letters still sending' Zendesk tickets are created These now use the new Zendesk form.	2021-09-29 11:07:37 +01:00
Katie Smith	b114dadcae	Update how pending virus check Zendesk tickets are created This updates the tickets that are created when the `check_if_letters_still_pending_virus_check` scheduled task detects letters in the `pending-virus-check` state.	2021-09-29 11:03:48 +01:00
Katie Smith	9ff0ca0363	Update how live broadcast Zendesk tickets are created These now use the Notify Form in Zendesk	2021-09-29 10:59:07 +01:00
Ben Thorner	68eeb1defa	Merge pull request #3325 from alphagov/prevent-empty-areas-178986763 Add validation to prevent blank area names	2021-09-17 15:20:15 +01:00
Ben Thorner	d8a0967ec0	Add validation to prevent blank area names Now that these are used for display on gov.uk/alerts we need to make sure the data is being set properly. We've already found an example where it wasn't [1]. We validate external broadcasts in two stages: with the official CAP XML schema [2] and then again with our own, more specific schema for the converted JSON. Since this validation is a custom requirement I've made it part of the JSON schema. Note that jsonschema recommends avoiding metachars like "\w" since they're not supported by all implementations [3]. I've tested the new validation manually and it works as expected by disallowing e.g. " " but still alowing "foo" and "foo bar". [1]: https://www.notifications.service.gov.uk/services/120107d0-d99a-4c42-8b70-f37d2f28879b/rejected-alerts/d6e0c70e-60f6-4422-8589-2a2d159c63f2 [2]: `81a25ff1ef/app/xml_schemas/CAP-v1.2.xsd` [3]: http://json-schema.org/understanding-json-schema/reference/regular_expressions.html	2021-09-17 13:33:52 +01:00
Ben Thorner	6a53871455	Restructure govuk-alerts endpoint to be internal In response to: https://github.com/alphagov/notifications-api/pull/3305#pullrequestreview-726672421 Previously this was added among the public /v2 endpoints, but it's only meant for internal use. While only the govuk-alerts app would be able to access it, the location and /v2 URL suggested otherwise. This restructures the endpoint so it resembles other internal ones.	2021-09-15 15:36:17 +01:00
Ben Thorner	35430e9a9f	Refactor custom validation into own function This sets a pattern for adding another in the next commits.	2021-09-15 11:02:50 +01:00
Leo Hemsted	33ca817e17	return contact list created_at in UTC, not BST make sure timestamps returned from the api are always consistent. The only place in models where we're serializing a BST timestamp is on the Notification.serialize_for_csv method now, which at least is a bit different as this is user-facing (it also returns a formatted human-readable notification_status for example).	2021-09-14 12:41:52 +01:00
Ben Thorner	c53ee6de94	Merge pull request #3323 from alphagov/add-permission-local-dev Make it easy to develop with broadcast services	2021-09-14 11:22:00 +01:00
Ben Thorner	922fd2f333	Support testing commands and add first test We have a lot of commands and it's important we test the ones that are meant to be used in the future to ensure they work when they're needed. Testing Flask commands is usually easy as written in their docs [1], but I had to make some changes to the way we decorate the command functions so they can work with test DB objects - I couldn't find any example of someone else encountering the same problem. [1]: https://flask.palletsprojects.com/en/2.0.x/testing/#testing-cli-commands	2021-09-14 09:29:23 +01:00
sakisv	df739d6b94	Fix flake8	2021-09-10 10:17:28 +03:00
sakisv	65c21f694c	Don't raise P1 for broadcasts This is happening on the AWS side now as part of alphagov/notifications-broadcasts-infra#267 - but we still want to keep the zendesk ticket as it contains useful context _and_ provides visibility to the team.	2021-09-09 16:44:19 +03:00
Ben Thorner	04d8678c27	Make it easy to develop with broadcast services Previously I had to handcraft some SQL to give myself access to a broadcast service I created locally. I've done this enough times that I think it's worth automating.	2021-09-08 09:45:11 +01:00
Ben Thorner	54808104a6	Stop defaulting simple_polygons to empty array This is now done by the Admin app [1]. [1]: `baf20e0075`	2021-09-07 17:16:24 +01:00
Ben Thorner	59d0ab4f65	Stop defaulting "ids" to an empty array This is so we can distinguish custom broadcasts in the Admin app [1]. I've also extended the POST test for custom broadcasts to check we're correctly reading data for "names", as this wasn't being tested previously. [1]: `411fda81c0`	2021-09-07 17:16:22 +01:00
Ben Thorner	dd41cf854c	Remove support for old "areas" sub-field All broadcasts with this field have now been migrated to use "ids". This also removes a few lines that were missed in previous PRs: - Added by mistake: `fd7ebbebb0 (diff-045554136e1462693a6cbb6328b2e056a81e8b348e94575edd8f72b78c5da96eR115)` - Missed removal: `ec1171f85c (diff-045554136e1462693a6cbb6328b2e056a81e8b348e94575edd8f72b78c5da96eR110)`	2021-09-07 17:14:57 +01:00
Ben Thorner	6af39c4d3b	Remove redundant force_overrride feature This is no longer needed as all areas data has now been migrated.	2021-09-06 09:53:39 +01:00
Ben Thorner	d50c563f08	Remove support for "areas_2" field The Admin app was only using this temporarily and is now using the "areas" field instead [1], so we can delete this one. [1]: https://github.com/alphagov/notifications-admin/pull/4006	2021-09-01 17:42:15 +01:00
Ben Thorner	43ddfe0560	Remove old "simple_polygons" fields in schemas These were missed in [1]. [1]: https://github.com/alphagov/notifications-api/pull/3314	2021-09-01 17:23:29 +01:00
Ben Thorner	be7272b44f	Merge pull request #3314 from alphagov/next-stage-areas-migration-178986763 Switch "areas" field to "areas_2" format	2021-09-01 16:16:30 +01:00
Chris Hill-Scott	2c7e4657ce	Don’t update `email_access_validated_at` on password reset As of https://github.com/alphagov/notifications-admin/pull/4000/files the admin app is doing this, so we don’t need to do it here as well.	2021-09-01 09:54:54 +01:00
Chris Hill-Scott	6900505b05	Don’t call random.choices with zero weighting As of `041d8b48a2` it’s not valid to call `random.choices` without giving at least one of the options a positive weighting. This makes sense, because giving a zero weighting is effectively saying ‘theres’s only one choice, but don’t choose it’. In our codebase this is applicable where there’s only one international provider, which we want to use even when it’s been de-prioritised for domestic SMS. This doesn’t cause a problem now, but will if we upgrade to Python versions greater than 3.9.0.	2021-08-31 11:08:27 +01:00
Ben Thorner	bf0bf4e31c	Favour new "areas" format for PagerDuty alerts Broadcasts created via the API [1] and the Admin app [2] should both now have this field set. It's also more informative to show this, and broadcasts created via the API don't have IDs anyway. There's a small risk that an old broadcast that gets approved won't have this data, but it's for information only and we intend to backfill all old broadcasts in the near future. [1]: `023a06d5fb` [2]: `7dbe3afa19`	2021-08-27 14:22:12 +01:00
Ben Thorner	ec1171f85c	Switch "areas" field to "areas_2" format The Admin app is now temporarily using the "areas_2" field while we migrate "areas" to the new format [1]. [1]: https://github.com/alphagov/notifications-admin/pull/4004	2021-08-27 14:22:11 +01:00
Ben Thorner	a7d92b9058	Replace / remove redundant uses of "areas" In one case ("areas=['manchester']") the format was even invalid, but in general the original value of the column is pretty much irrelevant for tests that involve updating it (it's highly unlikely the column would default to the same value as the test data).	2021-08-27 13:31:49 +01:00

1 2 3 4 5 ...

4801 Commits