Discussion:
Collaboration on standard Wayland protocol extensions
Drew DeVault
2016-03-27 20:34:37 UTC
Greetings! I am the maintainer of the Sway Wayland compositor.

http://swaywm.org

It's almost the Year of Wayland on the Desktop(tm), and I have
reached out to each of the projects this message is addressed to (GNOME,
Kwin, and wayland-devel) to collaborate on some shared protocol
extensions for doing a handful of common tasks such as display
configuration and taking screenshots. Life will be much easier for
projects like ffmpeg and imagemagick if they don't have to implement
compositor-specific code for capturing the screen!

I want to start by establishing the requirements for these protocols.
Broadly speaking, I am looking to create protocols for the following
use-cases:

- Screen capture
- Output configuration
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
- Input device configuration

I think that these are the core protocols necessary for
cross-compositor compatibility and to support most existing tools for
X11 like ffmpeg. Considering the security goals of Wayland, it will also
likely be necessary to implement some kind of protocol for requesting
and granting sensitive permissions to clients.

How does this list look? What sorts of concerns do you guys have with
respect to what features each protocol needs to support? Have I missed
any major protocols that we'll have to work on? Once we have a good list
of requirements I'll start writing some XML.
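
As a very rough strawman of what such XML could look like, here is a
sketch of the screen capture piece. Every interface, request, and
argument name in it is hypothetical, just to show the shape of the
thing, not a settled proposal:

<?xml version="1.0" encoding="UTF-8"?>
<protocol name="screen_capture_sketch">
  <!-- hypothetical global, advertised via wl_registry -->
  <interface name="zz_screen_capture_manager_v1" version="1">
    <request name="capture_output">
      <description summary="capture one frame of an output">
        Ask the compositor to copy the next frame of the given output
        into the provided buffer. The compositor may deny the request,
        e.g. for security reasons.
      </description>
      <arg name="frame" type="new_id" interface="zz_capture_frame_v1"/>
      <arg name="output" type="object" interface="wl_output"/>
      <arg name="buffer" type="object" interface="wl_buffer"/>
    </request>
  </interface>

  <interface name="zz_capture_frame_v1" version="1">
    <!-- exactly one of these two events would be sent per capture -->
    <event name="ready">
      <description summary="the buffer now contains the frame"/>
    </event>
    <event name="failed">
      <description summary="capture was denied or is not possible"/>
    </event>
  </interface>
</protocol>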

--
Drew DeVault
Martin Peres
2016-03-27 20:50:48 UTC
Post by Drew DeVault
Greetings! I am the maintainer of the Sway Wayland compositor.
http://swaywm.org
It's almost the Year of Wayland on the Desktop(tm), and I have
reached out to each of the projects this message is addressed to (GNOME,
Kwin, and wayland-devel) to collaborate on some shared protocol
extensions for doing a handful of common tasks such as display
configuration and taking screenshots. Life will be much easier for
projects like ffmpeg and imagemagick if they don't have to implement
compositor-specific code for capturing the screen!
I want to start by establishing the requirements for these protocols.
Broadly speaking, I am looking to create protocols for the following
- Screen capture
- Output configuration
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
- Input device configuration
I think that these are the core protocols necessary for
cross-compositor compatibility and to support most existing tools for
X11 like ffmpeg. Considering the security goals of Wayland, it will also
likely be necessary to implement some kind of protocol for requesting
and granting sensitive permissions to clients.
How does this list look? What sorts of concerns do you guys have with
respect to what features each protocol needs to support? Have I missed
any major protocols that we'll have to work on? Once we have a good list
of requirements I'll start writing some XML.
--
Drew DeVault
We had discussions about this years ago, and here are the results:
http://mupuf.org/blog/2014/02/19/wayland-compositors-why-and-how-to-handle/
http://mupuf.org/blog/2014/03/18/managing-auth-ui-in-linux/

And here is the software we created, under the name "Wayland Security
Modules":
http://www.x.org/wiki/Events/XDC2014/XDC2014DodierPeresSecurity/xorg-talk.pdf
https://github.com/mupuf/libwsm

This approach has generally been liked by KDE, but not by Gnome who, last
I heard, did not care about cross-platform apps doing privileged
operations. This may have changed since they also decided to work on
sandboxing (xdg-app) and implemented something like the following
approach, which they had said they would never do because it changed the API:
http://mupuf.org/blog/2014/05/14/introducing-sandbox-utils-0-6-1/

I really wish we could get everyone on board with one solution for these
cross-platform apps, and so far I do not see any better solution than WSM.

Martin
Drew DeVault
2016-03-27 21:00:47 UTC
Thanks for the links! I'll read through them. I figured that a
discussion like this had happened in the past around how to give clients
privileges, but I couldn't find anything that would allow them to
actually do the thing they were given permission for. We should flesh out
both parts of this model. I read over libwsm and it seems like a fairly
sane approach. I'd like to read the arguments for/against it.
This approach has generally been liked by KDE, but not by Gnome who, last I
heard, did not care about cross-platform apps doing privileged operations.
This may have changed since they also decided to work on sandboxing
(xdg-app) and implemented something like the following approach, which they
had said they would never do because it changed the API:
http://mupuf.org/blog/2014/05/14/introducing-sandbox-utils-0-6-1/
I would hope that our friends at Gnome aren't planning on implementing
software to cover every screen capturing use case in mutter! I'd like to
find a way to use OBS (https://obsproject.com/) from Wayland, for
example.
I really wish we could get everyone on board with one solution for these
cross-platform apps, and so far I do not see any better solution than WSM.
Well, I'm definitely on board. Sway is clearly a smaller project than
Gnome or KDE and I would rather not build the "Sway Desktop
Environment". I think we can arrive at some solutions that are in line
with the Unix way AND meet the goals of the big DEs.

--
Drew DeVault
Jasper St. Pierre
2016-03-27 23:41:43 UTC
You're probably referring to my response when you say "GNOME does not
care about cross-platform apps doing privileged operations". My
response wasn't meant to be speaking on behalf of GNOME. These are my
opinions and mine alone.

My opinion is still as follows: having seen how SELinux and PAM work
out in practice, I'm skeptical of any "Security Module" which
implements policy. The "module" part of it rarely happens, since
people simply gravitate towards a standard policy. What's interesting
to me isn't a piece of code that allows or rejects operations, it's
the resulting UI *around* those operations and managing them, since
that's really, at the end of the day, all the user cares about.

It would be a significant failure to me if we didn't have a standard
way for a user to examine or recall the policy of an application,
using whatever API they wanted. If every module implements its own
policy store separately, such a UI would be extremely difficult to
build.

From what I read, Wayland Security Modules didn't seem to even provide
that as a baseline, which is why I believe they're tackling the
problem from the wrong angle.
Post by Martin Peres
Post by Drew DeVault
Greetings! I am the maintainer of the Sway Wayland compositor.
http://swaywm.org
It's almost the Year of Wayland on the Desktop(tm), and I have
reached out to each of the projects this message is addressed to (GNOME,
Kwin, and wayland-devel) to collaborate on some shared protocol
extensions for doing a handful of common tasks such as display
configuration and taking screenshots. Life will be much easier for
projects like ffmpeg and imagemagick if they don't have to implement
compositor-specific code for capturing the screen!
I want to start by establishing the requirements for these protocols.
Broadly speaking, I am looking to create protocols for the following
- Screen capture
- Output configuration
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
- Input device configuration
I think that these are the core protocols necessary for
cross-compositor compatibility and to support most existing tools for
X11 like ffmpeg. Considering the security goals of Wayland, it will also
likely be necessary to implement some kind of protocol for requesting
and granting sensitive permissions to clients.
How does this list look? What sorts of concerns do you guys have with
respect to what features each protocol needs to support? Have I missed
any major protocols that we'll have to work on? Once we have a good list
of requirements I'll start writing some XML.
--
Drew DeVault
http://mupuf.org/blog/2014/02/19/wayland-compositors-why-and-how-to-handle/
http://mupuf.org/blog/2014/03/18/managing-auth-ui-in-linux/
And here is the software we created, under the name "Wayland Security
http://www.x.org/wiki/Events/XDC2014/XDC2014DodierPeresSecurity/xorg-talk.pdf
https://github.com/mupuf/libwsm
This approach has generally been liked by KDE, but not by Gnome who, last I
heard, did not care about cross-platform apps doing privileged operations.
This may have changed since they also decided to work on sandboxing
(xdg-app) and implemented something like the following approach, which they
had said they would never do because it changed the API:
http://mupuf.org/blog/2014/05/14/introducing-sandbox-utils-0-6-1/
I really wish we could get everyone on board with one solution for these
cross-platform apps, and so far I do not see any better solution than WSM.
Martin
--
Jasper
Drew DeVault
2016-03-28 02:33:52 UTC
Post by Jasper St. Pierre
My opinion is still as follows: having seen how SELinux and PAM work
out in practice, I'm skeptical of any "Security Module" which
implements policy. The "module" part of it rarely happens, since
people simply gravitate towards a standard policy. What's interesting
to me isn't a piece of code that allows or rejects operations, it's
the resulting UI *around* those operations and managing them, since
that's really, at the end of the day, all the user cares about.
It has been done successfully, though. Consider the experience for iOS
and Android permissions. When an application needs to do something
sensitive, a simple dialog pops up explaining what it's asking for, and
allowing the user to consent once or forever. It's pretty simple and I
think we can accomplish something similar.
Post by Jasper St. Pierre
It would be a significant failure to me if we didn't have a standard
way for a user to examine or recall the policy of an application,
using whatever API they wanted. If every module implements its own
policy store separately, such a UI would be extremely difficult to
build.
Ah, but here we are, all talking about it together. Let's make a
solution that works for all of us, then.
Post by Jasper St. Pierre
From what I read, Wayland Security Modules didn't seem to even provide
that as a baseline, which is why I believe they're tackling the
problem from the wrong angle.
What are your specific concerns with it? I would tend to agree. I think
that it's not bad as an implementation of this mechanism, but I agree
that it's approaching the problem wrong. I think it would be wiser to
start with how clients ask the compositor for permissions and how they
receive them, then leave the details libwsm implements up to the
compositors.

I think a protocol extension would work just fine to implement a
permission requesting/granting dialogue between clients and compositors.

--
Drew DeVault
Jasper St. Pierre
2016-03-28 05:21:52 UTC
Post by Drew DeVault
What are your specific concerns with it? I would tend to agree. I think
that it's not bad as an implementation of this mechanic, but I agree
that it's approaching the problem wrong. I think it would be wiser to
start with how clients ask the compositor for permissions and how they
receive them, then leave the details libwsm implements up to the
compositors.
I think a protocol extension would work just fine to implement a
permission requesting/granting dialogue between clients and compositors.
That's what we should be doing, and that's why I'm not a huge fan of
WSM -- it provides a solution for the stuff that doesn't matter, and
doesn't make any progress on the part we need to tackle. I won't enjoy
using libwsm because it adds complexity and error cases (e.g. what
happens with no modules, like on a misconfigured system?), without
solving the actual problem.

Also, as I've mentioned in my emails before, APIs aren't exclusively
used through Wayland; they might also be on other systems like DBus,
which already has its own confusing policy system. It gets even worse
when protocols might cross both systems. So libwsm is already far in
the negative points bucket to me -- a Wayland-protocol centric
solution that ignores other IPCs and APIs, is configurable for no
purpose as far as I can tell, and still doesn't have an approachable
story about how it provides more security to the user.

I would rather the effort be spent making secure interfaces, exactly
as you've described.
Post by Drew DeVault
--
Drew DeVault
--
Jasper
Drew DeVault
2016-03-28 13:03:55 UTC
Post by Jasper St. Pierre
I would rather the effort be spent making secure interfaces, exactly
as you've described.
Agreed. I think it should be pretty straightforward:

Client->Server: What features do you support?
Server->Client: These privileged features are available.
Client->Server: I want this feature (nonblocking)
[compositor prompts user to agree]
Server->Client: Yes/no
[compositor enables the use of those protocols for this client]

I can start to write up some XML to describe this formally. We can take
some inspiration from the pointer-constraints protocol and I'll also
rewrite that protocol with this model in mind. Does anyone see anything
missing from this exchange?
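
As a strawman, that exchange might translate into protocol XML along
these lines (every name below is a placeholder, invented just for
illustration):

<?xml version="1.0" encoding="UTF-8"?>
<protocol name="permissions_sketch">
  <!-- hypothetical global; the compositor advertising it answers
       "what features do you support?" -->
  <interface name="zz_permission_manager_v1" version="1">
    <!-- one event per privileged feature the compositor offers -->
    <event name="available">
      <arg name="feature" type="string"/>
    </event>

    <!-- "I want this feature" - nonblocking; the compositor may
         prompt the user before answering on the new object -->
    <request name="request_permission">
      <arg name="id" type="new_id" interface="zz_permission_v1"/>
      <arg name="feature" type="string"/>
    </request>
  </interface>

  <interface name="zz_permission_v1" version="1">
    <!-- "Yes/no" - after this, the compositor enables (or not) the
         corresponding privileged protocols for this client -->
    <event name="granted"/>
    <event name="denied"/>
  </interface>
</protocol>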

--
Drew DeVault
Martin Peres
2016-03-28 19:50:39 UTC
Post by Drew DeVault
Post by Jasper St. Pierre
I would rather the effort be spent making secure interfaces, exactly
as you've described.
Client->Server: What features do you support?
Server->Client: These privileged features are available.
Client->Server: I want this feature (nonblocking)
[compositor prompts user to agree]
Server->Client: Yes/no
[compositor enables the use of those protocols for this client]
That looks like the bind operation to me. Why do you need a
new protocol?
Post by Drew DeVault
I can start to write up some XML to describe this formally. We can take
some inspiration from the pointer-constraints protocol and I'll also
rewrite that protocol with this model in mind. Does anyone see anything
missing from this exchange?
So, you are OK with being asked *every time* if you accept that VLC
is trying to go fullscreen? I most definitely am not :D

This is why we wanted to let distro devs decide for their users what
the default policy should be. We then need to have a good UI for users
to realize that the application is running in fullscreen mode (following
what chrome and firefox are doing is likely a good idea).

However, Jasper has a point that we need to be sure that we can
override the policy in a consistent way across all backends. I have a
plan for this.
Drew DeVault
2016-03-28 20:47:58 UTC
Post by Martin Peres
Post by Drew DeVault
Client->Server: What features do you support?
Server->Client: These privileged features are available.
Client->Server: I want this feature (nonblocking)
[compositor prompts user to agree]
Server->Client: Yes/no
[compositor enables the use of those protocols for this client]
That looks like the bind operation to me. Why do you need a
new protocol?
How do you propose clients would communicate their intentions to
compositors? I may be misunderstanding.
Post by Martin Peres
So, you are OK with being asked *every time* if you accept that VLC
is trying to go fullscreen? I most definitely am not :D
No, the compositor can remember your choice.
Post by Martin Peres
(following what chrome and firefox are doing is likely a good idea).
I agree, permission management on Firefox is good.

--
Drew DeVault
Martin Peres
2016-03-28 23:15:00 UTC
Post by Drew DeVault
Post by Martin Peres
Post by Drew DeVault
Client->Server: What features do you support?
Server->Client: These privileged features are available.
Client->Server: I want this feature (nonblocking)
[compositor prompts user to agree]
Server->Client: Yes/no
[compositor enables the use of those protocols for this client]
That looks like the bind operation to me. Why do you need a
new protocol?
How do you propose clients would communicate their intentions to
compositors? I may be misunderstanding.
I was proposing for applications to just bind the interface and see if
it works or not. But Giulio's proposal makes sense because it could be
used to both grant and revoke rights on the fly.
Post by Drew DeVault
Post by Martin Peres
So, you are OK with being asked *every time* if you accept that VLC
is trying to go fullscreen? I most definitely am not :D
No, the compositor can remember your choice.
So, you would be asking users to click on "I agree" and "remember my
choice" every time they set up a computer then :D Not nearly as bad, but
still bad.
Post by Drew DeVault
Post by Martin Peres
(following what chrome and firefox are doing is likely a good idea).
I agree, permission management on Firefox is good.
Right, we should really aim for something similar, UX-wise.
Drew DeVault
2016-03-29 03:23:31 UTC
I was proposing for applications to just bind the interface and see if it
works or not. But Giulio's proposal makes sense because it could be used to
both grant and revoke rights on the fly.
I think both solutions have similar merit and I don't feel strongly
about either one.
Post by Drew DeVault
No, the compositor can remember your choice.
So, you would be asking users to click on "I agree" and "remember my choice"
every time they set up a computer then :D Not nearly as bad, but still bad.
Well, I imagine we'd store their choices in a file somewhere. They can
bring that file along for the ride.

--
Drew DeVault
Giulio Camuffo
2016-03-29 05:25:19 UTC
Post by Drew DeVault
I was proposing for applications to just bind the interface and see if it
works or not. But Giulio's proposal makes sense because it could be used to
both grant and revoke rights on the fly.
I think both solutions have similar merit and I don't feel strongly
about either one.
If the client just binds the interface, the compositor needs to
immediately create the resource and send a protocol error if the
client is not authorized. It doesn't have time to ask the user for
input on the matter, while my proposal gives the compositor that time.
Post by Drew DeVault
Post by Drew DeVault
No, the compositor can remember your choice.
So, you would be asking users to click on "I agree" and "remember my choice"
every time they set up a computer then :D Not nearly as bad, but still bad.
Well, I imagine we'd store their choices in a file somewhere. They can
bring that file along for the ride.
--
Drew DeVault
Pekka Paalanen
2016-03-29 09:17:46 UTC
On Tue, 29 Mar 2016 08:25:19 +0300
Post by Giulio Camuffo
Post by Drew DeVault
I was proposing for applications to just bind the interface and see if it
works or not. But Giulio's proposal makes sense because it could be used to
both grant and revoke rights on the fly.
I think both solutions have similar merit and I don't feel strongly
about either one.
If the client just binds the interface, the compositor needs to
immediately create the resource and send a protocol error if the
client is not authorized. It doesn't have time to ask the user for
input on the matter, while my proposal gives the compositor that time.
More precisely, you cannot gracefully fail to use an interface exposed
via wl_registry. It either works, or the client gets disconnected. A
protocol error always means disconnection, and wl_registry has no other
way to communicate a "no, you can't use this".

Checking "whether an interface works or not" is also not trivial. It
would likely lead to adding a "yes, this works" event to all such
interfaces, since anything less explicit is harder than necessary. But
why do that separately in every interface rather than in a common
interface?

Btw. I did say in the past that I didn't quite understand or like
Giulio's proposal, but I have come around since. For the above reasons,
it does make sense on a high level.


Thanks,
pq
Drew DeVault
2016-03-29 11:42:57 UTC
Post by Giulio Camuffo
If the client just binds the interface, the compositor needs to
immediately create the resource and send a protocol error if the
client is not authorized. It doesn't have time to ask the user for
input on the matter, while my proposal gives the compositor that time.
Understood. I'm on board.
Martin Peres
2016-03-28 19:50:23 UTC
Post by Jasper St. Pierre
You're probably referring to my response when you say "GNOME does not
care about cross-platform apps doing privileged operations". My
response wasn't meant to be speaking on behalf of GNOME. These are my
opinions and mine alone.
I must have misremembered then. Sorry.
Post by Jasper St. Pierre
My opinion is still as follows: having seen how SELinux and PAM work
out in practice, I'm skeptical of any "Security Module" which
implements policy. The "module" part of it rarely happens, since
people simply gravitate towards a standard policy. What's interesting
to me isn't a piece of code that allows or rejects operations, it's
the resulting UI *around* those operations and managing them, since
that's really, at the end of the day, all the user cares about.
The UI is definitely the most important part of this work. I think
we already gave many ideas for the UI part in [1].

As much as possible, we want to avoid the traditional
ACL style, because not only is it hard to discover and tweak,
it also gets in the way of users. Instead, we would like
to be as unintrusive as possible.

We thus wanted to let distros take care of most of the policies (which
does not amount to much and will likely come with the application
anyway). However, some distros or devices come with a system
that already defines security policies, and they will likely not want
a proliferation of storage places. That is why we allowed for
multiple backends. But this is the exception rather than the rule.

What we envisioned was that when an app is using a privileged
interface, there would be a new icon in the notification area showing
which app is using which privileged interface.

Also, when right-clicking on a running window, there would be a
"capabilities" entry which, when clicked, would show on top of the
application which capabilities are currently allowed and which are
not. There, the user would be able to change the policy, with
multiple options for each capability:
- Default: (soft/hard allow/deny)
- One-time grant (until the app is closed)
- One-time revoke (revoke the rights immediately)
Post by Jasper St. Pierre
It would be a significant failure to me if we didn't have a standard
way for a user to examine or recall the policy of an application,
using whatever API they wanted. If every module implements its own
policy store separately, such a UI would be extremely difficult to
build.
Yes, you are most definitely right. We only provided a default policy,
but we also need a way to get feedback from the user about his/her
preferences.

One thing that is going to get problematic is what happens when
one updates a piece of software and the policy does not match
anymore. But since it is a pretty coarse-grained policy, it should not
be an issue!

In any case, the user preference could be stored and managed by
libwsm and modules would only be called if no user preference is
found.
Post by Jasper St. Pierre
From what I read, Wayland Security Modules didn't seem to even provide
that as a baseline, which is why I believe they're tackling the
problem from the wrong angle.
We attacked the problem from a UX point of view and then a distro
point of view. The custom configuration was a bit left on the side,
since experienced users could write their own file and more novice
users would likely be more than OK with a runtime GUI to manage
the rights and allow granting privileges when needed.

We most definitely do not wish to ever have something as annoying
as UAC on Linux, constantly asking the user for permission for
things the user has no understanding of.

I really think that the 4 levels proposed by libwsm make a lot of sense
to reduce annoyance as much as possible [2].

[1] http://mupuf.org/blog/2014/03/18/managing-auth-ui-in-linux/
[2] https://github.com/mupuf/libwsm
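
To illustrate the idea (with a syntax invented for this email, not
libwsm's actual file format), a distro-provided default policy using
those four graded levels could look like:

<!-- hypothetical default policy; per-user choices would override it -->
<policy>
  <!-- hard deny: never allowed, not even with a user prompt -->
  <feature name="input-injection" default="hard-deny"/>
  <!-- soft deny: denied unless the user explicitly grants it -->
  <feature name="screen-capture" default="soft-deny"/>
  <!-- soft allow: allowed, but visible and revocable by the user -->
  <feature name="fullscreen" default="soft-allow"/>
  <!-- hard allow: always allowed, no prompt, no indicator -->
  <feature name="output-info" default="hard-allow"/>
</policy>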
Martin Peres
2016-03-30 06:28:03 UTC
Post by Martin Peres
We thus wanted to let distros take care of most of the policies (which
does not amount to much and will likely come with the application
anyway). However, some distros or devices come with a system
that already defines security policies, and they will likely not want
a proliferation of storage places. That is why we allowed for
multiple backends. But this is the exception rather than the rule.
Why should every distribution decide on some policy? The default
should work sanely, in a way that a user would experience as making
sense. I help out with Mageia (+GNOME); I'm 98% sure Mageia has zero
interest in creating/developing such a policy.
In WSM, you can set default behaviours for interfaces. This should cover
your use case.

However, remember this: If it is not the user or the distribution, then
you are basically trusting the developer of the application... which
basically means we are back to the security of X11.
e.g. Linus complaining about (IIRC) needing to provide a root password
after plugging in a printer. If we create such a situation again I might
even understand why he's rants :-P
This would be utterly ridiculous, and this is what we addressed here:
http://mupuf.org/blog/2014/03/18/managing-auth-ui-in-linux/
Jasper St. Pierre
2016-03-30 06:33:03 UTC
I really hope that distributions don't see security policies as a
differentiator. This is how we got SELinux vs. AppArmor and real-world
apps having to ship both kinds of policies (or Fedora flat out
ignoring any idea of third-parties and such and including literally
every application ever in its contrib policy file
https://github.com/fedora-selinux/selinux-policy/tree/f23-contrib).
Post by Martin Peres
Post by Martin Peres
We thus wanted to let distros take care of most of the policies (which
does not amount to much and will likely come with the application
anyway). However, some distros or devices come with a system
that already defines security policies, and they will likely not want
a proliferation of storage places. That is why we allowed for
multiple backends. But this is the exception rather than the rule.
Why should every distribution decide on some policy? The default
should work sanely, in a way that a user would experience as making
sense. I help out with Mageia (+GNOME); I'm 98% sure Mageia has zero
interest in creating/developing such a policy.
In WSM, you can set default behaviours for interfaces. This should cover
your use case.
However, remember this: If it is not the user or the distribution, then you
are basically trusting the developer of the application... which basically
means we are back to the security of X11.
e.g. Linus complaining about (IIRC) needing to provide a root password
after plugging in a printer. If we create such a situation again I might
even understand why he rants :-P
http://mupuf.org/blog/2014/03/18/managing-auth-ui-in-linux/
--
Jasper
Martin Peres
2016-03-30 06:35:49 UTC
Post by Jasper St. Pierre
I really hope that distributions don't see security policies as a
differentiator. This is how we got SELinux vs. AppArmor and real-world
apps having to ship both kinds of policies (or Fedora flat out
ignoring any idea of third-parties and such and including literally
every application ever in its contrib policy file
https://github.com/fedora-selinux/selinux-policy/tree/f23-contrib).
I would also *hate* that. I hate distros that do not ship software as
vanilla as possible :s

However, what may save us here is that the policy is very high-level and
it should be quite hard to differentiate!
Carsten Haitzler (The Rasterman)
2016-03-27 23:55:33 UTC
Post by Drew DeVault
Greetings! I am the maintainer of the Sway Wayland compositor.
http://swaywm.org
It's almost the Year of Wayland on the Desktop(tm), and I have
reached out to each of the projects this message is addressed to (GNOME,
Kwin, and wayland-devel) to collaborate on some shared protocol
extensions for doing a handful of common tasks such as display
configuration and taking screenshots. Life will be much easier for
projects like ffmpeg and imagemagick if they don't have to implement
compositor-specific code for capturing the screen!
I want to start by establishing the requirements for these protocols.
Broadly speaking, I am looking to create protocols for the following
- Screen capture
i can tell you that screen capture is a security sensitive thing and likely
won't get a regular wayland protocol. it definitely won't from e. if you can
capture screen, you can screenscrape. some untrusted game you downloaded for
free can start watching your internet banking and see how much money you have
in which accounts where...

the simple solution is to build it into the wm/desktop itself as an explicit
user action (keypress, menu option etc.) and now it can't be exploited as it's
not programmatically available. :)

i would imagine the desktops themselves would in the end provide video capture
like they would stills.

of course you have the more nasty variety of screencapture which is "screen
sharing" where you don't want to just store to a file but broadcast live. and
this then even gets worse - you would want to be able to inject events -
control the mouse, keyboard etc. from an app. this is a nasty slippery slope that
at least i don't want to walk down any time soon. this is a bit of a pandora's box
of security holes to open up.
Post by Drew DeVault
- Output configuration
why? currently pretty much every desktop provides its OWN output configuration
tool that is part of the desktop environment. why do you want to re-invent
randr here allowing any client to mess with screen config. after YEARS of games
using xvidtune and what not to mess up screen setups this would be a horrible
idea. if you want to make a presentation tool that uses 1 screen for output and
another for "controls" then that's a matter of providing info that multiple
displays exist and what type they may be (internal, external) and clients can
tag surfaces with "intents" eg - this is a control surface, this is an
output/display surface. compositor will then assign them appropriately.

same for games. same for media usage. etc. - there is little to no need for
clients to go messing with screen setup. this is a desktop/compositor task that
will be handled by that DE as it sees fit (some may implement a wl protocol but
only on a specific FD - maybe a socketpair to a forked child) or something dbus
or some private protocol or maybe even build it directly in to the compositor.
the same technique can be used to allow extended protocol for specific clients
too (socketpair etc.) but just don't expose at all what is not needed.
Post by Drew DeVault
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
that seems sensible and over time i can imagine this will expand.
Post by Drew DeVault
- Input device configuration
as above. i see no reason clients should be doing this. surface
intents/roles/whatever can deal with this. compositor may alter how an input
device works for that surface based on this.
Post by Drew DeVault
I think that these are the core protocols necessary for
cross-compositor compatibility and to support most existing tools for
X11 like ffmpeg. Considering the security goals of Wayland, it will also
likely be necessary to implement some kind of protocol for requesting
and granting sensitive permissions to clients.
How does this list look? What sorts of concerns do you guys have with
respect to what features each protocol needs to support? Have I missed
any major protocols that we'll have to work on? Once we have a good list
of requirements I'll start writing some XML.
as above. anything apps have no business messing with i have no interest in
having any protocol for. input device config, screen setup config etc. etc. for
sure. screen capture is a nasty one and for now - no. no access. for the common
case the DE can do it. for screen sharing kind of things... you also need input
control (take over mouse and be able to control from app - or create a 2nd
mouse pointer and control that... keyboard - same, etc. etc. etc.). this is a
nasty little thing and in implementing something like this you are also forcing
compositors to work in specific ways - eg screen capture will likely FORCE the
compositor to merge it all into a single ARGB buffer for you rather than just
assign it to hw layers. or perhaps it would require just exposing all the
layers, their config and have the client "deal with it" ? but that means the
compositor needs to expose its screen layout. do you include pointer or not?
compositor may draw ptr into the framebuffer. it may use a special hw layer.
what about if the compositor defers rendering - does a screen capture api force
the compositor to render when the client wants? this can have all kinds of
nasty effects in the rendering pipeline - for us our rendering pipeline is
not in the compositor but via the same libraries clients use so altering this
pipeline affects regular apps as well as compositor. ... can of worms :)
Post by Drew DeVault
--
Drew DeVault
--
------------- Codito, ergo sum - "I code, therefore I am" --------------
The Rasterman (Carsten Haitzler) ***@rasterman.com
Drew DeVault
2016-03-28 02:29:57 UTC
Post by Carsten Haitzler (The Rasterman)
i can tell you that screen capture is a security sensitive thing and likely
won't get a regular wayland protocol. it definitely won't from e. if you can
capture screen, you can screenscrape. some untrusted game you downloaded for
free can start watching your internet banking and see how much money you have
in which accounts where...
Right, but there are legitimate use cases for this feature as well. It's
also true that if you have access to /dev/sda you can read all of the
user's files, but we still have tools like mkfs. We just put them behind
some extra security, i.e. you have to be root to use mkfs.
Post by Carsten Haitzler (The Rasterman)
the simple solution is to build it into the wm/desktop itself as an explicit
user action (keypress, menu option etc.) and now it can't be exploited as it's
not programmatically available. :)
i would imagine the desktops themselves would in the end provide video capture
like they would stills.
I'd argue that this solution is far from simple. Instead, it moves *all*
of the responsibilities of your entire desktop into one place, and one
codebase. And consider the staggering amount of work that went into
making ffmpeg, which has well over 4x as many git commits as enlightenment.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
- Output configuration
why? currently pretty much every desktop provides its OWN output configuration
tool that is part of the desktop environment. why do you want to re-invent
randr here allowing any client to mess with screen config. after YEARS of games
using xvidtune and what not to mess up screen setups this would be a horrible
idea. if you want to make a presentation tool that uses 1 screen for output and
another for "controls" then that's a matter of providing info that multiple
displays exist and what type they may be (internal, external) and clients can
tag surfaces with "intents" eg - this is a control surface, this is an
output/display surface. compositor will then assign them appropriately.
There's more than desktop environments alone out there. Not everyone
wants to go entirely GTK or Qt or EFL. I bet everyone on this ML has
software on their computer that uses something other than the toolkit of
their choice. Some people like piecing their system together and keeping
things lightweight, and choosing the best tool for the job. Some people
might want to use the KDE screengrab tool on e, or perhaps some other
tool that's more focused on doing just that job and doing it well. Or
perhaps there's existing tools like ImageMagick that are already written
into scripts and provide a TON of options to the user, which could be
much more easily patched with support for some standard screengrab
protocol than to implement all of its features in 5 different desktops.

We all have to implement output configuration, so why not do it the same
way and share our API? I don't think we need to let any client
manipulate the output configuration. We need to implement a security
model for this like all other elevated permissions.

Using some kind of intents system to communicate things like Impress
wanting to use one output for presentation and another for notes is
going to get out of hand quickly. There are just so many different
"intents" that are solved by just letting applications configure outputs
when it makes sense for them to. The code to handle this in the
compositor is going to become an incredibly complicated mess that rivals
even xorg in complexity. We need to avoid making the same mistakes
again. If we don't focus on making it simple, then in 15 years we're
going to be writing a new protocol and making a new set of mistakes. X
does a lot of things wrong, but the tools around it have a respect for
the Unix philosophy that we'd be wise to consider.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
that seems sensible and over time i can imagine this will expand.
Cool. Suggestions for what sort of capability this protocol should
have, what kind of surface roles we will be looking at? We should
consider a few things. Normal windows, of course, which on compositors
like Sway would be tiled. Then there's floating windows, like
gnome-calculator, that are better off not being tiled. Modals would be
something that pops up and prevents the parent window from being
interacted with, like some sort of alert (though preventing this
interactivity might not be the compositor's job). Then we have some
roles like dmenu would use, where the tool would like to arrange itself
(perhaps this would demand another permission?) Surfaces that want to be
fullscreen could be another. We should also consider additional settings
a surface might want, like negotiating for who draws the decorations or
whether or not it should appear in a taskbar sort of interface.
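
A rough sketch of how some of those roles and settings could be
expressed (all names here are invented on the spot):

<?xml version="1.0" encoding="UTF-8"?>
<protocol name="surface_roles_sketch">
  <!-- hypothetical per-surface hints, complementing xdg shell -->
  <interface name="zz_surface_hints_v1" version="1">
    <enum name="role">
      <entry name="normal" value="0" summary="tiled on compositors like Sway"/>
      <entry name="floating" value="1" summary="e.g. gnome-calculator"/>
      <entry name="modal" value="2" summary="blocks its parent window"/>
      <entry name="self_positioned" value="3"
             summary="dmenu-style; may require a permission"/>
      <entry name="fullscreen" value="4"/>
    </enum>
    <request name="set_role">
      <arg name="role" type="uint" enum="role"/>
    </request>
    <!-- negotiating decorations and taskbar visibility -->
    <request name="set_decorations">
      <arg name="client_side" type="uint"
           summary="1 if the client wants to draw its own"/>
    </request>
    <request name="set_skip_taskbar"/>
  </interface>
</protocol>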
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
- Input device configuration
as above. i see no reason clients should be doing this. surface
intents/roles/whatever can deal with this. compositor may alter how an input
device works for that surface based on this.
I don't feel very strongly about input device configuration as a
protocol here, but it's something that many of Sway's users are asking
for. People are trying out various compositors and may switch back and
forth depending on their needs and they want to configure all of their
input devices the same way.

However, beyond detailed input device configuration, there are some
other things that we should consider. Some applications (games, vnc,
etc) will want to capture the mouse and there should be a protocol for
them to indicate this with (perhaps again associated with special
permissions). Some applications (like Krita) may want to do things like
take control of your entire drawing tablet.
Post by Carsten Haitzler (The Rasterman)
[snip] screen capture is a nasty one and for now - no. no access [snip]
Wayland has been in the making for 4 years. Fedora is thinking about
shipping it by default. We need to quit with this "not for now" stuff
and start thinking about legitimate use-cases that we're killing off
here. The problems are not insurmountable and they are going to kill
Wayland adoption. We should not force Wayland upon our users, we should
make it something that they *want* to switch to. I personally have
gathered a lot of interest in Sway and Wayland in general by
livestreaming development of it from time to time, which has led to more
contributors getting in on the code and more people advocating for us to
get Wayland out there.
Post by Carsten Haitzler (The Rasterman)
for the common case the DE can do it. for screen sharing kind of
things... you also need input control (take over mouse and be able to
control from app - or create a 2nd mouse pointer and control that...
keyboard - same, etc. etc. etc.). [snip]
Screen sharing for VOIP applications is only one of many, many use-cases
for being able to get the pixels from your screen. VNC servers,
recording video to provide better bug reports or to demonstrate
something, and so on. We aren't opening Pandora's box here; just
supporting video capture doesn't mean you need to support all of these
complicated and dangerous things as well.
Post by Carsten Haitzler (The Rasterman)
nasty little thing and in implementing something like this you are also forcing
compositors to work in specific ways - eg screen capture will likely FORCE the
compositor to merge it all into a single ARGB buffer for you rather than just
assign it to hw layers. or perhaps it would require just exposing all the
layers, their config and have the client "deal with it" ? but that means the
compositor needs to expose its screen layout. do you include pointer or not?
compositor may draw ptr into the framebuffer. it may use a special hw layer.
what about if the compositor defers rendering - does a screen capture api force
the compositor to render when the client wants? this can have all kinds of
nasty effects in the rendering pipeline - for us our rendering pipeline is
not in the compositor but via the same libraries clients use so altering this
pipeline affects regular apps as well as compositor. ... can of worms :)
All of this would still be a problem if you want to support video
capture at all. You have to get the pixels into your encoder somehow.
There might be performance costs, but we aren't recording video all the
time.

We can make Wayland support use-cases that are important to our users or
we can watch them stay on xorg perpetually and end up maintaining two
graphical stacks forever.

--
Drew DeVault
Carsten Haitzler (The Rasterman)
2016-03-28 05:13:21 UTC
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
i can tell you that screen capture is a security sensitive thing and likely
won't get a regular wayland protocol. it definitely won't from e. if you can
capture screen, you can screenscrape. some untrusted game you downloaded for
free can start watching your internet banking and see how much money you
have in which accounts where...
Right, but there are legitimate use cases for this feature as well. It's
also true that if you have access to /dev/sda you can read all of the
user's files, but we still have tools like mkfs. We just put them behind
some extra security, i.e. you have to be root to use mkfs.
yes but you need permission and that is handled at kernel level on a specific
file. not so here. compositor runs as a specific user and so you can't do that.
you'd have to do in-compositor security client-by-client.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
the simple solution is to build it into the wm/desktop itself as an explicit
user action (keypress, menu option etc.) and now it can't be exploited as
it's not programmatically available. :)
i would imagine the desktops themselves would in the end provide video
capture like they would stills.
I'd argue that this solution is far from simple. Instead, it moves *all*
of the responsibilities of your entire desktop into one place, and one
codebase. And consider the staggering amount of work that went into
making ffmpeg, which has well over 4x the git commits as enlightenment.
you wouldn't recreate ffmpeg. ffmpeg produces libraries like avcodec. like a
reasonable developer we'd just use their libraries to do the encoding - we'd
capture frames and then hand off to avcodec (ffmpeg) library routines to do the
rest. ffmpeg doesn't need to know how to capture - just to do what 99% of its
code is devoted to doing - encode/decode. :) that's rather simple. already we
have decoding wrapped - we sit on top of either gstreamer, vlc or xine as the
codec engine and just glue in output and control api's and events. encoding is
just the same but in reverse. :) the encapsulation is simple.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
- Output configuration
why? currently pretty much every desktop provides its OWN output
configuration tool that is part of the desktop environment. why do you want
to re-invent randr here allowing any client to mess with screen config.
after YEARS of games using xvidtune and what not to mess up screen setups
this would be a horrible idea. if you want to make a presentation tool that
uses 1 screen for output and another for "controls" then that's a matter of
providing info that multiple displays exist and what type they may be
(internal, external) and clients can tag surfaces with "intents" eg - this
is a control surface, this is an output/display surface. compositor will
then assign them appropriately.
There's more than desktop environments alone out there. Not everyone
wants to go entirely GTK or Qt or EFL. I bet everyone on this ML has
software on their computer that uses something other than the toolkit of
their choice. Some people like piecing their system together and keeping
things lightweight, and choosing the best tool for the job. Some people
might want to use the KDE screengrab tool on e, or perhaps some other
tool that's more focused on doing just that job and doing it well. Or
perhaps there's existing tools like ImageMagick that are already written
into scripts and provide a TON of options to the user, which could be
much more easily patched with support for some standard screengrab
protocol than to implement all of its features in 5 different desktops.
the expectation is there won't be generic tools but desktop specific ones. the
CURRENT ecosystem of tools exists because that is the way x was designed to
work. thus the state of software matches its design. wayland is different. thus
tools and ecosystem will adapt.

as for output config - why would the desktops that already have their own tools
then want to support OTHER tools too? their tools integrate with their settings
panels and look and feel right and support THEIR policies.

let me give you an example:

[screenshot: enlightenment's screen configuration dialog]

bottom-right - i can assign special scale factors and different toolkit
profiles per screen. eg one screen can be a desktop, one a media center style,
one a mobile "touch centric ui" etc. etc. - this is part of the screen setup
tool. a generic tool will miss features that make the desktop nice and
functional for its purposes. do you want to go create some kind of uber
protocol that every de has to support with every other de's feature set in it
and limit de's to modifying the protocol because they now have to go through a
shared protocol in libwayland that they can't just add features to as they
please? ok - so these features will be added ad hoc in extra protocols so now
you have a bit of a messy protocol with 1 protocol referring to another... and
the "kde tool" messes up on e or the e tool messes up in gnome because all
these extra features are either not even supported by the tool or existing
features don't work because the de doesn't support those extensions?

just "i want to use the kde screen config tool" is not reason enough for there
to be a public/shared/common protocol. it will fall apart quickly like above
and simply mean work for most people to go support it rather than actual value.
Post by Drew DeVault
We all have to implement output configuration, so why not do it the same
way and share our API? I don't think we need to let any client
no - we don't have to implement it as a protocol. enlightenment needs zero
protocol. it's done by the compositor. the compositor's own tool is simply a
settings dialog inside the compositor itself. no protocol. not even a tool.
it's the same as edit/tools -> preferences in most gui apps. it's just a dialog
the app shows to configure itself.

chances are gnome likely will do this via dbus (they love dbus :)). kde - i
don't know. but not everyone is implementing a wayland protocol at all so
assuming they are and saying "do it the same way" is not necessarily saving any
work.
Post by Drew DeVault
manipulate the output configuration. We need to implement a security
model for this like all other elevated permissions.
like above. if gnome uses dbus - they will use polkit etc. etc. to decide that.
enlightenment doesn't even need to because there isn't even a protocol nor an
external tool - it's built directly in.
Post by Drew DeVault
Using some kind of intents system to communicate things like Impress
wanting to use one output for presentation and another for notes is
going to get out of hand quickly. There are just so many different
"intents" that are solved by just letting applications configure outputs
even impress doesn't configure outputs. thank god for that.
Post by Drew DeVault
when it makes sense for them to. The code to handle this in the
compositor is going to become an incredibly complicated mess that rivals
even xorg in complexity. We need to avoid making the same mistakes
again. If we don't focus on making it simple, then in 15 years we're
going to be writing a new protocol and making a new set of mistakes. X
does a lot of things wrong, but the tools around it have a respect for
the Unix philosophy that we'd be wise to consider.
how would it be complex. a compositor is already, if decent, going to handle
multiple outputs. it's either going to auto-configure new ones to extend/clone
or maybe pop up a settings dialog. e already does this for example and
remembers config for that screen (edid+output) so plug it in a 2nd time and it
automatically uses the last stored config for that. so the screen will "work"
as basically a byproduct of making a compositor that can do multiple outputs.

then intents are only a way of deciding where a surface is to be displayed -
rather than on the current desktop/screen.

so simply mark a surface as "for presentation" and the compositor will put it
on the non-internal display (chosen maybe by physical size reported in edid as
the larger one, or by elimination - it's on the screen OTHER than the
internal... maybe user simply marks/checkboxes that screen as "use this
screen for presenting" and all apps that want to present get their content
there etc.)

so what you are saying is it's better to duplicate all this logic of screen
configuration inside every app that wants to present things (media players -
play movie on presentation screen, ppt/impress/whatever show presentation there,
etc. etc.) and how to configure the screen etc. etc., rather than have a simple
tag/intent and let your de/wm/compositor "deal with it" universally for all
such apps in a consistent way?
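
as a sketch (names made up on the spot), the client side of such an
intent could be a single request - the compositor keeps all the screen
configuration logic to itself:

<?xml version="1.0" encoding="UTF-8"?>
<protocol name="surface_intent_sketch">
  <interface name="zz_surface_intent_v1" version="1">
    <enum name="intent">
      <entry name="none" value="0"/>
      <entry name="presentation" value="1" summary="slides, movie playback"/>
      <entry name="controls" value="2" summary="presenter notes, remotes"/>
    </enum>
    <!-- the client only says what the surface is for; where it
         goes is entirely the compositor's decision -->
    <request name="set_intent">
      <arg name="intent" type="uint" enum="intent"/>
    </request>
  </interface>
</protocol>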
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
that seems sensible and over time i can imagine this will expand.
Cool. Suggestions for what sort of capability thiis protocol should
have, what kind of surface roles we will be looking at? We should
consider a few things. Normal windows, of course, which on compositors
like Sway would be tiled. Then there's floating windows, like
ummm what's the difference between floating and normal? apps like gnome
calculator just open ... normal windows.
Post by Drew DeVault
gnome-calculator, that are better off being tiled. Modals would be
something that pops up and prevents the parent window from being
interacted with, like some sort of alert (though preventing this
interactivity might not be the compositor's job). Then we have some
yeah - good old "transient for" :)
Post by Drew DeVault
roles like dmenu would use, where the tool would like to arrange itself
(perhaps this would demand another permission?) Surfaces that want to be
fullscreen could be another. We should also consider additional settings
a surface might want, like negotiating for who draws the decorations or
whether or not it should appear in a taskbar sort of interface.
xdg shell should be handling these already - except dmenu. dmenu is almost a
special desktop component. like a shelf/panel/bar thing.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
- Input device configuration
as above. i see no reason clients should be doing this. surface
intents/roles/whatever can deal with this. compositor may alter how an input
device works for that surface based on this.
I don't feel very strongly about input device configuration as a
protocol here, but it's something that many of Sway's users are asking
for. People are trying out various compositors and may switch back and
forth depending on their needs and they want to configure all of their
input devices the same way.
they are going to have to deal with this then. already gnome and kde and e will
all configure mouse accel/left/right mouse on their own based on settings. yes
- i can RUN xset and set it back later but it's FIGHTING with your DE. wayland
is the same. use the desktop tools for this :) yes - it'll change between
compositors. :) at least in wayland you can't fight with the compositor here.
for sway - you are going to have to write this yourself. eg - write tools that
talk to sway or sway reads a cfg file you edit or whatever. :)
Post by Drew DeVault
However, beyond detailed input device configuration, there are some
other things that we should consider. Some applications (games, vnc,
etc) will want to capture the mouse and there should be a protocol for
them to indicate this with (perhaps again associated with special
permissions). Some applications (like Krita) may want to do things like
take control of your entire drawing tablet.
as i said. can of worms. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
[snip] screen capture is a nasty one and for now - no. no access [snip]
Wayland has been in the making for 4 years. Fedora is thinking about
shipping it by default. We need to quit with this "not for now" stuff
and start thinking about legitimate use-cases that we're killing off
here. The problems are not insurmountable and they are going to kill
Wayland adoption. We should not force Wayland upon our users, we should
make it something that they *want* to switch to. I personally have
gathered a lot of interest in Sway and Wayland in general by
livestreaming development of it from time to time, which has led to more
contributors getting in on the code and more people advocating for us to
get Wayland out there.
you have no idea how many non-security-sensitive things need fixing first
before addressing the can-of-worms problems. hell nvidia just released drivers
that require compositors to re-do how they talk to egl/kms/drm in a way that's
not compatible with existing drm dmabuf buffers etc. etc.

there's lots of things to solve like window "intents/tags/etc." that are not
security sensitive.

even clients and decorations. tiled wm's will not want clients to add
decorations with shadows etc. - currently clients will do csd because csd is
what weston chose and gnome has followed and enlightenment too. kde do not want
to do csd. i think that's wrong. it adds complexity to wayland just to "not
follow the convention". but for tiling i see the point of at least removing the
shadows. clients may choose to slap a title bar there still because it's useful
for displaying state. but advertising this info from the compositor is not
standardized. what do you advertise to clients? where/when? at connect time? at
surface creation time? what negotiation is it? it easily could be that 1
screen or desktop is tiled and another is not and you don't know what to tell
the client until it has created a surface and you know where that surface would
go. perhaps this might be part of a larger set of negotiation like "i am a
mobile app so please stick me on the mobile screen" or "i'm a desktop app -
desktop please" then with the compositor saying where it decided to allocate
you (no mobile screen available - you are on desktop) and app is expected to
adapt...

these are not security can-of-worms things. most de's are still getting to the
point of "usable" atm without worrying about all of these extras yet.

there's SIMPLE stuff like - what happens when compositor crashes? how do we
handle this? do you really want to lose all your apps when compositors crash?
what should clients do? how do we ensure clients are restored to the same place
and state? crash recovery is important because it is always what allows
updates/upgrades without losing everything. THIS stuff is still "unsolved".
i'm totally not concerned about screen casting or vnc etc. etc. until all of
these other nigglies are well solved first.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
for the common case the DE can do it. for screen sharing kind of
things... you also need input control (take over mouse and be able to
control from app - or create a 2nd mouse pointer and control that...
keyboard - same, etc. etc. etc.). [snip]
Screen sharing for VOIP applications is only one of many, many use-cases
for being able to get the pixels from your screen. VNC servers,
recording video to provide better bug reports or to demonstrate
something, and so on. We aren't opening pandora's box here; just
supporting video capture doesn't mean you need to support all of these
complicated and dangerous things as well.
apps can show their own content for their own bug reporting. for system-wide
reporting this will be DE integrated anyway. supporting video capture is a
can of worms. as i said - single buffer? multiple with metadata? who does
conversion/scaling/transforms? what is the security model? and as i said - this
has major implications for the rendering back-end of a compositor.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
nasty little thing and in implementing something like this you are also
forcing compositors to work in specific ways - eg screen capture will
likely FORCE the compositor to merge it all into a single ARGB buffer for
you rather than just assign it to hw layers. or perhaps it would require
just exposing all the layers, their config and have the client "deal with
it" ? but that means the compositor needs to expose its screen layout. do
you include pointer or not? compositor may draw ptr into the framebuffer.
it may use a special hw layer. what about if the compositor defers
rendering - does a screen capture api force the compositor to render when
the client wants? this can have all kinds of nasty effects in the rendering
pipeline - for us, our rendering pipeline is not in the compositor but via
the same libraries clients use, so altering this pipeline affects regular
apps as well as compositor. ... can of worms :)
All of this would still be a problem if you want to support video
capture at all. You have to get the pixels into your encoder somehow.
There might be performance costs, but we aren't recording video all the
time.
there's a difference. when it's an internal detail it can be changed and
adapted to how the compositor and its rendering subsystem work. when it's a
protocol you HAVE to support THAT protocol and the way THAT protocol defines
things to work or apps break.

keep it internal - you can break at will and adapt as needed, make it public
and you are boxed in by what the public api allows.
Post by Drew DeVault
We can make Wayland support use-cases that are important to our users or
we can watch them stay on xorg perpetually and end up maintaining two
graphical stacks forever.
priorities. there are other issues that should be solved first before worrying
about the pandora's box ones.
--
------------- Codito, ergo sum - "I code, therefore I am" --------------
The Rasterman (Carsten Haitzler) ***@rasterman.com
Drew DeVault
2016-03-28 13:00:34 UTC
Permalink
Post by Carsten Haitzler (The Rasterman)
yes but you need permission and that is handled at kernel level on a specific
file. not so here. compositor runs as a specific user and so you can't do that.
you'd have to do in-compositor security client-by-client.
It is different, but we should still find a way to do it. After all,
we're going to be in a similar situation eventually where we're running
sandboxed applications and the compositor is granting rights from the
same level of privilege as the kernel provides to root users (in this
case, the role is almost that of a hypervisor and a guest).
Post by Carsten Haitzler (The Rasterman)
you wouldn't recreate ffmpeg. ffmpeg produces libraries like avcodec. like a
reasonable developer we'd just use their libraries to do the encoding - we'd
capture frames and then hand off to avcodec (ffmpeg) library routines to do the
rest. ffmpeg doesn't need to know how to capture - just to do what 99% of its
code is devoted to doing - encode/decode. :) that's rather simple. already we
have decoding wrapped - we sit on top of either gstreamer, vlc or xine as the
codec engine and just glue in output and control api's and events. encoding is
just the same but in reverse. :) the encapsulation is simple.
True, most of the work is in avcodec. However, there's more to
it than that. The entire command line interface of ffmpeg would be
nearly impossible to build into the compositor effectively. With ffmpeg
I can capture x, flip it, paint it sepia, add a logo to the corner, and
mux it with my microphone and a capture of the speakers (thanks,
pulseaudio) and add a subtitle track while I'm at it. Read the ffmpeg
man pages. ffmpeg-all(1) is 23,191 lines long on my terminal (that's
just the command line interface, not avcodec). There's no way in hell
all of the compositors/DEs are going to be able to fulfill all of its
use cases, nor do I think we should be trying to.

Look at things like OBS. It lets you specify detailed encoding options
and composites a scene from multiple video sources and audio sources,
as well as letting the user switch between different scenes with
configurable transitions. It even lets you embed a web browser into the
final result! All of this with a nice GUI to top it off. Again, we can't
possibly hope to effectively implement all of this in the compositor/DE,
or the features of the other software that we haven't even thought of.
Post by Carsten Haitzler (The Rasterman)
the expectation is there won't be generic tools but desktop specific ones. the
CURRENT ecosystem of tools exist because that is the way x was designed to
work. thus the state of software matches its design. wayland is different. thus
tools and ecosystem will adapt.
That expectation is misguided. I like being able to write a script to
configure my desktop layout between several presets. Here's an example -
a while ago, I used a laptop at work that could be plugged into a
docking station. I would close the lid and use external displays at my
desk. I wanted to automatically change the screen layout when I came and
went, so I wrote a script that used xrandr to do it. It detected when
there were new outputs plugged in, then disabled the laptop screen and
enabled+configured the two new screens in the correct position and
resolution. This was easy for me to configure to behave the way I wanted
because the tooling was flexible and cross-desktop. Sure, we could make
each compositor support it, but each one is going to do it differently
and in their own subtly buggy ways and with their own subset of the
total possible features and use-cases, and none of them are going to
address every possible scenario.
Post by Carsten Haitzler (The Rasterman)
as for output config - why would the desktops that already have their own tools
then want to support OTHER tools too? their tools integrate with their settings
panels and look and feel right and support THEIR policies.
Base your desktop's tools on the common protocol, of course. Gnome
settings, KDE settings, arandr, xrandr, nvidia-settings, and so on, all
seem to work fine configuring your outputs with the same protocol today.
Yes, the protocol is meh and the implementation is a mess, but the
clients of that protocol aren't bad by any stretch of the imagination.
Post by Carsten Haitzler (The Rasterman)
http://devs.enlightenment.org/~raster/ssetup.png
[snip]
This is a very interesting screenshot, and I hadn't considered this. I
don't think it's an unsolvable problem, though - we can make the
protocol flexible enough to allow compositor-specific metadata to be
added and configurable. These are the sorts of requirements I want to be
gathering to design this protocol with.
Post by Carsten Haitzler (The Rasterman)
no - we don't have to implement it as a protocol. enlightenment needs zero
protocol. it's done by the compositor. the compositors own tool is simply a
settings dialog inside the compositor itself. no protocol. not even a tool.
it's the same as edit/tools -> preferences in most gui apps. it's just a dialog
the app shows to configure itself.
I currently do several things in different processes/binaries that
enlightenment does in the compositor, things like the bar and the
wallpaper. I don't want to make an output configuration GUI tool nested
into the compositor; it's out of scope.
Post by Carsten Haitzler (The Rasterman)
chances are gnome likely will do this via dbus (they love dbus :)). kde - i
don't know. but not everyone is implementing a wayland protocol at all so
assuming they are and saying "do it the same way" is not necessarily saving any
work.
We're all writing wayland compositors here. We may not all have dbus or
whatever else in common, but we do have the wayland protocol in common,
and it can support this use-case. It makes sense to use it.
Post by Carsten Haitzler (The Rasterman)
then intents are only a way of deciding where a surface is to be displayed -
rather than on the current desktop/screen.
so simply mark a surface as "for presentation" and the compositor will put it
on the non-internal display (chosen maybe by physical size reported in edid as
the larger one, or by elimination - its on the screen OTHER than the
internal... maybe user simply marks/checkboxes that screen as "use this
screen for presenting" and all apps that want so present get their content
there etc.)
Man, this is going to get really complicated. How do you decide what
display is "internal" or not? What if the user wants to present on their
primary display? What about applications that use the entire output for
things other than presentations? What if the application wants to use
several outputs, and for different purposes? What language are you going
to use to describe these settings to the user in a way that makes more
sense than the clients describing for themselves why they need to use a
particular output?
Post by Carsten Haitzler (The Rasterman)
so what you are saying is it's better to duplicate all this logic of screen
configuration inside every app that wants to present things (media players -
play movie on presentation screen, ppt/impress/whatever show presentation there,
etc. etc.) and how to configure the screen etc. etc., rather than have a simple
tag/intent and let your de/wm/compositor "deal with it" universally for all
such apps in a consistent way?
No. Applications want to be full screen or they don't want to be. If
they want to pick a particular output, we can easily let them do so.
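Core Wayland already has the vocabulary for this, for what it's worth:
the fullscreen request takes an optional output, so a client can name
one or pass NULL to let the compositor decide. Assuming shell_surface
and output were bound earlier by the usual registry dance:

    /* real wl_shell API (xdg-shell's set_fullscreen is analogous) */
    wl_shell_surface_set_fullscreen(shell_surface,
            WL_SHELL_SURFACE_FULLSCREEN_METHOD_DEFAULT,
            0,        /* framerate hint; 0 means "don't care" */
            output);  /* a specific wl_output, or NULL for "you pick" */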
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Cool. Suggestions for what sort of capability this protocol should
have, what kind of surface roles we will be looking at? We should
consider a few things. Normal windows, of course, which on compositors
like Sway would be tiled. Then there's floating windows, like
ummm what's the difference between floating and normal? apps like gnome
calculator just open ... normal windows.
Gnome calculator doesn't like being tiled: https://sr.ht/Ai5N.png

There are probably some other applications that would very much like to
be shown at a particular aspect ratio or resolution.
Post by Carsten Haitzler (The Rasterman)
xdg shell should be handling these already - except dmenu. dmenu is almost a
special desktop component. like a shelf/panel/bar thing.
dmenu isn't the only one, though, that may want to arrange itself in
special ways. Lemonbar and rofi also come to mind.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
[input is] something that many of Sway's users are asking for.
they are going to have to deal with this then. already gnome and kde and e will
all configure mouse accel/left/right mouse on their own based on settings. yes
- i can RUN xset and set it back later but it's FIGHTING with your DE. wayland
is the same. use the desktop tools for this :) yes - it'll change between
compositors. :) at least in wayland you can't fight with the compositor here.
for sway - you are going to have to write this yourself. eg - write tools that
talk to sway or sway reads a cfg file you edit or whatever. :)
I've already written this into sway, fwiw, in your config file. I think
this is fine, too, and I intend to keep supporting configuring outputs
like that. But consider the use case of Krita, or video games like Osu!
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
However, beyond detailed input device configuration, there are some
other things that we should consider. Some applications (games, vnc,
etc) will want to capture the mouse and there should be a protocol for
them to indicate this with (perhaps again associated with special
permissions). Some applications (like Krita) may want to do things like
take control of your entire drawing tablet.
as i said. can of worms. :)
It's a can of worms we should deal with, and one that I don't think is
hard to deal with. libinput lets you configure a handful of details
about input devices. Let's expose these things in a protocol.
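For the record, the surface area we'd be exposing is small - libinput
already models it as a few config calls per device, e.g.:

    #include <libinput.h>

    /* real libinput API - roughly the set of knobs a protocol would
     * need to carry */
    static void configure_device(struct libinput_device *dev)
    {
        /* pointer acceleration, normalized to [-1, 1] */
        libinput_device_config_accel_set_speed(dev, 0.5);
        /* tap-to-click on touchpads */
        libinput_device_config_tap_set_enabled(dev,
                LIBINPUT_CONFIG_TAP_ENABLED);
        /* left-handed button mapping */
        libinput_device_config_left_handed_set(dev, 1);
    }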
Post by Carsten Haitzler (The Rasterman)
you have no idea how many non-security-sensitive things need fixing first
before addressing the can-of-worms problems. hell nvidia just released drivers
that require compositors to re-do how they talk to egl/kms/drm to work that's
not compatible with existing drm dmabuf buffers etc. etc.
Why do those things need to be dealt with first? Sway is at a good spot
where I can start thinking about these sorts of things. There are
enough people involved to work on multiple things at once. Plus,
everyone thinks nvidia's design is bad and we're hopefully going to see
something from them that avoids vendor-specific code.

I don't see these problems as a can of worms. I see them as problems
that are solvable and necessary to solve, and now is a good time to
solve them. My compositor is coming up on version 1.0. Supporting the
APIs is the driver's problem; we've described the spec and as soon as
they implement it, it will Just Work(tm).
Post by Carsten Haitzler (The Rasterman)
even clients and decorations. tiled wm's will not want clients to add
decorations with shadows etc. - currently clients will do csd because csd is
what weston chose and gnome has followed and enlightenment too. kde do not want
to do csd. i think that's wrong.
What is a can of worms is the argument over whether or not we should use
CSD or SSD. I fall in the latter camp, but I don't think we need to
fight over it now. We should be able to agree that a protocol for
negotiating whether or not borders are drawn would be reasonable. Is it
a GTK app that does nothing interesting with its titlebar? Well, if the
compositor wants to draw its borders, then let it do so. Does it do
fancy GTK stuff with the borders? Well, no, mister compositor, I want to
do fancy things. Easy enough.
Post by Carsten Haitzler (The Rasterman)
it adds complexity to wayland just to "not follow the convention". but
for tiling i see the point of at least removing the shadows. clients
may choose to slap a title bar there still because it's useful
for displaying state. but advertising this info from the compositor is not
standardized. what do you advertise to clients? where/when? at connect
time? at surface creation time? what negotiation is it? it easily
could be that 1 screen or desktop is tiled and another is not and you
don't know what to tell the client until it has created a surface and
you know where that surface would go. perhaps this might be part of a
larger set of negotiation like "i am a mobile app so please stick me
on the mobile screen" or "i'm a desktop app - desktop please" then
with the compositor saying where it decided to allocate you (no mobile
screen available - you are on desktop) and app is expected to adapt...
In Wayland you create a surface, then assign it a role. Extra details
can go in between, or go in the call that gives it a role. Right now
most applications are creating their surface and then making it a shell
surface. The compositor can negotiate based on its own internal state
over whether a given output is tiled or not, or in cases like AwesomeWM,
whether a given workspace is tiled or not. And I don't think the
decision has to be final. If the window is moved to another output or
really if any of the circumstances change, they can renegotiate and the
surface can start drawing its own decorations.
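In code the ordering looks like this, against the unstable xdg-shell
bindings of the day (header and interface names have shifted between
protocol revisions):

    #include <wayland-client.h>
    #include "xdg-shell-client-protocol.h" /* unstable, pre-1.0 */

    static struct xdg_surface *
    make_toplevel(struct wl_compositor *compositor, struct xdg_shell *shell)
    {
        /* a bare wl_surface has no role yet - it's just a pixel
         * container */
        struct wl_surface *surface =
                wl_compositor_create_surface(compositor);
        /* assigning the role is the natural point for the compositor
         * to start negotiating decorations, tiling state, and so on */
        return xdg_shell_get_xdg_surface(shell, surface);
    }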
Post by Carsten Haitzler (The Rasterman)
there's SIMPLE stuff like - what happens when compositor crashes? how do we
handle this? do you really want to lose all your apps when compositors crash?
what should clients do? how do we ensure clients are restored to the same place
and state? crash recovery is important because it is always what allows
updates/upgrades without losing everything. THIS stuff is still "unsolved".
i'm totally not concerned about screen casting or vnc etc. etc. until all of
these other nigglies are well solved first.
I'm still not on board with all of this "first" stuff. I don't see any
reason why we have to order ourselves like this. It all needs to get
done at some point. Right now we haven't standardized anything, and each
compositor is using its own unique, incompatible way of taking
screenshots and recording videos, and each is probably introducing some
kind of security problem.
Post by Carsten Haitzler (The Rasterman)
apps can show their own content for their own bug reporting. for system-wide
reporting this will be DE integrated anyway. supporting video capture is a
can of worms. as i said - single buffer? multiple with metadata? who does
conversion/scaling/transforms? what is the security model? and as i said - this
has major implications for the rendering back-end of a compositor.
The compositor hands RGBA (or ARGB, whatever, I don't care, we just pick
one) data to the client that's recording. This problem doesn't have to
be complicated. As for the "major implications"...
Post by Carsten Haitzler (The Rasterman)
there's a difference. when it's an internal detail it can be changed and
adapted to how the compositor and its rendering subsystem work. when it's a
protocol you HAVE to support THAT protocol and the way THAT protocol defines
things to work or apps break.
You STILL have to get the pixels into the encoder on the compositor
side. You will ALWAYS have to do that if you want to support video
captures, regardless of who's doing it. At some point you're going to
have to get the pixels you're rendering and hand them off to someone, be
that libavcodec or a privileged client.
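And that handoff is genuinely small. A sketch of the encoder side
using libavcodec's send/receive API (error handling omitted):

    #include <stdio.h>
    #include <libavcodec/avcodec.h>

    /* push one raw frame in, write out whatever compressed packets
     * come back */
    static void encode_frame(AVCodecContext *ctx, AVFrame *frame,
                             AVPacket *pkt, FILE *out)
    {
        avcodec_send_frame(ctx, frame);
        while (avcodec_receive_packet(ctx, pkt) == 0) {
            fwrite(pkt->data, 1, pkt->size, out);
            av_packet_unref(pkt);
        }
    }

Whether this function lives in the compositor or in a privileged
client, the work is the same; the only question is which side of the
protocol boundary it sits on.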
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
We can make Wayland support use-cases that are important to our users or
we can watch them stay on xorg perpetually and end up maintaining two
graphical stacks forever.
priorities. there are other issues that should be solved first before worrying
about the pandora's box ones.
These are not pandora's box. These are small, necessary features.

--
Drew DeVault
Carsten Haitzler (The Rasterman)
2016-03-28 14:03:00 UTC
Permalink
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
yes but you need permission and that is handled at kernel level on a
specific file. not so here. compositor runs as a specific user and so you
can't do that. you'd have to do in-compositor security client-by-client.
It is different, but we should still find a way to do it. After all,
we're going to be in a similar situation eventually where we're running
sandboxed applications and the compositor is granting rights from the
same level of privilege as the kernel provides to root users (in this
case, the role is almost that of a hypervisor and a guest).
should we? is it right to create yet another security model in userspace
"quickly" just to solve things that don't NEED solving, at least at this point.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
you wouldn't recreate ffmpeg. ffmpeg produces libraries like avcodec. like a
reasonable developer we'd just use their libraries to do the encoding - we'd
capture frames and then hand off to avcodec (ffmpeg) library routines to do
the rest. ffmpeg doesn't need to know how to capture - just to do what 99%
of its code is devoted to doing - encode/decode. :) that's rather simple.
already we have decoding wrapped - we sit on top of either gstreamer, vlc
or xine as the codec engine and just glue in output and control api's and
events. encoding is just the same but in reverse. :) the encapsulation is
simple.
True, most of the work is in avcodec. However, there's more to
it than that. The entire command line interface of ffmpeg would be
nearly impossible to build into the compositor effectively. With ffmpeg
I can capture x, flip it, paint it sepia, add a logo to the corner, and
mux it with my microphone and a capture of the speakers (thanks,
pulseaudio) and add a subtitle track while I'm at it. Read the ffmpeg
man pages. ffmpeg-all(1) is 23,191 lines long on my terminal (that's
just the command line interface, not avcodec). There's no way in hell
all of the compositors/DEs are going to be able to fulfill all of its
use cases, nor do I think we should be trying to.
Look at things like OBS. It lets you specify detailed encoding options
and composites a scene from multiple video sources and audio sources,
as well as letting the user switch between different scenes with
configurable transitions. It even lets you embed a web browser into the
final result! All of this with a nice GUI to top it off. Again, we can't
possibly hope to effectively implement all of this in the compositor/DE,
or the features of the other software that we haven't even thought of.
adding watermarks can be done after encoding as another pass (encode in high
quality). hell watermarks can just be a WINDOW (surface) on the screen. you
don't need options. as for audio - not too hard to do along with it. just
offer to record an input device - and choose (input can be current mixed output
or a mic ... or both).
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
the expectation is there won't be generic tools but desktop specific ones.
the CURRENT ecosystem of tools exist because that is the way x was designed
to work. thus the state of software matches its design. wayland is
different. thus tools and ecosystem will adapt.
That expectation is misguided. I like being able to write a script to
configure my desktop layout between several presets. Here's an example -
a while ago, I used a laptop at work that could be plugged into a
docking station. I would close the lid and use external displays at my
desk. I wanted to automatically change the screen layout when I came and
went, so I wrote a script that used xrandr to do it. It detected when
there were new outputs plugged in, then disabled the laptop screen and
enabled+configured the two new screens in the correct position and
resolution. This was easy for me to configure to behave the way I wanted
because the tooling was flexible and cross-desktop. Sure, we could make
each compositor support it, but each one is going to do it differently
and in their own subtly buggy ways and with their own subset of the
total possible features and use-cases, and none of them are going to
address every possible scenario.
exactly what you describe is how e works out of the box. no scripts needed.
requiring people to write scripts to do their screen configuration is just wrong.
taking the position of "well i give up and won't bother and will just make my
users write scripts instead" is sticking your head in the sand and not solving
the problem. you are now asking everyone ELSE who writes a compositor to
implement a protocol because YOU won't solve a problem that others have solved
in a user friendly manner.

i've been doing x11 wm's since 1996. i've seen the bad, the ugly and the
horrible. there is no way i want any kind of protocol for configuring the
screen. not after having seen just how much it is abused when it's there, and
what a horrible state things are left in.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
as for output config - why would the desktops that already have their own
tools then want to support OTHER tools too? their tools integrate with
their settings panels and look and feel right and support THEIR policies.
Base your desktop's tools on the common protocol, of course. Gnome
settings, KDE settings, arandr, xrandr, nvidia-settings, and so on, all
seem to work fine configuring your outputs with the same protocol today.
Yes, the protocol is meh and the implementation is a mess, but the
clients of that protocol aren't bad by any stretch of the imagination.
no tools. why do it? it's built in. in order for screen config "magic" to
work, a set of metadata is attached to screens. you can set priority (screens get
numbers from highest to lowest priority at any given time, allowing behaviour
like your "primary" screen to migrate to an external one and then migrate back
when the external monitor is detached etc.) sure we can start having that metadata
separate but then ALTERNATE TOOLS won't be able to configure it, thus breaking
the desktop environment by not providing metadata and other settings associated
with a display. this breaks functionality for users, who then complain about
things not working right, AND then the compositor has to deal with these
"error cases" too because a foreign tool will be messing with its data/setup.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
http://devs.enlightenment.org/~raster/ssetup.png
[snip]
This is a very interesting screenshot, and I hadn't considered this. I
don't think it's an unsolvable problem, though - we can make the
protocol flexible enough to allow compositor-specific metadata to be
added and configurable. These are the sorts of requirements I want to be
gathering to design this protocol with.
as above. i have seen screen configuration used and abused over the years, to
the point where i just do not want a protocol for messing around with it for any
client. give them an inch and they'll take a mile.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
no - we don't have to implement it as a protocol. enlightenment needs zero
protocol. it's done by the compositor. the compositors own tool is simply a
settings dialog inside the compositor itself. no protocol. not even a tool.
it's the same as edit/tools -> preferences in most gui apps. it's just a
dialog the app shows to configure itself.
I currently do several things in different processes/binaries that
enlightenment does in the compositor, things like the bar and the
wallpaper. I don't want to make an output configuration GUI tool nested
into the compositor; it's out of scope.
and that's perfectly fine - that is your choice. do not force your choice on
other compositors. you can implement all the protocol you want in any way you
want for your wm's tools.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
chances are gnome likely will do this via dbus (they love dbus :)). kde - i
don't know. but not everyone is implementing a wayland protocol at all so
assuming they are and saying "do it the same way" is not necessarily saving
any work.
We're all writing wayland compositors here. We may not all have dbus or
whatever else in common, but we do have the wayland protocol in common,
and it can support this use-case. It makes sense to use it.
gnome does almost everything with dbus. they love dbus. a lot of gnome is
centred around dbus. they likely will choose dbus to do this. likely. i
personally wouldn't choose to use dbus.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
then intents are only a way of deciding where a surface is to be displayed -
rather than on the current desktop/screen.
so simply mark a surface as "for presentation" and the compositor will put
it on the non-internal display (chosen maybe by physical size reported in
edid as the larger one, or by elimination - its on the screen OTHER than the
internal... maybe user simply marks/checkboxes that screen as "use this
screen for presenting" and all apps that want so present get their content
there etc.)
Man, this is going to get really complicated. How do you decide what
display is "internal" or not? What if the user wants to present on their
at least e already knows this. its screen management subsystem is perfectly
aware of this. :)
Post by Drew DeVault
primary display? What about applications that use the entire output for
the app can simply not request to present on their "presentation" screen... or
the user would mark their primary screen (internal on laptop maybe) AS their
presentation screen - more metadata to be held by compositor.

now ALL presentation tools behave the same - you don't have to reconfigure each
one separately and deal with the differences and lack or otherwise of features.
it's done in 1 place - compositor, and then all apps that want to do a
similar thing follow and work "as expected". far better than just ignoring the
issue. you yourself already talked about extra tags/hints/whatever - this is
one of those.
Post by Drew DeVault
things other than presentations? What if the application wants to use
several outputs, and for different purposes? What language are you going
to use to describe these settings to the user in a way that makes more
sense than the clients describing for themselves why they need to use a
particular output?
because this requires clients DEFINING screen layout. wayland was specifically
designed to HIDE THIS. if the compositor displayed a screen wrapped around a
sphere in real life in a room - then it doesn't have rectangles... how will an
app deal with that? what if the compositor is literally a VR world with
surfaces wrapped around spheres and cubes - the point of wayland's design was
to hide this info from clients completely so the compositor decides based on
environment, not each and every client. this was a basic premise/design in
wayland from the get go and it was a good one. letting apps break this
abstraction breaks this design.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
so what you are saying is it's better to duplicate all this logic of screen
configuration inside every app that wants to present things (media players -
play movie on presentation screen, ppt/impress/whatever show presentation
there, etc. etc.) and how to configure the screen etc. etc., rather than
have a simple tag/intent and let your de/wm/compositor "deal with it"
universally for all such apps in a consistent way?
No. Applications want to be full screen or they don't want to be. If
they want to pick a particular output, we can easily let them do so.
i don't know about you.. but fullscreen to enlightenment means you use up ONE
SCREEN. not all screens. and from user response.. they LOVE IT. it is correct.
it's the right way. so when an app asks to be fullscreen it gets to use the
screen it's on - not all. so no. fullscreen does NOT mean they would want to span
all screens (you imply that) and then just draw different areas of their
massive window to correspond to screens (and control those screens,
resolutions, geometries etc.).

what makes sense is an app hints at the purpose of its window and opens n
windows (surfaces). it can ask for fullscreen for each. the hints would allow
the compositor to choose which screen the window/surface is assigned to.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Cool. Suggestions for what sort of capability this protocol should
have, what kind of surface roles we will be looking at? We should
consider a few things. Normal windows, of course, which on compositors
like Sway would be tiled. Then there's floating windows, like
ummm what's the difference between floating and normal? apps like gnome
calculator just open ... normal windows.
Gnome calculator doesn't like being tiled: https://sr.ht/Ai5N.png
i think the problem is you are not handling min/max sizing of clients
properly. :) you need to fix sway. gnome calculator is not sizing up its buffer
on surface size. that is a message "i can't be bigger than this - this is my
biggest size. deal with it". you need to deal with it. eg - pad it and make it
sized AT the buffer size :)
Post by Drew DeVault
There are probably some other applications that would very much like to
be shown at a particular aspect ratio or resolution.
as above. buffer size tells you that.
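i.e. something like this on the compositor side (all the types and
names here are made up - the point is the policy, not the api):

    /* hypothetical compositor internals: the client committed a buffer
     * smaller than the tile we assigned, so treat the buffer size as a
     * hard maximum and letterbox the remainder */
    static void apply_size_constraint(struct view *view)
    {
        if (view->buffer_w < view->tile_w || view->buffer_h < view->tile_h) {
            view->w = view->buffer_w;
            view->h = view->buffer_h;
            center_view_in_tile(view); /* pad with background */
        }
    }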
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
xdg shell should be handling these already - except dmenu. dmenu is almost a
special desktop component. like a shelf/panel/bar thing.
dmenu isn't the only one, though, that may want to arrange itself in
special ways. Lemonbar and rofi also come to mind.
all of these basically are "desktop components" ala
taskbars/shelves/panels/whatever - i know that for e we don't want to support
such apps. these are built in. i don't know what gnome or kde think but these
go against their design as an integrated desktop environment. YOU need these
because your compositor has no such feature itself. the bigger desktops don't
need it. they MAY support it - may not. i know i don't want to. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
[input is] something that many of Sway's users are asking for.
they are going to have to deal with this then. already gnome and kde and e
will all configure mouse accel/left/right mouse on their own based on
settings. yes
- i can RUN xset and set it back later but it's FIGHTING with your DE.
wayland is the same. use the desktop tools for this :) yes - it'll change
between compositors. :) at least in wayland you can't fight with the
compositor here. for sway - you are going to have to write this yourself.
eg - write tools that talk to sway or sway reads a cfg file you edit or
whatever. :)
I've already written this into sway, fwiw, in your config file. I think
this is fine, too, and I intend to keep supporting configuring outputs
like that. But consider the use case of Krita, or video games like Osu!
i don't know osu - but i see no reason krita needs to configure a tablet. it
can just deal with input from it. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
However, beyond detailed input device configuration, there are some
other things that we should consider. Some applications (games, vnc,
etc) will want to capture the mouse and there should be a protocol for
them to indicate this with (perhaps again associated with special
permissions). Some applications (like Krita) may want to do things like
take control of your entire drawing tablet.
as i said. can of worms. :)
It's a can of worms we should deal with, and one that I don't think is
hard to deal with. libinput lets you configure a handful of details
about input devices. Let's expose these things in a protocol.
input is very sensitive. having done this for years and watched how games like
to turn off key repeat then leave it off when they crash... or change mouse
accel then you find it's changed everywhere and have to "fix it" etc. etc. - i'd
be loath to do this. give them TOO much config ability and it can become a
security issue.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
you have no idea how many non-security-sensitive things need fixing first
before addressing the can-of-worms problems. hell nvidia just released
drivers that require compositors to re-do how they talk to egl/kms/drm to
work that's not compatible with existing drm dmabuf buffers etc. etc.
Why do those things need to be dealt with first? Sway is at a good spot
where I can start thinking about these sorts of things. There are
enough people involved to work on multiple things at once. Plus,
everyone thinks nvidia's design is bad and we're hopefully going to see
something from them that avoids vendor-specific code.
because these imho are far more important. you might be surprised at how few
people are involved.
Post by Drew DeVault
I don't see these problems as a can of worms. I see them as problems
that are solvable and necessary to solve, and now is a good time to
solve them. My compositor is coming up on version 1.0. Supporting the
APIs is the driver's problem; we've described the spec and as soon as
they implement it, it will Just Work(tm).
Post by Carsten Haitzler (The Rasterman)
even clients and decorations. tiled wm's will not want clients to add
decorations with shadows etc. - currently clients will do csd because csd is
what weston chose and gnome has followed and enlightenment too. kde do not
want to do csd. i think that's wrong.
What is a can of worms is the argument over whether or not we should use
CSD or SSD. I fall in the latter camp, but I don't think we need to
fight over it now. We should be able to agree that a protocol for
negotiating whether or not borders are drawn would be reasonable. Is it
a GTK app that does nothing interesting with its titlebar? Well, if the
compositor wants to draw its borders, then let it do so. Does it do
fancy GTK stuff with the borders? Well, no, mister compositor, I want to
do fancy things. Easy enough.
not so simple. with more of the ui of an app being moved INTO the border
(titlebar etc.), it is not a simple thing to just turn off. you then turn
OFF necessary parts of the ui or have to push the problem out to the app to
"fallback". only having CSD solves all that complexity and is more efficient
than SSD when it comes to things like assigning hw layers or avoiding copies of
vast amounts of pixels. i was against CSD to start with too but i see their
major benefits.

of course the shadow padding area is something i do see as optional and
something to hint at that would be useful. i can't see gnome dropping CSD
especially given how integrated into the ui it's becoming. i can tell you that i'm
strongly considering going the same way and fully integrating into CSD for many
good reasons that go far beyond just a desktop.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
it adds complexity to wayland just to "not follow the convention". but
for tiling i see the point of at least removing the shadows. clients
may choose to slap a title bar there still because it's useful
for displaying state. but advertising this info from the compositor is not
standardized. what do you advertise to clients? where/when? at connect
time? at surface creation time? what negotiation is it? it easily
could be that 1 screen or desktop is tiled and another is not and you
don't know what to tell the client until it has created a surface and
you know where that surface would go. perhaps this might be part of a
larger set of negotiation like "i am a mobile app so please stick me
on the mobile screen" or "i'm a desktop app - desktop please" then
with the compositor saying where it decided to allocate you (no mobile
screen available - you are on desktop) and app is expected to adapt...
In Wayland you create a surface, then assign it a role. Extra details
can go in between, or go in the call that gives it a role. Right now
most applications are creating their surface and then making it a shell
surface. The compositor can negotiate based on its own internal state
over whether a given output is tiled or not, or in cases like AwesomeWM,
whether a given workspace is tiled or not. And I don't think the
decision has to be final. If the window is moved to another output or
really if any of the circumstances change, they can renegotiate and the
surface can start drawing its own decorations.
yup. but this signalling/negotiation has to exist. currently it doesn't. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
there's SIMPLE stuff like - what happens when compositor crashes? how do we
handle this? do you really want to lose all your apps when compositors
crash? what should clients do? how do we ensure clients are restored to the
same place and state? crash recovery is important because it is always what
allows updates/upgrades without losing everything. THIS stuff is still
"unsolved". i'm totally not concerned about screen casting or vnc etc. etc.
until all of these other nigglies are well solved first.
I'm still not on board with all of this "first" stuff. I don't see any
reason why we have to order ourselves like this. It all needs to get
done at some point. Right now we haven't standardized anything, and each
compositor is using its own unique, incompatible way of taking
screenshots and recording videos, and each is probably introducing some
kind of security problem.
you aren't going to talk me into implementing something that is important for
you and not a priority for e until such a time as i'm satisfied that the other
issues are solved. you are free to do what you want, but standardizing things
takes a looong time and a lot of experimentation, discussion, and repeating
this. we have resources on wayland and nothing you described is a priority for
them. there are far more important things to do that are actual business
requirements and so the people working need to prioritize what is such a
requirement as opposed to what is not. resources are not infinite and free.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
apps can show their own content for their own bug reporting. for system-wide
reporting this will be DE integrated anyway. supporting video capture is a
can of worms. as i said - single buffer? multiple with metadata? who does
conversion/scaling/transforms? what is the security model? and as i said -
this has major implications for the rendering back-end of a compositor.
The compositor hands RGBA (or ARGB, whatever, I don't care, we just pick
one) data to the client that's recording. This problem doesn't have to
be complicated. As for the "major implications"...
let me complicate it for you. let's say i'm playing a video fullscreen. you now
have to convert argb to yuv then encode when it would have been far more
efficient to get access directly to the yuv buffer before it was even scaled to
screen size... :) so you have just specified a protocol that is by design
inefficient when it could be more efficient.
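to make that cost concrete - an argb-only protocol bakes a per-frame
pass like this into every recording (real libswscale api; the formats
are just the common case):

    #include <stdint.h>
    #include <libswscale/swscale.h>

    /* in real code you'd cache the context rather than remake it per
     * frame */
    static void argb_to_yuv420(int w, int h,
            const uint8_t *const src[4], const int src_stride[4],
            uint8_t *const dst[4], const int dst_stride[4])
    {
        struct SwsContext *sws = sws_getContext(
                w, h, AV_PIX_FMT_BGRA,     /* what the protocol hands over */
                w, h, AV_PIX_FMT_YUV420P,  /* what most encoders want */
                SWS_BILINEAR, NULL, NULL, NULL);
        sws_scale(sws, src, src_stride, 0, h, dst, dst_stride);
        sws_freeContext(sws);
    }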
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
there's a difference. when it's an internal detail it can be changed and
adapted to how the compositor and its rendering subsystem work. when it's a
protocol you HAVE to support THAT protocol and the way THAT protocol defines
things to work or apps break.
You STILL have to get the pixels into the encoder on the compositor
side. You will ALWAYS have to do that if you want to support video
captures, regardless of who's doing it. At some point you're going to
have to get the pixels you're rendering and hand them off to someone, be
that libavcodec or a privileged client.
yes - but when, how often and via what mechanisms pixels get there is a very
delicate thing.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
We can make Wayland support use-cases that are important to our users or
we can watch them stay on xorg perpetually and end up maintaining two
graphical stacks forever.
priorities. there are other issues that should be solved first before
worrying about the pandora's box ones.
These are not pandora's box. These are small, necessary features.
i disagree. i've been doing graphics for long enough to smell the nasties from
a mile off. it's not trivial. the decisions that are made now will haunt us
for a lifetime. they are not internal details that can be fixed easily. even
internal details are hard to fix once enough code relies on them...

so far we don't exactly have a lot of inter-desktop co-operation happening.
it's pretty much everyone for themselves except for a smallish core protocol.
do NOT try and solve security sensitive AND performance sensitive AND design
limiting/dictating things first and definitely don't do it without everyone on
the same page.
--
------------- Codito, ergo sum - "I code, therefore I am" --------------
The Rasterman (Carsten Haitzler) ***@rasterman.com
Drew DeVault
2016-03-28 14:55:05 UTC
Permalink
Post by Carsten Haitzler (The Rasterman)
should we? is it right to create yet another security model in userspace
"quickly" just to solve things that don't NEED solving, at least at this point.
I don't think that the protocol proposed in other branches of this
thread is complex or short-sighted. Can you hop on that branch and
provide feedback?
Post by Carsten Haitzler (The Rasterman)
adding watermarks can be done after encoding as another pass (encode in high
quality). hell watermarks can just be a WINDOW (surface) on the screen. you
don't need options. as for audio - not too hard to do along with it. just
offer to record an input device - and choose (input can be current mixed output
or a mic ... or both).
You're still not grasping the scope of this. I want you to run this
command right now:

man ffmpeg-all

Just read it for a while. You're delusional if you think you can
feasibly implement all of these features in the compositor. Do you
honestly want your screen capture tool to be able to add a watermark?
How about live streaming? Some people add a sort of extra UI to read off
donations and such. The scope of your screen capture tool is increasing
at an alarming rate if you intend to support all of the features
currently possible with ffmpeg. How about instead we make a simple
wayland protocol extension that we can integrate with ffmpeg and OBS and
imagemagick and so on in a single C file.
Post by Carsten Haitzler (The Rasterman)
exactly what you describe is how e works out of the box. no scripts needed.
requiring people to write scripts to do their screen configuration is just wrong.
taking the position of "well i give up and won't bother and will just make my
users write scripts instead" is sticking your head in the sand and not solving
the problem. you are now asking everyone ELSE who writes a compositor to
implement a protocol because YOU won't solve a problem that others have solved
in a user friendly manner.
What if I want my laptop display to remain usable? Right now I'm docked
somewhere else and I actually do have this scenario - my laptop is one
of my working displays. How would I configure the difference between
these situations in your tool? What if I'm on a laptop with poorly
supported hardware (I've seen this before) where there's a limit on how
many outputs I can use at once? What if I want to write a script where I
put on a movie and it disables every output but my TV automatically? The
user is losing a lot of power here and there's no way you can satisfy
everyone's needs unless you make it programmable.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Base your desktop's tools on the common protocol, of course. Gnome
settings, KDE settings, arandr, xrandr, nvidia-settings, and so on, all
seem to work fine configuring your outputs with the same protocol today.
Yes, the protocol is meh and the implementation is a mess, but the
clients of that protocol aren't bad by any stretch of the imagination.
no tools. why do it? it's built in. in order for screen config "magic" to
work, a set of metadata is attached to screens. you can set priority (screens get
numbers from highest to lowest priority at any given time, allowing behaviour
like your "primary" screen to migrate to an external one and then migrate back
when the external monitor is detached etc.) sure we can start having that metadata
separate but then ALTERNATE TOOLS won't be able to configure it, thus breaking
the desktop environment by not providing metadata and other settings associated
with a display. this breaks functionality for users, who then complain about
things not working right, AND then the compositor has to deal with these
"error cases" too because a foreign tool will be messing with its data/setup.
Your example has a pretty straightforward baseline - the "default"
profile. Even so, we can design the protocol to make the custom metadata
options visible to the tools, and the tools can then provide the user
with options to configure that as well.
Post by Carsten Haitzler (The Rasterman)
as above. i have seen screen configuration used and abused over the years, to
the point where i just do not want a protocol for messing around with it for any
client. give them an inch and they'll take a mile.
Let them take a mile. _I_ want a mile. Here's an old quote that I think
is always relevant:

UNIX was not designed to stop its users from doing stupid things, as
that would also stop them from doing clever things.
Post by Carsten Haitzler (The Rasterman)
and that's perfectly fine - that is your choice. do not force your choice on
other compositors. you can implement all the protocol you want in any way you
want for your wm's tools.
Why do we have to be disjointed? We have a common set of problems and we
should strive for a common set of solutions.
Post by Carsten Haitzler (The Rasterman)
gnome does almost everything with dbus. they love dbus. a lot of gnome is
centred around dbus. they likely will choose dbus to do this. likely. i
personally wouldn't choose to use dbus.
Let's not speak for Gnome. They're copied on this thread, they'll speak
for themselves.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
primary display? What about applications that use the entire output for
the app can simply not request to present on their "presentation" screen... or
the user would mark their primary screen (internal on laptop maybe) AS their
presentation screen - more metadata to be held by compositor.
Then we're back to the very thing you were criticising before - making
the applications implement some sort of switch between using a
"presentation" output and using some other kind of output. It would be a
lot less complicated if the application asked to go full screen and the
compositor said "hey, this app wants to be full screen, which output
would you like to put it on?"
Post by Carsten Haitzler (The Rasterman)
now ALL presentation tools behave the same - you don't have to reconfigure each
one separately and deal with the differences and lack or otherwise of features.
it's done in 1 place - compositor, and then all apps that want to do a
similar thing follow and work "as expected". far better than just ignoring the
issue. you yourself already talked about extra tags/hints/whatever - this is
one of those.
I think I'm getting at something here. Does the workflow I just
described satisfy everyone's needs for this?
Post by Carsten Haitzler (The Rasterman)
because this requires clients DEFINING screen layout. wayland was specifically
designed to HIDE THIS. if the compositor displayed a screen wrapped around a
sphere in real life in a room - then it doesn't have rectangles... how will an
app deal with that? what if the compositor is literally a VR world with
surfaces wrapped around spheres and cubes - the point of wayland's design was
to hide this info from clients completely so the compositor decides based on
environment, not each and every client. this was a basic premise/design in
wayland from the get go and it was a good one. letting apps break this
abstraction breaks this design.
In practice the VAST majority of our users are going to be using one or
more rectangular displays. We shouldn't cripple what they can do for the
sake of the niche. We can support both - why do we have to hide
information about the type of outputs in use from the clients? It
doesn't make sense for an app to get fullscreened in a virtual reality
compositor, yet we still support that. Rather than shoehorning every
design to meet the least common denominator, we should be flexible.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
No. Applications want to be full screen or they don't want to be. If
they want to pick a particular output, we can easily let them do so.
i don't know about you.. but fullscreen to enlightenment means you use up ONE
SCREEN. [snip]
I never said that fullscreen means multiple screens. No clue where
that's coming from.
Post by Carsten Haitzler (The Rasterman)
what makes sense is an app hints at the purpose of its window and opens n
windows (surfaces). it can ask for fullscreen for each. the hints would allow
the compositor to choose which screen the window/surface is assigned to.
Hinting doesn't and cannot capture all of the use cases. Just letting
the client say what it wants does.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Gnome calculator doesn't like being tiled: https://sr.ht/Ai5N.png
i think the problem is you are not handling min/max sizing of clients
properly. :) you need to fix sway. gnome calculator is not sizing up its buffer
on surface size. that is a message "i can't be bigger than this - this is my
biggest size. deal with it". you need to deal with it. eg - pad it and make it
sized AT the buffer size :)
This is harmful to tiling window managers in general. The window manager
arranges the windows, not the other way around. You can't have tiling
window management if you can't have the compositor tell the clients what
size to be. There's currently no metadata to tell the compositor that a
surface is strict about its geometry. Most applications handle being
given a size quite well and will rearrange/rerender itself to
compensate. Things like gnome-calculator are the exception, not the
rule.
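The mechanism for this already exists, for the record: the compositor
proposes a size in a configure event, and a well-behaved client redraws
at that size. Roughly, against the unstable xdg-shell bindings of the
day (exact signatures vary between protocol revisions):

    #include <wayland-client.h>
    #include "xdg-shell-client-protocol.h" /* unstable, pre-1.0 */

    extern void resize_and_redraw(int width, int height); /* app code */

    static void handle_configure(void *data, struct xdg_surface *surface,
            int32_t width, int32_t height, struct wl_array *states,
            uint32_t serial)
    {
        if (width > 0 && height > 0)
            resize_and_redraw(width, height);
        xdg_surface_ack_configure(surface, serial);
    }

What's missing is only the metadata for the exceptions - a way for the
rare gnome-calculator-style client to say "my geometry is strict".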
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
xdg shell should be handling these already - except dmenu. dmenu is almost a
special desktop component. like a shelf/panel/bar thing.
dmenu isn't the only one, though, that may want to arrange itself in
special ways. Lemonbar and rofi also come to mind.
all of these basically are "desktop components" ala
taskbars/shelves/panels/whatever - i know that for e we don't want to support
such apps. these are built in. i don't know what gnome or kde think but these
go against their design as an integrated desktop environment. YOU need these
because your compositor has no such feature itself. the bigger desktops don't
need it. they MAY support it - may not. i know i don't want to. :)
Users should be free to choose the tools they want. dmenu is much more
flexible and scriptable than anything any of the DEs offer in its place
- you just pipe in a list of things and the user picks one. Don't be
fooled into thinking that whatever your DE does for a given feature is
the mecca of that feature. Like you were saying to make other points -
there are fewer contributors to each DE than you might imagine. DEs are
spread too thin to make the perfect _everything_. But some projects like
dmenu are small and singular in their focus, and maintained by one or
two people who put in a much larger amount of effort than is put in by
DE contributors on the corresponding features of that DE.

Be flexible enough for users to pick the tools they want.
Post by Carsten Haitzler (The Rasterman)
i don't know osu - but i see no reason krita needs to configure a tablet. it
can just deal with input from it. :)
input is very sensitive. having done this for years and watched how games like
to turn off key repeat then leave it off when they crash... or change mouse
accel then you find it's changed everywhere and have to "fix it" etc. etc. - i'd
be loath to do this. give them TOO much config ability and it can become a
security issue.
Let's change the tone of the input configuration discussion. I've come
around to your points about providing input configuration in general to
clients, let's not do that. I think the only issue we should worry about
for input at this point is fixing the pointer-constraints protocol to
use our new permissions model.
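Something small would do here. A sketch with invented names -
zext_permissions_v1 doesn't exist, this is just the shape I have in
mind:

    #include <wayland-client.h>
    #include "zext-permissions-v1-client-protocol.h" /* hypothetical */

    /* client asks once; the compositor prompts the user or consults
     * stored policy, then answers with an event */
    static void request_pointer_capture(struct zext_permissions_v1 *perms)
    {
        zext_permissions_v1_request(perms, "pointer-constraints");
    }

    static void handle_decision(void *data,
            struct zext_permissions_v1 *perms,
            const char *capability, uint32_t granted)
    {
        /* only create the locked_pointer once granted is nonzero */
    }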
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Why do those things need to be dealt with first? Sway is at a good spot
where I can start thinking about these sorts of things. There are
enough people involved to work on multiple things at once. Plus,
everyone thinks nvidia's design is bad and we're hopefully going to see
something from them that avoids vendor-specific code.
because these imho are far more important. you might be surprised at how few
people are involved.
These features have to get done at some point. Backlog your
implementation of these protocols if you can't work on it now.
Post by Carsten Haitzler (The Rasterman)
not so simple. with more of the ui of an app being moved INTO the border
(titlebar etc.) this is not a simple thing to just turn off. you then turn
OFF necessary parts of the ui or have to push the problem out to the app to
"fallback".
You misunderstand me. I'm not suggesting that these apps be crippled.
I'm suggesting that, during the negotiation, they _object_ to having the
server draw their decorations. Then other apps that don't care can say
so.
Post by Carsten Haitzler (The Rasterman)
only having CSD solves all that complexity and is more efficient
than SSD when it comes to things like assigning hw layers or avoiding copies of
vast amounts of pixels. i was against CSD to start with too but i see their
major benefits.
I don't want to rehash this old argument here. There's two sides to this
coin. I think everyone fully understands the other position. It's not
hard to reach a compromise on this.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
In Wayland you create a surface, then assign it a role. Extra details
can go in between, or go in the call that gives it a role. Right now
most applications are creating their surface and then making it a shell
surface. The compositor can negotiate based on its own internal state
over whether a given output is tiled or not, or in cases like AwesomeWM,
whether a given workspace is tiled or not. And I don't think the
decision has to be final. If the window is moved to another output or
really if any of the circumstances change, they can renegotiate and the
surface can start drawing its own decorations.
yup. but this signalling/negotiation has to exist. currently it doesn't. :)
We'll make this part of the protocols we're working on here :)
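For reference, the create-then-assign ordering looks like this on the client
side today - a sketch using xdg_shell unstable v5 naming, assuming the
compositor and shell globals are already bound from the registry:

  /* create a surface, then give it a role (xdg_shell unstable v5) */
  struct wl_surface *surface =
      wl_compositor_create_surface(compositor);   /* surface, no role yet */
  struct xdg_surface *xsurf =
      xdg_shell_get_xdg_surface(shell, surface);  /* role: shell surface */
  /* any decoration/tiling negotiation would slot in between these two
   * calls, or ride along on the role-assigning request */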
Post by Carsten Haitzler (The Rasterman)
you aren't going to talk me into implementing something that is important for
you and not a priority for e until such a time as i'm satisfied that the other
issues are solved. you are free to do what you want, but standardizing things
takes a looong time and a lot of experimentation, discussion, and repeating
this. we have resources on wayland and nothing you described is a priority for
them. there are far more important things to do that are actual business
requirements and so the people working need to prioritize what is such a
requirement as opposed to what is not. resources are not infinite and free.
Like I said before, put it on your backlog. I'm doing it now, and I want
your input on it. Provide feedback now and implement later if you need
to, but if you don't then the protocols won't meet your needs.
Post by Carsten Haitzler (The Rasterman)
let me complicate it for you. let's say i'm playing a video fullscreen. you now
have to convert argb to yuv then encode when it would have been far more
efficient to get access directly to the yuv buffer before it was even scaled to
screen size... :) so you have just specified a protocol that is by design
inefficient when it could be more efficient.
What, do you expect to tell libavcodec to switch pixel formats
mid-recording? No one is recording their screen all the time. Yeah, you
might hit performance issues. So be it. It may not be ideal but it'll
likely be well within the limits of reason.
Post by Carsten Haitzler (The Rasterman)
yes - but when, how often and via what mechanisms pixels get there is a very
delicate thing.
And yet you still need to convert the entire screen to a frame and feed
it into an encoder, no matter what. Feed the frame to a client instead.
Post by Carsten Haitzler (The Rasterman)
so far we don't exactly have a lot of inter-desktop co-operation happening.
it's pretty much everyone for themselves except for a smallish core protocol.
Which is ridiculous.
Post by Carsten Haitzler (The Rasterman)
do NOT try and solve security sensitive AND performance sensitive AND design
limiting/dictating things first and definitely don't do it without everyone on
the same page.
I'm here to get everyone on the same page. Get on it.

--
Drew DeVault
Drew DeVault
2016-03-28 17:44:42 UTC
Permalink
If you want to add additional stuff on top of a live stream, use
something with a programmable pipeline that can add effects to the
stream coming from the compositor. Why do we need negotiation, or user
interaction, or exchange of metadata for this stuff?
The stream isn't coming from the compositor. That's the point. It needs
to be. However, providing programmable access to that stream is a
security concern, so it should be given only to certain privileged
clients.

--
Drew DeVault
Carsten Haitzler (The Rasterman)
2016-03-29 02:31:01 UTC
Permalink
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
should we? is it right to create yet another security model in userspace
"quickly" just to solve things that don't NEED solving, at least at this point.
I don't think that the protocol proposed in other branches of this
thread is complex or short sighted. Can you hop on that branch and
provide feedback?
my take on it is that it's premature and not needed at this point. in fact i
wouldn't implement a protocol at all. *IF* i were to allow special access, i'd
simply require forking the process directly from the compositor and providing
a socketpair fd to this process; THAT fd could have extra capabilities
attached to the wl protocol. i would do nothing else because as a compositor i
cannot be sure what i am executing. i'd hand the choice of whether to execute
this tool over to the user to ok, and not just blindly execute anything i like.
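to sketch what i mean (libwayland-server's wl_client_create() is real; the
whitelisted path and the "privileged" bookkeeping are assumptions, not
existing api):

  #include <sys/socket.h>
  #include <fcntl.h>
  #include <stdio.h>
  #include <stdlib.h>
  #include <unistd.h>
  #include <wayland-server.h>

  static struct wl_client *spawn_privileged(struct wl_display *display,
                                            const char *path) /* whitelisted */
  {
      int fds[2];
      if (socketpair(AF_UNIX, SOCK_STREAM | SOCK_CLOEXEC, 0, fds) < 0)
          return NULL;
      if (fork() == 0) {
          char num[16];
          fcntl(fds[1], F_SETFD, 0);         /* let the child's end survive exec */
          snprintf(num, sizeof(num), "%d", fds[1]);
          setenv("WAYLAND_SOCKET", num, 1);  /* libwayland-client picks this up */
          execl(path, path, (char *)NULL);
          _exit(1);
      }
      close(fds[1]);
      /* only this wl_client would be allowed to bind the privileged globals */
      return wl_client_create(display, fds[0]);
  }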
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
adding watermarks can be done after encoding as another pass (encode in high
quality). hell watermarks can just be a WINDOW (surface) on the screen. you
don't need options. as for audio - not too hard to do along with it. just
offer to record an input device - and choose (input can be current mixed
output or a mic ... or both).
You're still not grasping the scope of this. I want you to run this:
man ffmpeg-all
Just read it for a while. You're delusional if you think you can
feasibly implement all of these features in the compositor. Do you
all a compositor has to do is be able to capture a video stream to a file. you
can ADD watermarking, sepia, and other effects later on in a video editor. next
you'll tell me gimp is incapable of editing image files so we need programmatic
access to a digital camera's ccd to implement effects/watermarking etc. on
photos...
Post by Drew DeVault
honestly want your screen capture tool to be able to add a watermark?
no - this can be done in a video editing tool later on. just record video at
high quality so degradation is not an issue.
Post by Drew DeVault
How about live streaming, some people add a sort of extra UI to read off
donations and such. The scope of your screen capture tool is increasing
at an alarming rate if you intend to support all of the features
no. i actually did not increase the scope. i kept it simple to "compositor can
write a file". everything else can be done in a post-processing task. that file
may include captured audio at the same time from a specific audio input.
Post by Drew DeVault
currently possible with ffmpeg. How about instead we make a simple
wayland protocol extension that we can integrate with ffmpeg and OBS and
imagemagick and so on in a single C file.
i'm repeating myself. there are bigger fish to fry.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
exactly what you describe is how e works out of the box. no scripts needed.
requiring people write script to do their screen configuration is just
wrong. taking the position of "well i give up and won't bother and will
just make my users write scripts instead" is sticking your head in the
sand and not solving the problem. you are now asking everyone ELSE who
writes a compositor to implement a protocol because YOU won't solve a
problem that others have solved in a user friendly manner.
What if I want my laptop display to remain usable? Right now I'm docked
eh? ummm that is what happens - unless you close the lid, then internal display
is "disconnected".
Post by Drew DeVault
somewhere else and I actually do have this scenario - my laptop is one
of my working displays. How would I configure the difference between
these situations in your tool? What if I'm on a laptop with poorly
supported hardware (I've seen this before) where there's a limit on how
many outputs I can use at once? What if I want to write a script where I
put on a movie and it disables every output but my TV automatically? The
user is losing a lot of power here and there's no way you can satisfy
everyone's needs unless you make it programmable.
not true. this can be encapsulated without it being programmable. i have yet to
find a laptop that cannot run all its outputs, but the general limitation can
be accounted for - eg via prioritization. if you have 4 outputs and only 3 can
work at a time - then choose the 3 with the highest priority - adjust priority
of screens to have what you want.
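that policy is a few lines to sketch - assuming a compositor-internal output
list (struct output and set_output_enabled() here are hypothetical):

  #include <stdlib.h>

  struct output { const char *name; int priority; };
  void set_output_enabled(struct output *o, int on); /* hypothetical */

  static int by_priority_desc(const void *a, const void *b)
  {
      const struct output *oa = *(struct output *const *)a;
      const struct output *ob = *(struct output *const *)b;
      return ob->priority - oa->priority;
  }

  /* enable the max_active highest-priority outputs, disable the rest */
  void apply_output_limit(struct output **outs, int n, int max_active)
  {
      qsort(outs, n, sizeof(*outs), by_priority_desc);
      for (int i = 0; i < n; i++)
          set_output_enabled(outs[i], i < max_active);
  }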
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Base your desktop's tools on the common protocol, of course. Gnome
settings, KDE settings, arandr, xrandr, nvidia-settings, and so on, all
seem to work fine configuring your outputs with the same protocol today.
Yes, the protocol is meh and the implementation is a mess, but the
clients of that protocol aren't bad by any stretch of the imagination.
no tools. why do it? it's built in. in order for screen config "magic" to
work you need a set of metadata attached to screens. you can set priority
(screens get numbers from highest to lowest priority at any given time,
allowing behaviour like your "primary" screen migrating to an external one
then migrating back when the external monitor is detached etc.) sure, we
could start having that metadata separate, but then ALTERNATE TOOLS won't
be able to configure it, thus breaking the desktop environment by not
providing the metadata and other settings associated with a display. this
breaks functionality for users, who then complain about things not working
right, AND the compositor now has to deal with these "error cases" too
because a foreign tool will be messing with its data/setup.
Your example has a pretty straightforward baseline - the "default"
profile. Even so, we can design the protocol to make the custom metadata
options visible to the tools, and the tools can then provide the user
with options to configure that as well.
a protocol with undefined metadata is not a good protocol. it now passes blobs
of data that are opaque except to specific implementations. this will mean
that other implementations eventually will do things like strip it out or
damage it, as they don't know what it is, nor do they care.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
as above. i have seen screen configuration used and abused over the years
where i just do not want to have a protocol for messing around with it for
any client. give them an inch and they'll take a mile.
Let them take a mile. _I_ want a mile. Here's an old quote that I think
is apt: "UNIX was not designed to stop its users from doing stupid things,
as that would also stop them from doing clever things."
but it isn't the user - it's some game you download that you cannot alter the
code or behaviour of that then messes everything up because its creator only
ever had a single monitor and didn't account for those with 2 or 3.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
and that's perfectly fine - that is your choice. do not force your choice on
other compositors. you can implement all the protocol you want in any way
you want for your wm's tools.
Why do we have to be disjointed? We have a common set of problems and we
should strive for a common set of solutions.
because things like output configuration i do not see as needing a common
protocol. in fact it's desirable to not have one at all so it cannot be abused
or cause trouble.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
gnome does almost everything with dbus. they love dbus. a lot of gnome is
centred around dbus. they likely will choose dbus to do this. likely. i
personally wouldn't choose to use dbus.
Let's not speak for Gnome. They're copied on this thread, they'll speak
for themselves.
my point is that not everyone chooses the same solution as you. not everyone
has the same problem and needs to solve it or WANTS to solve it the same way.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
primary display? What about applications that use the entire output for
the app can simply not request to present on their "presentation" screen...
or the user would mark their primary screen (internal on laptop maybe) AS
their presentation screen - more metadata to be held by compositor.
Then we're back to the very thing you were criticising before - making
the applications implement some sort of switch between using a
"presentation" output and using some other kind of output. It would be a
lot less complicated if the application asked to go full screen and the
compositor said "hey, this app wants to be full screen, which output
would you like to put it on?"
that needs ZERO protocol extending. there already is a fullscreen request in
xdg shell. this is a compositor implementation detail if all you want to do is
ask the user where to place the fullscreen window. if you want to open multiple
windows and have them on the most appropriate screen by default without asking
the user, then you need a little metadata. asking the app to explicitly define
the output simply means you now have N possible ways this could work depending
on each and every app. leave it to the compositor to decide along with hints
that tell the compositor the likely usage purpose of the window. a user can
always move it somewhere else via the compositor (hotkey, alt+left mouse drag
to somewhere else or some other mechanism).

but we are talking about things like output control/configuration - why does a
presentation app need this control? control the actual setup of the output or
even explicitly define exactly what output (by name, id, number, etc.) to go
for? why does an app need to be able to target a specific output
programmatically rather than simply give the intent/purpose of the
surface/window?
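for reference, the existing request i mean - a hedged sketch with xdg_shell
unstable v5 naming (newer revisions hang this off xdg_toplevel), where
"xdgsurf" and "my_output" are placeholders:

  xdg_surface_set_fullscreen(xdgsurf, NULL);      /* compositor picks the
                                                     output from context */
  xdg_surface_set_fullscreen(xdgsurf, my_output); /* client dictates the
                                                     output - the hard-coding
                                                     i'm arguing against */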
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
now ALL presentation tools behave the same - you don't have to reconfigure
each one separately and deal with the difference and lack or otherwise of
features. it's done in 1 place - compositor, and then all apps that want to
do a similar thing follow and work "as expected". far better than just
ignoring the issue. you yourself already talked about extra
tags/hints/whatever - this is one of those.
I think I'm getting at something here. Does the workflow I just
described satisfy everyone's needs for this?
Post by Carsten Haitzler (The Rasterman)
because this requires clients DEFINING screen layout. wayland was
specifically designed to HIDE THIS. if the compositor displayed a screen
wrapped around a sphere in real life in a room - then it doesn't have
rectangles... how will an app deal with that? what if the compositor is
literally a VR world with surfaces wrapped around spheres and cubes - the
point of wayland's design was to hide this info from clients completely so
the compositor decides based on environment, not each and every client.
this was a basic premise/design in wayland from the get go and it was a
good one. letting apps break this abstraction breaks this design.
In practice the VAST majority of our users are going to be using one or
more rectangular displays. We shouldn't cripple what they can do for the
sake of the niche. We can support both - why do we have to hide
information about the type of outputs in use from the clients? It
doesn't make sense for an app to get fullscreened in a virtual reality
compositor, yet we still support that. Rather than shoehorning every
design to meet the least common denominator, we should be flexible.
they are not crippled. that's the point. in virtual reality fullscreen makes
sense as "take over the world", not take over the output to one eye. for
monitors on a desktop it makes sense to take over that monitor but not others.
so it depends on context and the compositors job is to interpret/manage/deal
with that context.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
No. Applications want to be full screen or they don't want to be. If
they want to pick a particular output, we can easily let them do so.
i don't know about you.. but fullscreen to enlightenment means you use up
ONE SCREEN. [snip]
I never said that fullscreen means multiple screens. No clue where
that's coming from.
then why does this presentation tool need to be able to configure outputs - eg
define which screen views which part of their window spanning all outputs? i
see no other purpose of having configuration control of outputs for a
presentation tool.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
what makes sense is an app hints at the purpose of its window and opens n
windows (surfaces). it can ask for fullscreen for each. the hints would
allow the compositor to choose which screen the window/surface is assigned
to.
Hinting doesn't and cannot capture all of the use cases. Just letting
the client say what it wants does.
clients explicitly saying what they want leads to broken scenarios. the game
dev who has never had > 1 screen and thus messes up users' multi-screen setups
because they never knew of nor cared about this situation. a HINT allows
interpretation to adapt the scenario nicely and make things work "properly".

the "i'd like to be fullscreen" hint from xdg has been a godsend - it doesn't
allow for clients to go "well i want to be at 50,80 and at 1278x968" (though
other bits of x do). apps used to do things like query root window size, create
override-redirect window, grab kbd and mouse and then display ... even though
the root window may span many monitors and some parts of the root window geom
may not be visible as no screen views that, because the guy didn't know about randr
and such. worse they would play with xvidtune that only did 1 screen and thus
mess up all your screen config... because a protocol was invented that allows
EXPLICIT control and x HAD to implement explicit control. the fullscreen netwm
hint has drastically improved things as a high level hint allowing the wm to
interpret fullscreen in a way that makes sense given the scenario.

by the same token anything we do in wayland should be done at this higher
hinting level. anything else is a recipe for disaster. it's not learning the
lessons of the past.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Gnome calculator doesn't like being tiled: https://sr.ht/Ai5N.png
i think the problem is you are not handling min/max sizing of clients
properly. :) you need to fix sway. gnome calculator is not sizing up its
buffer on surface size. that is a message "i can't be bigger than this -
this is my biggest size. deal with it". you need to deal with it. eg - pad
it and make it sized AT the buffer size :)
This is harmful to tiling window managers in general. The window manager
arranges the windows, not the other way around. You can't have tiling
sorry. neither in x11 nor in wayland does a wm/compositor just have the freedom
to resize a window to any size it likes WITHOUT CONSEQUENCES. in x11 min/max
size hints tell the wm the range of sizes a window can be sensibly drawn/laid
out with. in wayland it's communicated by buffer size. if you choose to ignore
this then you get to deal with the consequences as in your screenshot.

i would not just blindly ignore such info. i'd either pad with black/background
and keep to the buffer size or at least scale while retaining aspect ratio (and
pad as needed but likely less).

interestingly now you complain about clients having EXPLICIT control and you
say "oh well no ... this is bad for tiling wm's" ... yet when i explain that
having output configuration control etc. etc. is harmful it's something that
SHOULD be allowed for clients... (and the output isn't even a client
resource, unlike the buffers they render, which are).
Post by Drew DeVault
window management if you can't have the compositor tell the clients what
size to be. There's currently no metadata to tell the compositor that a
surface is strict about its geometry. Most applications handle being
given a size quite well and will rearrange/rerender themselves to
compensate. Things like gnome-calculator are the exception, not the
rule.
yes there is - the buffer size of the next frame. your surface size is a
"request" to the client for that size. the response will be a new buffer of
some given size (or maybe no new buffer at all). you THEN deal with this new
size. :)
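in compositor terms, roughly (struct view and place() are made-up names - the
point is that the committed buffer, not the configure, is the client's final
answer):

  struct view { int buffer_w, buffer_h, tile_x, tile_y, tile_w, tile_h; };
  void place(struct view *v, int x, int y, int w, int h); /* hypothetical */

  /* on commit: reconcile the client's new buffer with the tile we asked for */
  void handle_commit(struct view *v)
  {
      int bw = v->buffer_w, bh = v->buffer_h;
      if (bw == v->tile_w && bh == v->tile_h) {
          place(v, v->tile_x, v->tile_y, bw, bh);   /* client obliged */
      } else {
          /* client refused (e.g. hit its max size): keep its size and
           * pad/center within the tile instead of showing garbage */
          place(v, v->tile_x + (v->tile_w - bw) / 2,
                   v->tile_y + (v->tile_h - bh) / 2, bw, bh);
      }
  }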
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
xdg shell should be handling these already - except dmenu. dmenu is
almost a special desktop component. like a shelf/panel/bar thing.
dmenu isn't the only one, though, that may want to arrange itself in
special ways. Lemonbar and rofi also come to mind.
all of these basically are "desktop components" ala
taskbars/shelves/panels/whatever - i know that for e we don't want to
support such apps. these are built in. i don't know what gnome or kde think
but these go against their design as an integrated desktop environment. YOU
need these because your compositor has no such feature itself. the bigger
desktops don't need it. they MAY support it - may not. i know i don't want
to. :)
Users should be free to choose the tools they want. dmenu is much more
flexible and scriptable than anything any of the DEs offer in its place
that is your wm's design. that is not the design of others. they want something
integrated and don't want external tools.
Post by Drew DeVault
- you just pipe in a list of things and the user picks one. Don't be
fooled into thinking that whatever your DE does for a given feature is
the mecca of that feature. Like you were saying to make other points -
no - but i'm saying that this is not a COMMON feature among all DEs. different
ones will work differently. gnome 3's chosen design these days is to put it
into gnome shell via js extensions, not the gnome 2 way with a separate panel
process (ala dmenu). enlightenment does it internally too and extends things
differently. my point is that what you want here is not universal.
Post by Drew DeVault
there are fewer contributors to each DE than you might imagine. DEs are
that is exactly what i said in response to you saying that "we have all the
resources to do all of this" when i said we don't... :/ we don't - resources
are already expended elsewhere.
Post by Drew DeVault
spread too thin to make the perfect _everything_. But some projects like
dmenu are small and singular in their focus, and maintained by one or
two people who put in a much larger amount of effort than is put in by
DE contributors on the corresponding features of that DE.
Be flexible enough for users to pick the tools they want.
a lifetime of doing wm's has taught me that this approach is not the best. you
end up with a limiting and complex protocol to then allow taskbars, pagers and
so on to be in "dmenus" of this world. this is how gnome 1.x and 2.x worked. i
added the support in e long ago. i learned that it was a limiter in adding
features as you had to conform to someone else's idea of what virtual desktops
are etc.

these panels/taskbars/shelves/whatever are best being closely integrated into
the wm.

YOU choose not to integrate. the other major DEs come already integrated with
these. this is not a universal solution everyone should support. you can come
up with your own extension and encourage people to support it in their dmenus
etc. - if another DE wants to support this then they can implement the same
extension.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
i don't know osu - but i see no reason krita needs to configure a tablet. it
can just deal with input from it. :)
input is very sensitive. having done this for years and watched how games
like to turn off key repeat then leave it off when they crash... or change
mouse accel then you find it's changed everywhere and have to "fix it" etc.
etc. - i'd be loath to do this. give them TOO much config ability and it
can become a security issue.
Let's change the tone of the input configuration discussion. I've come
around to your points about providing input configuration to clients in
general: let's not do that. I think the only issue we should worry about
for input at this point is fixing the pointer-constraints protocol to
use our new permissions model.
that's very reasonable. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Why do those things need to be dealt with first? Sway is at a good spot
where I can start thinking about these sorts of things. There are
enough people involved to work on multiple things at once. Plus,
everyone thinks nvidia's design is bad and we're hopefully going to see
something from them that avoids vendor-specific code.
because these imho are far more important. you might be surprised at how few
people are involved.
These features have to get done at some point. Backlog your
implementation of these protocols if you can't work on it now.
that's what i'm saying. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
not so simple. with more of the ui of an app being moved INTO the border
(titlebar etc.) this is not a simple thing to just turn off. you then
turn OFF necessary parts of the ui or have to push the problem out to the
app to "fallback".
You misunderstand me. I'm not suggesting that these apps be crippled.
I'm suggesting that, during the negotiation, they _object_ to having the
server draw their decorations. Then other apps that don't care can say
so.
aaah ok. so compositor adapts. then likely i would express this as a "minimize
your decorations" protocol from compositor to client; the client then responds
similarly with "minimize your decorations" and the compositor MAY choose not
to draw a shadow/titlebar etc. (or the client responds with "ok" and then the
compositor can draw all it likes around the app).
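as a strawman, that exchange might look like this - every name here is
invented, nothing like it exists yet:

  enum deco_mode {
      DECO_MODE_SERVER,  /* compositor draws titlebar/shadow (SSD) */
      DECO_MODE_CLIENT,  /* client keeps drawing its own (CSD) */
  };
  /* compositor -> client: "minimize your decorations" */
  deco_send_preferred_mode(deco_resource, DECO_MODE_SERVER);
  /* client -> compositor: "ok", or an objection with the mode it needs;
   * the compositor then draws everything, or nothing, around the app */
  deco_set_mode(deco, DECO_MODE_CLIENT);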
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
only having CSD solves all that complexity and is more efficient
than SSD when it comes to things like assigning hw layers or avoiding
copies of vast amounts of pixels. i was against CSD to start with too but i
see their major benefits.
I don't want to rehash this old argument here. There's two sides to this
coin. I think everyone fully understands the other position. It's not
hard to reach a compromise on this.
it's sad that we have to have this disagreement at all. :) go on. join the dark
side! :) we have cookies!
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
In Wayland you create a surface, then assign it a role. Extra details
can go in between, or go in the call that gives it a role. Right now
most applications are creating their surface and then making it a shell
surface. The compositor can negotiate based on its own internal state
over whether a given output is tiled or not, or in cases like AwesomeWM,
whether a given workspace is tiled or not. And I don't think the
decision has to be final. If the window is moved to another output or
really if any of the circumstances change, they can renegotiate and the
surface can start drawing its own decorations.
yup. but this signalling/negotiation has to exist. currently it doesn't. :)
We'll make this part of the protocols we're working on here :)
this i can agree on. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
you aren't going to talk me into implementing something that is important
for you and not a priority for e until such a time as i'm satisfied that
the other issues are solved. you are free to do what you want, but
standardizing things takes a looong time and a lot of experimentation,
discussion, and repeating this. we have resources on wayland and nothing
you described is a priority for them. there are far more important things
to do that are actual business requirements and so the people working need
to prioritize what is such a requirement as opposed to what is not.
resources are not infinite and free.
Like I said before, put it on your backlog. I'm doing it now, and I want
your input on it. Provide feedback now and implement later if you need
to, but if you don't then the protocols won't meet your needs.
Post by Carsten Haitzler (The Rasterman)
let me complicate it for you. let's say i'm playing a video fullscreen. you
now have to convert argb to yuv then encode when it would have been far more
efficient to get access directly to the yuv buffer before it was even
scaled to screen size... :) so you have just specified a protocol that is
by design inefficient when it could be more efficient.
What, do you expect to tell libavcodec to switch pixel formats
mid-recording? No one is recording their screen all the time. Yeah, you
might hit performance issues. So be it. It may not be ideal but it'll
likely be well within the limits of reason.
you'll appreciate what i'm getting at next time you have to do 4k ... or 8k
video and screencast/capture that. :) and have to do miracast... on a 1.3ghz
arm device :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
yes - but when, how often and via what mechanisms pixels get there is a very
delicate thing.
And yet you still need to convert the entire screen to a frame and feed
it into an encoder, no matter what. Feed the frame to a client instead.
is the screen a single frame or multiple frames pieced together by scanout hw
layers? :) what is your protocol/interface to the "screen stream"? if you have
it be a simple "single buffer" then you are going to run into issues soon
enough. :)
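to illustrate (all names invented - a per-frame request gives the compositor
a point at which to flatten its scanout layers, rather than exposing "the"
screen buffer directly):

  struct capture_frame *frame =
      capture_manager_capture_output(capture_mgr, output);
  capture_frame_copy(frame, client_buffer);  /* client-supplied wl_buffer */
  /* then events: "ready" (frame landed, with a timestamp) or
   * "failed" (output gone, mode changed mid-frame, ...) */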
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
so far we don't exactly have a lot of inter-desktop co-operation happening.
it's pretty much everyone for themselves except for a smallish core protocol.
Which is ridiculous.
Post by Carsten Haitzler (The Rasterman)
do NOT try and solve security sensitive AND performance sensitive AND design
limiting/dictating things first and definitely don't do it without everyone
on the same page.
I'm here to get everyone on the same page. Get on it.
let's work on the things we do have in common first. :)
--
------------- Codito, ergo sum - "I code, therefore I am" --------------
The Rasterman (Carsten Haitzler) ***@rasterman.com
Drew DeVault
2016-03-29 04:01:00 UTC
Permalink
Post by Carsten Haitzler (The Rasterman)
my take on it is that it's premature and not needed at this point. in fact i
wouldn't implement a protocol at all. *IF* i were to allow special access, i'd
simply require forking the process directly from the compositor and providing
a socketpair fd to this process; THAT fd could have extra capabilities
attached to the wl protocol. i would do nothing else because as a compositor i
cannot be sure what i am executing. i'd hand the choice of whether to execute
this tool over to the user to ok, and not just blindly execute anything i like.
I don't really understand why forking from the compositor and bringing
along the fds really gives you much of a gain in terms of security. Can
you elaborate on how this changes things? I should also mention that I
don't really see the sort of security goals Wayland has in mind as
attainable until we start doing things like containerizing applications,
in which case we can eliminate entire classes of problems from this
design.
Post by Carsten Haitzler (The Rasterman)
all a compositor has to do is be able to capture a video stream to a file. you
can ADD watermarking, sepia, and other effects later on in a video editor. next
you'll tell me gimp is incapable of editing image files so we need programmatic
access to a digital cameras ccd to implement effects/watermarking etc. on
photos...
I'll remind you again that none of this supports the live streaming
use-case.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
currently possible with ffmpeg. How about instead we make a simple
wayland protocol extension that we can integrate with ffmpeg and OBS and
imagemagick and so on in a single C file.
i'm repeating myself. there are bigger fish to fry.
I'm repeating myself. Fry whatever fish you want and backlog this fish.
Post by Carsten Haitzler (The Rasterman)
eh? ummm that is what happens - unless you close the lid, then internal display
is "disconnected".
I'm snipping out a lot of the output configuration related stuff from
this response. I'm not going to argue very hard for a common output
configuration protocol. I've been trying to change gears on the output
discussion towards a discussion around whether or not the
fullscreen-shell protocol supports our needs and whether or how it needs
to be updated wrt permissions. I'm going to continue to omit large parts
of your response that I think are related to the resistance to output
configuration, let me know if there's something important I'm dropping
by doing so.
Post by Carsten Haitzler (The Rasterman)
a protocol with undefined metadata is not a good protocol. it now passes blobs
of data that are opaque except to specific implementations. this will mean
that other implementations eventually will do things like strip it out or
damage it, as they don't know what it is, nor do they care.
It doesn't have to be undefined metadata. It can just be extensions. A
protocol with extensions built in is a good protocol whose designers had
foresight, kind of like the Wayland protocol we're all already making
extensions for.
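That extensibility is already visible in how every client binds globals -
interfaces are discovered by name and bound at a negotiated version, so new
metadata can ride in on a bumped version or a sibling interface rather than
as opaque blobs. A sketch of the standard pattern:

  #include <string.h>
  #include <wayland-client.h>

  static void on_global(void *data, struct wl_registry *reg, uint32_t name,
                        const char *interface, uint32_t version)
  {
      if (strcmp(interface, "wl_output") == 0) {
          /* bind at the highest version both sides understand */
          struct wl_output *out = wl_registry_bind(
              reg, name, &wl_output_interface, version < 2 ? version : 2);
          (void)out;  /* a wl_output_listener would go here */
      }
      (void)data;
  }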
Post by Carsten Haitzler (The Rasterman)
but it isn't the user - it's some game you download that you cannot alter the
code or behaviour of that then messes everything up because its creator only
ever had a single monitor and didn't account for those with 2 or 3.
But it _is_ the user. Let the user configure what they want, however
they want, and make it so that they can both do this AND deny crappy
games the right to do it as well. This applies to the entire discussion
broadly, not necessarily just to the output configuration bits (which I
retract).
Post by Carsten Haitzler (The Rasterman)
because things like output configuration i do not see as needing a common
protocol. in fact it's desirable to not have one at all so it cannot be abused
or cause trouble.
Troublemaking software is going to continue to make trouble. Further
news at 9. That doesn't really justify making trouble for users as well.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
In practice the VAST majority of our users are going to be using one or
more rectangular displays. We shouldn't cripple what they can do for the
sake of the niche. We can support both - why do we have to hide
information about the type of outputs in use from the clients? It
doesn't make sense for an app to get fullscreened in a virtual reality
compositor, yet we still support that. Rather than shoehorning every
design to meet the least common denominator, we should be flexible.
they are not crippled. that's the point. in virtual reality fullscreen makes
sense as "take over the world", not take over the output to one eye. for
monitors on a desktop it makes sense to take over that monitor but not others.
so it depends on context and the compositors job is to interpret/manage/deal
with that context.
I don't really understand what you're getting at here.
Post by Carsten Haitzler (The Rasterman)
sorry. neither in x11 nor in wayland does a wm/compositor just have the freedom
to resize a window to any size it likes WITHOUT CONSEQUENCES. in x11 min/max
size hints tell the wm the range of sizes a window can be sensibly drawn/laid
out with. in wayland it's communicated by buffer size. if you choose to ignore
this then you get to deal with the consequences as in your screenshot.
Here's gnome-calculator running on x with a tiling window manager:

https://fuwa.se/f/YIkvDi.png

Here's the wayland screenshot again for comparison:

https://sr.ht/Ai5N.png

Most apps are fine with being told what resolution to be, and they
_need_ to be fine with this for the sake of my sanity. But I understand
that several applications have special concerns that would prevent this
from making sense, and for those it's simply a matter of saying that
they'd prefer to be floating. This is actually one of the things in the
X ecosystem that works perfectly fine and has worked perfectly fine for
a long time.
Post by Carsten Haitzler (The Rasterman)
i would not just blindly ignore such info. i'd either pad with black/background
and keep to the buffer size or at least scale while retaining aspect ratio (and
pad as needed but likely less).
Eww.
Post by Carsten Haitzler (The Rasterman)
interestingly now you complain about clients having EXPLICIT control and you
say "oh well no ... this is bad for tiling wm's" ... yet when i explain that
having output configuration control etc. etc. is harmful it's something that
SHOULD be allowed for clients... (and the output isn't even a client
resource, unlike the buffers they render, which are).
What I really want is _users_ to have control. I don't like it that
compositors are forcing solutions on them that don't allow them to be
in control of how their shit works.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Users should be free to choose the tools they want. dmenu is much more
flexible and scriptable than anything any of the DEs offer in its place
that is your wm's design. that is not the design of others.
they want something integrated...
okay
Post by Carsten Haitzler (The Rasterman)
...and don't want external tools.
Bullshit. Give them something integrated and they'll use it. However,
there's no reason why the integrated solution and the external tools
couldn't both exist. The users don't give a fuck about whether or not
the external tools exist. They are apathetic about it, they don't
actively "not want it", and their experience is in no way worsened by
the availability of external tools. Those who do want external tools,
however, have a worsened experience if we design ourselves into a black
box that no one can extend.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
- you just pipe in a list of things and the user picks one. Don't be
fooled into thinking that whatever your DE does for a given feature is
the mecca of that feature. Like you were saying to make other points -
no - but i'm saying that this is not a COMMON feature among all DEs. different
ones will work differently. gnome 3's chosen design these days is to put it
into gnome shell via js extensions, not the gnome 2 way with a separate panel
process (ala dmenu). enlightenment does it internally too and extends things
differently. my point is that what you want here is not universal.
I'm not suggesting anything radical to try and cover all of these use
cases at once. Sway has a protocol that lets a surface indicate it wants
to be docked somewhere, which allows for custom taskbars and things like
dmenu and so on to exist pretty easily, and this protocol is how swaybar
happens to be implemented. This doesn't seem very radical to me; it
doesn't enforce anything on how each of the DEs choose to implement
their this and that.
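Roughly, the surface just says which edge it wants to live on and how much
space to reserve - a sketch with invented names (the protocol Sway actually
ships may differ):

  enum dock_edge { DOCK_TOP, DOCK_BOTTOM, DOCK_LEFT, DOCK_RIGHT };
  /* pin this surface to the bottom edge and reserve 26px of the output
   * so tiled windows don't overlap it */
  dock_manager_set_dock(dock_mgr, surface, DOCK_BOTTOM, 26);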
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
there are fewer contributors to each DE than you might imagine. DEs are
that is exactly what i said in response to you saying that "we have all the
resources to do all of this" when i said we don't... :/ we don't - resources
are already expended elsewhere.
We've both used this same argument from each side multiple times, it's
getting kind of old. But I think these statements hold true:

There aren't necessarily enough people to work on the features I'm
proposing right now. I don't think anyone needs to implement this _right
now_. There also aren't ever enough people to give every little feature
of their DE the attention that leads to software that is as high quality
as a similar project with a single focus on that one feature.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Be flexible enough for users to pick the tools they want.
a lifetime of doing wm's has taught me that this approach is not the best. you
end up with a limiting and complex protocol to then allow taskbars, pagers and
so on to be in "dmenus" of this world. this is how gnome 1.x and 2.x worked. i
added the support in e long ago. i learned that it was a limiter in adding
features as you had to conform to someone else's idea of what virtual desktops
are etc.
A lifetime of using and customizing and scripting WMs that are more
composable and configurable than e, gnome, kde, and most of the other
Big Ones has led me to the opposite conclusion. I'm not suggesting we do
these sorts of efforts ad nauseum. I don't think we're heading towards a
situation where we're agreeing on the implementation of virtual
desktops. I'm putting forth a small handful of important, core features
that we are all going to have to support in some way or another to even
qualify as wayland compositors and subvert X's dominance over the
desktop.
Post by Carsten Haitzler (The Rasterman)
these panels/taskbars/shelves/whatever are best being closely integrated into
the wm.
You don't provide any justification for this, you just say it like it's
gospel, and it's not. I will again remind you that not everyone wants to
buy into a desktop environment wholesale. They may want to piece it
together however they see fit and it's their god damn right to. Anything
else is against the spirit of free software.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
These features have to get done at some point. Backlog your
implementation of these protocols if you can't work on it now.
that's what i'm saying. :)
In this case, I'm not seeing how your points about what order things
need to be done in matters. Now is the right time for me to implement
this in Sway. The major problems you're trying to solve are either
non-issues or solved issues on Sway, and it makes sense to do this now.
I'd like to do it in a way that works for everyone.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
You misunderstand me. I'm not suggesting that these apps be crippled.
I'm suggesting that, during the negotiation, they _object_ to having the
server draw their decorations. Then other apps that don't care can say
so.
aaah ok. so compositor adapts. then likely i would express this as a "minimize
your decorations" protocol from compositor to client; the client then responds
similarly with "minimize your decorations" and the compositor MAY choose not
to draw a shadow/titlebar etc. (or the client responds with "ok" and then the
compositor can draw all it likes around the app).
I think Jonas is on the right track here. This sort of information could
go into xdg_*. It might not need an entire protocol to itself.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
I don't want to rehash this old argument here. There's two sides to this
coin. I think everyone fully understands the other position. It's not
hard to reach a compromise on this.
it's sad that we have to have this disagreement at all. :) go on. join the dark
side! :) we have cookies!
Never! I want my GTK apps and my Qt apps to have the same decorations,
dammit :) Too bad I don't have much hope for making my cursor theme
consistent across my entire desktop...
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
What, do you expect to tell libavcodec to switch pixel formats
mid-recording? No one is recording their screen all the time. Yeah, you
might hit performance issues. So be it. It may not be ideal but it'll
likely be well within the limits of reason.
you'll appreciate what i'm getting at next time you have to do 4k ... or 8k
video and screencast/capture that. :) and have to do miracast... on a 1.3ghz
arm device :)
I'll go back to the earlier argument of "we shouldn't cripple the
majority for the sake of the niche". Who on Earth is going to drive an
8K display on a 1.3ghz ARM device anyway :P

--
Drew DeVault
Carsten Haitzler (The Rasterman)
2016-03-29 06:10:10 UTC
Permalink
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
my take on it is that it's premature and not needed at this point. in fact i
wouldn't implement a protocol at all. *IF* i were to allow special access,
i'd simply require forking the process directly from the compositor and
providing a socketpair fd to this process; THAT fd could have extra
capabilities attached to the wl protocol. i would do nothing else because as
a compositor i cannot be sure what i am executing. i'd hand the choice of
whether to execute this tool over to the user to ok, and not just blindly
execute anything i like.
I don't really understand why forking from the compositor and bringing
along the fds really gives you much of a gain in terms of security. Can
why?

there is no way a process can access the socket with privs (or even know the
extra protocol exists) unless it is executed by the compositor. the compositor
can do whatever it deems "necessary" to ensure it executes only what is
allowed. eg - a whitelist of binary paths. i see this as a lesser chance of a
hole.
Post by Drew DeVault
you elaborate on how this changes things? I should also mention that I
don't really see the sort of security goals Wayland has in mind as
attainable until we start doing things like containerizing applications,
in which case we can eliminate entire classes of problems from this
design.
certain os's do this already - tizen does. we use smack labels. this is why i
care so much about application isolation and not having anything exposed to an
app that it doesn't absolutely need. :) so i am coming from the point of view
of "containering is solved - we need to not break that in wayland" :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
all a compositor has to do is be able to capture a video stream to a file.
you can ADD watermarking, sepia, and other effects later on in a video
editor. next you'll tell me gimp is incapable of editing image files so we
need programmatic access to a digital camera's ccd to implement
effects/watermarking etc. on photos...
I'll remind you again that none of this supports the live streaming
use-case.
i know - but for just capturing screencasts, adding watermarks etc. - all you
need is to store a stream - the rest can be post-processed.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
currently possible with ffmpeg. How about instead we make a simple
wayland protocol extension that we can integrate with ffmpeg and OBS and
imagemagick and so on in a single C file.
i'm repeating myself. there are bigger fish to fry.
I'm repeating myself. Fry whatever fish you want and backlog this fish.
Post by Carsten Haitzler (The Rasterman)
eh? ummm that is what happens - unless you close the lid, then internal
display is "disconnected".
I'm snipping out a lot of the output configuration related stuff from
this response. I'm not going to argue very hard for a common output
configuration protocol. I've been trying to change gears on the output
discussion towards a discussion around whether or not the
fullscreen-shell protocol supports our needs and whether or how it needs
to be updated wrt permissions. I'm going to continue to omit large parts
of your response that I think are related to the resistance to output
configuration, let me know if there's something important I'm dropping
by doing so.
why do we need the fullscreen shell? from memory, that was intended for
environments where apps are only ever fullscreen. xdg shell has the ability
for a window to go fullscreen (or back to normal); this should do just
fine. :) sure - let's talk about this stuff - fullscreening etc.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
a protocol with undefined metadata is not a good protocol. it now passes
blobs of data that are opaque except to specific implementations. this
will mean that other implementations eventually will do things like strip
it out or damage it, as they don't know what it is, nor do they care.
It doesn't have to be undefined metadata. It can just be extensions. A
protocol with extensions built in is a good protocol whose designers had
foresight, kind of like the Wayland protocol we're all already making
extensions for.
yeah - but you are creating objects (screens) with no extended data - or
modifying them. you either don't have the data or you lose it. :) let's talk
about the actual apps' surfaces and where they go - not configuration of
outputs. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
but it isn't the user - it's some game you download that you cannot alter
the code or behaviour of that then messes everything up because its creator
only ever had a single monitor and didn't account for those with 2 or 3.
But it _is_ the user. Let the user configure what they want, however
they want, and make it so that they can both do this AND deny crappy
games the right to do it as well. This applies to the entire discussion
broadly, not necessarily just to the output configuration bits (which I
retract).
Post by Carsten Haitzler (The Rasterman)
because things like output configuration i do not see as needing a common
protocol. in fact it's desirable to not have one at all so it cannot be
abused or cause trouble.
Troublemaking software is going to continue to make trouble. Further
news at 9. That doesn't really justify making trouble for users as well.
or just have the compositor "work" without needing scripts and users to have to
learn how to write them. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
In practice the VAST majority of our users are going to be using one or
more rectangular displays. We shouldn't cripple what they can do for the
sake of the niche. We can support both - why do we have to hide
information about the type of outputs in use from the clients? It
doesn't make sense for an app to get fullscreened in a virtual reality
compositor, yet we still support that. Rather than shoehorning every
design to meet the least common denominator, we should be flexible.
they are not crippled. that's the point. in virtual reality fullscreen makes
sense as a "take over thew world", not take over the output to one eye.for
monitors on a desktop it makes sense to take over that monitor but not
others. so it depends on context and the compositors job is to
interpret/manage/deal with that context.
I don't really understand what you're getting at here.
apps can still be fullscreen. nothing has been crippled. just what fullscreen
MEANS is defined by context by the compositor.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
sorry. neither in x11 nor in wayland does a wm/compositor just have the
freedom to resize a window to any size it likes WITHOUT CONSEQUENCES. in
x11 min/max size hints tell the wm the range of sizes a window can be
sensibly drawn/laid out with. in wayland it's communicated by buffer size.
if you choose to ignore this then you get to deal with the consequences as
in your screenshot.
https://fuwa.se/f/YIkvDi.png
that'd be the toolkit actually resizing regardless of its min/max hints - the
wayland back end is refusing to do this. the x11 back end is "dealing with it"
even though it doesn't have to. i can point at more software that when you go
beyond max or below min size looks like trash - it may have blank/garbage areas
of the window or fall over in other ways. in x11 you CANNOT hard-control your
window size. the wm can resize it to whatever and ignore your min/max hints.
in wayland the CLIENT controls buffer size and fills buffer with content before
compositor sees it. the compositor can't force a buffer size on a client. x and
wayland work differently in the case where the wm decided to just go "screw you
- i'm doing this". you may want to NOT do that and respect the fact the client
has a min and max size and work with it. :)
Post by Drew DeVault
https://sr.ht/Ai5N.png
Most apps are fine with being told what resolution to be, and they
_need_ to be fine with this for the sake of my sanity. But I understand
that several applications have special concerns that would prevent this
but for THEIR sanity, they are not fine with it. :)
Post by Drew DeVault
from making sense, and for those it's simply a matter of saying that
they'd prefer to be floating. This is actually one of the things in the
X ecosystem that works perfectly fine and has worked perfectly fine for
a long time.
no. this has nothing to do with floating. this has to do with minimum and in
this case especially - maximum sizes. it has NOTHING to do with floating. you
are conflating sizing with floating because floating is how YOU HAPPEN to want
to deal with it. you COULD deal with it as i described - pad out the area or
scale retaining aspect ratio - allow user to configure the response. if i had a
small calculator on the left and something that can size up on the right i
would EXPECT a tiling wm to be smart and do:

+---+------------+
| |............|
|:::|............|
|:::|............|
|:::|............|
| |............|
+---+------------+

so keep the left column at the max width of its clients and let the right side
expand instead. on the left i pad with black/background around the "calculator"
there.
that is what i'd expect if a client can't size up. the same for min size
(sizing down) - don't force apps to be smaller than their min size. deal with it
by scrolling or scaling the bitmap or however you like - but deal with it. :)
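the padding i describe is a few lines of geometry - a sketch, assuming the
compositor tracks each client's max size (struct box and struct view are
made up):

  struct box { int x, y, w, h; };
  struct view { int max_w, max_h; int x, y, w, h; };

  /* fit a max-size-constrained client inside its tile; the leftover
   * tile area becomes background padding */
  void fit_in_tile(struct view *v, struct box tile)
  {
      int w = (v->max_w > 0 && v->max_w < tile.w) ? v->max_w : tile.w;
      int h = (v->max_h > 0 && v->max_h < tile.h) ? v->max_h : tile.h;
      v->x = tile.x + (tile.w - w) / 2;  /* center in the column */
      v->y = tile.y + (tile.h - h) / 2;
      v->w = w;
      v->h = h;
  }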

but don't confuse min and max size with floating. expecting devs to tell you
they want to float is not going to be common, as most devs won't target a tiling
wm to make you happy here. YOU should choose to float - eg if the window is of a
dialog type, or perhaps if it refuses to adapt to the size given etc. you need
to come up with properties/tags/modes/intents that are common across DEs to
have them be supported commonly. floating will not be common except as a SPECIAL
mode for tiling wm's. try something else. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
i would not just blindly ignore such info. i'd either pad with
black/background and keep to the buffer size or at least scale while
retaining aspect ratio (and pad as needed but likely less).
Eww.
Post by Carsten Haitzler (The Rasterman)
interestingly now you complain about clients having EXPLICIT control and you
say "oh well no ... this is bad for tiling wm's" ... yet when i explain that
having output configuration control etc. etc. is harmful it's something that
SHOULD be allowed for clients... (and the output isn't even a client
resource, unlike the buffers they render, which are).
What I really want is _users_ to have control. I don't like it that
compositors are forcing solutions on them that don't allow them to be
in control of how their shit works.
they can patch their compositors if they want. if you are forcing users to
write scripts you are already forcing them to "learn to code" in a simple way.
would it not be best to try and make things work without needing scripts/custom
code per user and have features/modes/logic that "just work"?
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Users should be free to choose the tools they want. dmenu is much more
flexible and scriptable than anything any of the DEs offer in its place
that is your wm's design. that is not the design of others.
they want something integrated...
okay
Post by Carsten Haitzler (The Rasterman)
...and don't want external tools.
Bullshit. Give them something integrated and they'll use it. However,
i was speaking of the other DE developers - not users. YOUR design does not
want integrated. others WANT integrated designs and DON'T want ad-hoc
non-integrated components in the desktop environment they are creating.
Post by Drew DeVault
there's no reason why the integrated solution and the external tools
couldn't both exist. The users don't give a fuck about whether or not
the external tools exist. They are apathetic about it, they don't
actively "not want it", and their experience is in no way worsened by
the availablility of external tools. Those who do want external tools,
however, have a worsened experience if we design ourselves into a black
box that no one can extend.
you need to calm down i think.

*I* do not want ad hoc panels/taskbars/tools written by separate projects within
my DE because they cause more problems than they solve. been there. done that.
not going back. i learned my lesson on that years ago. for them to be fully
functional you have to have pagers and taskbars in them, and unless you ALSO bind
all this metadata for the pagers, virtual desktops and their content to a
protocol that is also universal, then it's rather pointless. this then ties your
desktop to a specific design of how desktops are (eg NxM grids and only ONE of
those in an entire environment), when with enlightenment each screen has an
independent NxM grid PER SCREEN that can be switched separately.

so either i break all those 3rd party pagers or i compromise design and force
everyone into a horrible "1 desktop spans all screens and you have NxM virtual
desktops for all screens combined" which is far worse, so i abandoned
supporting the protocol (netwm).

for good historical reasons i know *I* don't want to repeat this design from
x11 with wayland. just to implement a pager or taskbar is a security hole as
you begin to expose other clients - no more isolation. you expose buffers of
their content. AND you limit your notions of a desktop/screen to those defined
by that protocol. i would not start walking down this path to begin with.

i'm warning you that you are simply repeating past mistakes by trying to go
this way.
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
- you just pipe in a list of things and the user picks one. Don't be
fooled into thinking that whatever your DE does for a given feature is
the mecca of that feature. Like you were saying to make other points -
no - but i'm saying that this is not a COMMON feature among all DEs.
different ones will work differently. gnome 3's chosen design these days is
to put it into gnome shell via js extensions, not the gnome 2 way with a
separate panel process (a la dmenu). enlightenment does it internally too
and extends differently. my point is that what you want here is not
universal.
I'm not suggesting anything radical to try and cover all of these use
cases at once. Sway has a protocol that lets a surface indicate it wants
to be docked somewhere, which allows for custom taskbars and things like
dmenu and so on to exist pretty easily, and this protocol is how swaybar
happens to be implemented. This doesn't seem very radical to me, it
doesn't enforce anything on how each of the DEs choose to implement
their this and that.
then keep your protocol. :) i know i have no interest in supporting it - as
above. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
there are fewer contributors to each DE than you might imagine. DEs are
that is exactly what i said in response to you saying that "we have all the
resources to do all of this" when i said we don't... :/ we don't - resources
are already expended elsewhere.
We've both used this same argument from each side multiple times, it's
There aren't necessarily enough people to work on the features I'm
proposing right now. I don't think anyone needs to implement this _right
now_. There also aren't ever enough people to give every little feature
of their DE the attention that leads to software that is as high quality
as a similar project with a single focus on that one feature.
that is true. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Be flexible enough for users to pick the tools they want.
a lifetime of doing wm's has taught me that this approach is not the best.
you end up with a limiting and complex protocol to then allow taskbars,
pagers and so on to be in "dmenus" of this world. this is how gnome 1.x and
2.x worked. i added the support in e long ago. i learned that it was a
limiter in adding features as you had to conform to someone else's idea of
what virtual desktops are etc.
A lifetime of using and customizing and scripting WMs that are more
composable and configurable than e, gnome, kde, and most of the other
Big Ones has led me to the opposite conclusion. I'm not suggesting we do
these sorts of efforts ad nauseam. I don't think we're heading towards a
situation where we're agreeing on the implementation of virtual
desktops. I'm putting forth a small handful of important, core features
that we are all going to have to support in some way or another to even
qualify as wayland compositors and subvert X's dominance over the
desktop.
i just think that some of the things you want should stay "within your
compositor and its extension protocols". other things i see as genuinely
globally useful. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
these panels/taskbars/shelves/whatever are best being closely integrated
into the wm.
You don't provide any justification for this, you just say it like it's
gospel, and it's not. I will again remind you that not everyone wants to
considering i actually have implemented all of this over the years,
experienced the downsides and have come around to the conclusion that an
integrated environment works best ... i've done the miles. i explained above
how external pagers create issues in x11 and thus why they were
dropped. not to mention security concerns (that were not an issue in x11
because it's insecure by design - insecure meaning you can access any content
of any window at any time, or discover all your application window id's any
time in the window tree whenever you want - no isolation ... etc.).
Post by Drew DeVault
buy into a desktop environment wholesale. They may want to piece it
together however they see fit and it's their god damn right to. Anything
else is against the spirit of free software.
i disagree. i can't take linux and just use some bsd device driver with it - oh
dear. that's against the spirit of free software! i have to port it and
integrate it (as a kernel module). wayland is about making the things that HAVE
to be shared protocol just that. the things that don't absolutely have to be,
we don't. you are able to patch, modify and extend your de/wm, all you like -
most de's provide some way to do this. gnome today uses js. e uses loadable
modules. i am unsure about kde. :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
These features have to get done at some point. Backlog your
implementation of these protocols if you can't work on it now.
that's what i'm saying. :)
In this case, I'm not seeing how your points about what order things
need to be done in matters. Now is the right time for me to implement
this in Sway. The major problems you're trying to solve are either
non-issues or solved issues on Sway, and it makes sense to do this now.
I'd like to do it in a way that works for everyone.
you need to solve clients that have a min/max size without introducing the
need for a floating property. that is something entirely different. not solved.
what happens when you need to restart sway after some development? where do all
your terminals/editors/ide's, browsers/irc clients go? they vanish and you have
to re-run them?
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
You misunderstand me. I'm not suggesting that these apps be crippled.
I'm suggesting that, during the negotiation, they _object_ to having the
server draw their decorations. Then other apps that don't care can say
so.
aaah ok. so compositor adapts. then likely i would express this as a
"minimize your decorations" protocol from compositor to client, client to
compositor then responds similarly like "minimize your decorations" and
compositor MAY choose to not draw a shadow/titlebar etc. (or client
responds with "ok" and then compositor can draw all it likes around the
app).
I think Jonas is on the right track here. This sort of information could
go into xdg_*. It might not need an entire protocol to itself.
i'd lean on a revision of xdg :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
I don't want to rehash this old argument here. There's two sides to this
coin. I think everyone fully understands the other position. It's not
hard to reach a compromise on this.
it's sad that we have to have this disagreement at all. :) go on. join the
dark side! :) we have cookies!
Never! I want my GTK apps and my Qt apps to have the same decorations,
dammit :) Too bad I don't have much hope for making my cursor theme
consistent across my entire desktop...
but.... COOKIES! COOOOOOOKIES! :)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
What, do you expect to tell libavcodec to switch pixel formats
mid-recording? No one is recording their screen all the time. Yeah, you
might hit performance issues. So be it. It may not be ideal but it'll
likely be well within the limits of reason.
you'll appreciate what i'm getting at next time you have to do 4k ... or 8k
video and screencast/capture that. :) and have to do miracast... on a 1.3ghz
arm device :)
I'll go back to the earlier argument of "we shouldn't cripple the
majority for the sake of the niche". Who on Earth is going to drive an
8K display on a 1.3ghz ARM device anyway :P
... you might be surprised. 4k ones are already out there. ok. not 1.3ghz -
2ghz - but no way you can capture even 4k with the highest end arms unless you
avoid conversion. you keep things in yuv space and drop your bandwidth
requirements hugely. in fact you never leave yuv space and make use of the hw
layers and the video decoder decodes directly into scanout buffers. you MAY be
able to stuff the yuv buffers back into an encoder and re-encode again ... just.
but it'd be better not to decode AND encode but take the mp4/whatever stream
directly and shuffle it down the network pipe. :)

believe it or not TODAY tablets with 4k screens ship. you can buy them. they
are required to support things like miracast (mp4/h264 stream over wifi). it's
reality today. products shipping in the 100,000's and millions. :)
--
------------- Codito, ergo sum - "I code, therefore I am" --------------
The Rasterman (Carsten Haitzler) ***@rasterman.com
Drew DeVault
2016-03-29 12:11:03 UTC
Permalink
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
I don't really understand why forking from the compositor and bringing
along the fds really gives you much of a gain in terms of security. Can
why?
there is no way a process can access the socket with privs (or even know the
extra protocol exists) unless it is executed by the compositor. the compositor
can do whatever it deems "necessary" to ensure it executes only what is
allowed. eg - a whitelist of binary paths. i see this as a lesser chance of a
hole.
I see what you're getting at now. We can get the pid of a wayland
client, though, and from that we can look at /proc/<pid>/cmdline, from which
we can get the binary path. We can even look at /proc/<pid>/exe and produce a
checksum of it, so that programs become untrusted as soon as they
change.
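
For illustration, roughly what I'm describing - an untested sketch (on the
compositor side you could equally get the pid from wl_client_get_credentials();
and as pointed out later in the thread, pid-based checks are racy):

#define _GNU_SOURCE /* struct ucred */
#include <stdio.h>
#include <unistd.h>
#include <sys/socket.h>

/* Resolve the peer of a unix socket to its executable path via its pid.
 * Sketch only: the pid can be recycled out from under us, so this must
 * not be treated as a real security boundary. */
static int peer_exe(int sock_fd, char *buf, size_t len)
{
    struct ucred cred;
    socklen_t cred_len = sizeof(cred);
    char path[64];
    ssize_t n;

    if (getsockopt(sock_fd, SOL_SOCKET, SO_PEERCRED, &cred, &cred_len) < 0)
        return -1;
    snprintf(path, sizeof(path), "/proc/%d/exe", (int)cred.pid);
    n = readlink(path, buf, len - 1);
    if (n < 0)
        return -1;
    buf[n] = '\0'; /* readlink() does not NUL-terminate */
    return 0;
}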
Post by Carsten Haitzler (The Rasterman)
i know - but for just capturing screencasts, adding watermarks etc. - all you
need is to store a stream - the rest can be post-processed.
Correct, if you record to a file, you can deal with it in post. But
there are other concerns, like what output format you'd like to use and
what encoding quality you want, considering factors like disk
space, cpu usage, etc. And there still is the live streaming use-case,
which we should support and which your solution does not address.
Post by Carsten Haitzler (The Rasterman)
why do we need the fullscreen shell? that was intended for environments where
apps are only ever fullscreen from memory. xdg shell has the ability for a
window to go fullscreen (or back to normal) this should do just fine. :) sure -
let's talk about this stuff - fullscreening etc.
I've been mixing up fullscreen-shell with that one thing in xdg-shell.
My bad.
Post by Carsten Haitzler (The Rasterman)
let's talk about the actual apps surfaces and where they go - not
configuration of outputs. :)
No, I mean, that's what I'm getting at. I don't want to talk about that
because it doesn't make sense outside of e. On Sway, the user is putting
their windows (fullscreen or otherwise) on whatever output they want
themselves. There aren't output roles. Outputs are just outputs and I
intend to keep it that way.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Troublemaking software is going to continue to make trouble. Further
news at 9. That doesn't really justify making trouble for users as well.
or just have the compositor "work" without needing scripts and users to have to
learn how to write them. :)
Never gonna happen, man. There's no way you can foresee and code for
everyone's needs. I'm catching on to this point you're heading towards,
though: e doesn't intend to suit everyone's needs.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
https://sr.ht/Ai5N.png
Most apps are fine with being told what resolution to be, and they
_need_ to be fine with this for the sake of my sanity. But I understand
that several applications have special concerns that would prevent this
but for THEIR sanity, they are not fine with it. :)
Nearly all toolkits are entirely fine with being any size, at least
above some sane minimum. A GUI that cannot deal with being a
user-specified size is a poorly written GUI.
Post by Carsten Haitzler (The Rasterman)
no. this has nothing to do with floating. this has to do with minimum and in
this case especially - maximum sizes. it has NOTHING to do with floating. you
are conflating sizing with floating because floating is how YOU HAPPEN to want
to deal with it.
Fair. Floating is how I would deal with it. But maybe I'm missing
something: where do the min/max size hints come from? All I seem to
know of is the surface geometry request, which isn't a hint so much as
it's something every single app does. If I didn't ignore it, all windows
would be fucky and the tiling layout wouldn't work at all. Is there some
other hint coming from somewhere I'm not aware of?
Post by Carsten Haitzler (The Rasterman)
you COULD deal with it as i described - pad out the area or
scale retaining aspect ratio - allow user to configure the response. if i had a
small calculator on the left and something that can size up on the right i
+---+------------+
| |............|
|:::|............|
|:::|............|
|:::|............|
| |............|
+---+------------+
Eh, this might be fine for a small number of windows, and maybe even is
the right answer for Sway. I'm worried about it happening for most
windows and I don't want to encourage people to make their applications
locked into one aspect ratio and unfriendly to tiling users.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
What I really want is _users_ to have control. I don't like it that
compositors are forcing solutions on them that don't allow them to be
in control of how their shit works.
they can patch their compositors if they want. if you are forcing users to
write scripts you are already forcing them to "learn to code" in a simple way.
would it not be best to try and make things work without needing scripts/custom
code per user and have features/modes/logic that "just work" ?
There's a huge difference between the skillset necessary to patch a
Wayland compositor to support scriptable output configuration and to
write a bash script that uses a tool the compositor shipped for this
purpose.
Post by Carsten Haitzler (The Rasterman)
*I* do not want ad hoc panels/taskbars/tools written by separate projects within
my DE because they cause more problems than they solve. been there. done that.
not going back. i learned my lesson on that years ago. for them to be fully
functional you have to have pagers and taskbars in them, and unless you ALSO bind
all this metadata for the pagers, virtual desktops and their content to a
protocol that is also universal, then it's rather pointless. this then ties your
desktop to a specific design of how desktops are (eg NxM grids and only ONE of
those in an entire environment), when with enlightenment each screen has an
independent NxM grid PER SCREEN that can be switched separately.
Again, the scope of this is not increasing ad nauseam. I never brought
virtual desktops and pagers into the mix. There is a small number of
things that are clearly the compositor's responsibility and that small
list is the only things I want to manipulate with a protocol. Handling
screen capture hardly has room for innovation - there are pixels on
screen, they need to be given to ffmpeg et al. This isn't locking you
into some particular user-facing design choice in your DE.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
I'm not suggesting anything radical to try and cover all of these use
cases at once. Sway has a protocol that lets a surface indicate it wants
to be docked somewhere, which allows for custom taskbars and things like
dmenu and so on to exist pretty easily, and this protocol is how swaybar
happens to be implemented. This doesn't seem very radical to me, it
doesn't enforce anything on how each of the DEs choose to implement
their this and that.
then keep your protocol. :) i know i have no interest in supporting it - as
above. :)
Well, so be it.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
We've both used this same argument from each side multiple times, it's
There aren't necessarily enough people to work on the features I'm
proposing right now. I don't think anyone needs to implement this _right
now_. There also aren't ever enough people to give every little feature
of their DE the attention that leads to software that is as high quality
as a similar project with a single focus on that one feature.
that is true. :)
Interesting that this immediately follows the last paragraph. If you
acknowledge that your implementation of desktop feature #27 can't
possibly be as flexible/configurable/usable/good as some project that's
entirely focused on just making that one feature great, then why would
you refuse to implement the required extensibility for your users to
bring the best tools available into your environment?
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
buy into a desktop environment wholesale. They may want to piece it
together however they see fit and it's their god damn right to. Anything
else is against the spirit of free software.
i disagree. i can't take linux and just use some bsd device driver with it - oh
dear. that's against the spirit of free software! i have to port it and
integrate it (as a kernel module). wayland is about making the things that HAVE
to be shared protocol just that. the things that don't absolutely have to be,
we don't. you are able to patch, modify and extend your de/wm, all you like -
most de's provide some way to do this. gnome today uses js. e uses loadable
modules. i am unsure about kde. :)
Sure, but you can use firefox and vim and urxvt while your friend
prefers termite and emacs and chromium, and your other friend uses gedit
and gnome-terminal and surf.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
In this case, I'm not seeing how your points about what order things
need to be done in matters. Now is the right time for me to implement
this in Sway. The major problems you're trying to solve are either
non-issues or solved issues on Sway, and it makes sense to do this now.
I'd like to do it in a way that works for everyone.
you need to solve clients that have a min/max size without introducing the
need for a floating property. that is something entirely different. not solved.
You're right, I do have to solve this. But my project and its
contributors have the bandwidth to address this and the things I'm
bringing up at the same time.
Post by Carsten Haitzler (The Rasterman)
what happens when you need to restart sway after some development? where do all
your terminals/editors/ide's, browsers/irc clients go? they vanish and you have
to re-run them?
Most of my users aren't developers working on sway all the time. Sway
has an X backend like Weston, I use that to run nested sways for
development so I'm not restarting Sway all the time. The compositor
crashing without losing all of the clients is a pipe dream imo, I'm not
going to look into it for now.
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
aaah ok. so compositor adapts. then likely i would express this as a
"minimize your decorations" protocol from compositor to client, client to
compositor then responds similarly like "minimize your decorations" and
compositor MAY choose to not draw a shadow/titlebar etc. (or client
responds with "ok" and then compositor can draw all it likes around the
app).
I think Jonas is on the right track here. This sort of information could
go into xdg_*. It might not need an entire protocol to itself.
i'd lean on a revision of xdg :)
I might lean the other way now that I've seen that KDE has developed a
protocol for this. I think that would be a better starting point since
it's proven and already in use. Thoughts?
Post by Carsten Haitzler (The Rasterman)
... you might be surprised. 4k ones are already out there. ok. not 1.3ghz -
2ghz - but no way you can capture even 4k with the highest end arms unless you
avoid conversion. you keep things in yuv space and drop your bandwidth
requirements hugely. in fact you never leave yuv space and make use of the hw
layers and the video decoder decodes directly into scanout buffers. you MAY be
able to stuff the yuv buffers back into an encoder and re-encode again ... just.
but it'd be better not to decode AND encode but take the mp4/whatever stream
directly and shuffle it down the network pipe. :)
believe it or not TODAY tablets with 4k screens ship. you can buy them. they
are required to support things like miracast (mp4/h264 stream over wifi). it's
reality today. products shipping in the 100,000's and millions. :)
Eh, alright. So they'll exist soon. I feel like both strategies can
coexist, in that case. If you want to livestream your tablet, you'll
have a performance hit and it might just be unavoidable. If you just
want to record video, use the compositor's built in thingy. I'm okay
with unavoidable performance concerns in niche situations - most people
aren't going to be livestreaming from their tablet pretty much ever.
Most people aren't even going to be screen capturing on their tablet to
be honest. It goes back to crippling the common case for the sake of the
niche case.

--
Drew DeVault
Daniel Stone
2016-03-29 12:18:11 UTC
Permalink
Hi,
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
or just have the compositor "work" without needing scripts and users to have to
learn how to write them. :)
Never gonna happen, man. There's no way you can foresee and code for
everyone's needs. I'm catching on to this point you're heading towards,
though: e doesn't intend to suit everyone's needs.
If a compositor implementation can never be sufficient to express
people's needs, how could an arbitrary protocol be better? Same
complexity problem.

(And, as far as the 'but what if a compositor implementation isn't
good' argument goes - don't use bad compositors.)

Cheers,
Daniel
Pekka Paalanen
2016-03-29 13:44:32 UTC
Permalink
On Tue, 29 Mar 2016 08:11:03 -0400
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
Post by Drew DeVault
I don't really understand why forking from the compositor and bringing
along the fds really gives you much of a gain in terms of security. Can
why?
there is no way a process can access the socket with privs (or even know the
extra protocol exists) unless it is executed by the compositor. the compositor
can do whatever it deems "necessary" to ensure it executes only what is
allowed. eg - a whitelist of binary paths. i see this as a lesser chance of a
hole.
I see what you're getting at now. We can get the pid of a wayland
client, though, and from that we can look at /proc/<pid>/cmdline, from which
we can get the binary path. We can even look at /proc/<pid>/exe and produce a
checksum of it, so that programs become untrusted as soon as they
change.
That means you have to recognize all interpreters, or you suddenly just
authorized all applications running with /usr/bin/python or such.

The PID -> /proc -> executable thing works only for a limited set of things.

However, forking in the compositor is secure against that. Assuming the
compositor knows what it wants to run, it creates a connection *before*
launching the app, and the app just inherits an already authorized
connection.

The general solution is likely with containers, as you said. That
I agree with.


Thanks,
pq
Carsten Haitzler (The Rasterman)
2016-03-30 04:35:12 UTC
Permalink
Post by Drew DeVault
what is allowed. eg - a whitelist of binary paths. i see this as a lesser
chance of a hole.
I see what you're getting at now. We can get the pid of a wayland
client, though, and from that we can look at /proc/<pid>/cmdline, from which
we can get the binary path. We can even look at /proc/<pid>/exe and produce a
checksum of it, so that programs become untrusted as soon as they
change.
you can do that... but there are race conditions. a pid can be recycled.
imagine some client just before it exits sends some protocol to request doing
something "restricted". maybe you even check on connect, but let's say this
child exits and you haven't gotten the disconnect on the fd yet because there
is still data to read in the buffer. you get the pid while the process is
still there, then it happens to exit.. NOW you check /proc/PID ... but in
the meantime the PID was recycled with a new process that is "whitelisted"
so you check this new replacement /proc/PID/exe and find it's ok and ok the
request from the old dying client... BOOM. hole.

it'd be better to use something like smack labels - but this is not used
commonly in linux. you can check the smack label on the connection and auth by
that, as the smack label can then be in a db of "these guys are ok if they have
smack label 'x'" and there is no race here. smack labels are like containers
and also affect all sorts of other access like to files, network etc.
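
checking the label is one getsockopt() away (sketch - SO_PEERSEC is the real
kernel interface; the whitelist db lookup is made up):

#include <sys/socket.h>

/* read the LSM label (smack label, selinux context, ...) of the peer on a
 * unix socket. the kernel records it at connect time, so there is no pid
 * recycling race. is_whitelisted() is a hypothetical db lookup. */
extern int is_whitelisted(const char *label);

static int peer_allowed(int sock_fd)
{
    char label[256];
    socklen_t len = sizeof(label) - 1;

    if (getsockopt(sock_fd, SOL_SOCKET, SO_PEERSEC, label, &len) < 0)
        return 0; /* no label available - deny */
    label[len] = '\0';
    return is_whitelisted(label);
}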

but the generic solution without relying on smack would be to launch yourself -
socketpair + pass fd. :) it has the lowest chance of badness. this works if the
client is a regular native binary (c/c++) or if it's a script because the fd
will happily pass on even if it's a wrapper shell script that then runs a binary.
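
i.e. something like this (sketch - wl_client_create() and the WAYLAND_SOCKET
convention are the real libwayland bits; whitelisting and error handling are
left out):

#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>
#include <sys/socket.h>
#include <wayland-server.h>

/* spawn a trusted helper with a pre-authorized connection. we made the fd
 * ourselves, so the child never needs to find a privileged socket in the
 * filesystem - and a wrapper script passes the fd through to whatever it
 * execs. */
static struct wl_client *spawn_trusted(struct wl_display *display,
                                       const char *path)
{
    int sv[2];
    char fdstr[16];

    if (socketpair(AF_UNIX, SOCK_STREAM, 0, sv) < 0)
        return NULL;

    switch (fork()) {
    case -1:
        return NULL;
    case 0: /* child: libwayland-client picks the fd up from WAYLAND_SOCKET */
        close(sv[0]);
        snprintf(fdstr, sizeof(fdstr), "%d", sv[1]);
        setenv("WAYLAND_SOCKET", fdstr, 1);
        execl(path, path, (char *)NULL);
        _exit(1);
    default: /* compositor: adopt our end as a client and mark it trusted */
        close(sv[1]);
        return wl_client_create(display, sv[0]);
    }
}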
Post by Drew DeVault
i know - but for just capturing screencasts, adding watermarks etc. - all
you need is to store a stream - the rest can be post-processed.
Correct, if you record to a file, you can deal with it in post. But
there are other concerns, like what output format you'd like to use and
what encoding quality you want, considering factors like disk
space, cpu usage, etc. And there still is the live streaming use-case,
which we should support and which your solution does not address.
given high enough quality any post process can also transcode to another
format/codec/quality level while adding watermarks etc. a compositor able to
stream out (to a file or whatever) video would of course have options for
basics like quality/bitrate etc. - the codec libraries will want this info
anyway...
Post by Drew DeVault
let's talk about the actual apps surfaces and where they go - not
configuration of outputs. :)
No, I mean, that's what I'm getting at. I don't want to talk about that
because it doesn't make sense outside of e. On Sway, the user is putting
their windows (fullscreen or otherwise) on whatever output they want
themselves. There aren't output roles. Outputs are just outputs and I
intend to keep it that way.
enlightenment ALSO puts windows "on the current screen" by default and you can
move them to another screen, desktop etc. as you like. hell it has the ability
to remember screen, desktop, geometry, and all sorts of other state and
re-apply it to the same window when it appears again. i use this often myself
to force apps to do what i want when they keep messing up.. i'm not talking
about manually moving things or the ability for a compositor/wm to override and
enforce its will.

i am talking about situations where you want things to "just work" out of the
box as they might be intended to without forcing the user to go manually say
"hey no - i want this". i'm talking about a situation like
powerpoint/impress/whatever where when i give a presentation on ONE screen i
have a smaller version of the slide, i also have the preview of the next slide,
a count-down timer for the slide talk, etc. and on the "presentation screen" i
get the actual full presentation. I should not have to "manually configure
this". impress/ppts/whatever should be able to open up 2 windows and
appropriately tag them for their purposes and the compositor then KNOWS which
screen they should go onto.

impress etc. also need to know that a presentation screen exists so it knows to
open up a special "presentation window" and a "control window" vs just a
presentation window. these windows are of course fullscreen ones - i think we
don't disagree there.

the same might go for games - imagine a nintendo DS setup. game has a control
window (on the bottom screen) and a "game window" on the top. similar to
impress presentation vs control windows. imagine a laptop with 2 screens. one
in the normal place and one where your keyboard would be... similar to the DS.
maybe we can talk flight simulators which may want to span 3 monitors
(left/middle/right); since different screens may run at different refresh
rates etc., you really likely want to have 3 windows (surfaces), each
fullscreen on its own monitor. how do we advertise to games that such a setup
exists and how would they request to lay out their left/middle/right windows
correctly?

what about when i have a phone plugged into a dock. it has 2 external hdmi "big
screens" and an internal phone screen. the internal should really behave in a
mobile way while the externals would be desktop-like. maybe an app (like
libreoffice) is not usable on a tiny screen. it should be able to say "my
window is only useful in desktop mode" or something. so when i run it - it
turns up on the appropriate screen. when the dialler app that handles phone
calls gets an incoming call and opens its window, you likely want it ON
the mobile display, not the desktop... etc.

i am just going on to give examples of how window metadata might be used to
have things go to the right place out of the box. if your wm/compositor allows
you to manually override then sure - it can say no and place the window where
it wants. it may HAVE to at times.
Post by Drew DeVault
or just have the compositor "work" without needing scripts and users to
have to learn how to write them. :)
Never gonna happen, man. There's no way you can foresee and code for
everyone's needs. I'm catching on to this point you're heading towards,
though: e doesn't intend to suit everyone's needs.
just improve the compositor then. that's what software development is about.
Post by Drew DeVault
Post by Drew DeVault
https://sr.ht/Ai5N.png
Most apps are fine with being told what resolution to be, and they
_need_ to be fine with this for the sake of my sanity. But I understand
that several applications have special concerns that would prevent this
but for THEIR sanity, they are not fine with it. :)
Nearly all toolkits are entirely fine with being any size, at least
above some sane minimum. A GUI that cannot deal with being a
user-specified size is a poorly written GUI.
it has nothing to do with the toolkit but with the app's window content. a
toolkit may be rendering/arranging it but the app has given you information
that the content is not useful below some size or above some size. if you want
to ignore this - then fine, but don't complain about the consequences and think
the solution is a floating hint. it is not. it's your bug in not respecting
these limitations a client has given you. :) it is your choice. :)
Post by Drew DeVault
no. this has nothing to do with floating. this has to do with minimum and in
this case especially - maximum sizes. it has NOTHING to do with floating.
you are conflating sizing with floating because floating is how YOU HAPPEN
to want to deal with it.
Fair. Floating is how I would deal with it. But maybe I'm missing
something: where do the min/max size hints come from? All I seem to
know of is the surface geometry request, which isn't a hint so much as
it's something every single app does. If I didn't ignore it, all windows
would be fucky and the tiling layout wouldn't work at all. Is there some
other hint coming from somewhere I'm not aware of?
in x11 there are explicit min/max hints. not so in wayland - not that i saw
last time i looked. what is done is they may request a surface geom. you may
respond by setting the surface to that geometry or some other. the app now
responds with a BUFFER rendered at NxM pixels. it may NOT match the geom you
set. this is basically the app disagreeing with your choice of geometry and
refusing to provide the geometry you asked for. this is the app giving you a
limit - you went beyond it and this buffer size is what the app can do.

it MAY be useful for apps to provide such hints in xdg shell. it means a
compositor knows AHEAD of time what these limits are before it hits one. x11
also supported aspect ratio hints - a bit tricky to get right - as well as
base size and size stepping (eg for terminals). some of this may be good to
bring to wayland, some not.
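
for reference, this is the x11 side i mean (real xlib api - the numbers are
invented for illustration):

#include <X11/Xlib.h>
#include <X11/Xutil.h>

/* min/max size, aspect ratio clamping, base size and resize stepping
 * (eg terminals growing by whole character cells). */
static void set_hints(Display *dpy, Window win)
{
    XSizeHints *h = XAllocSizeHints();

    h->flags = PMinSize | PMaxSize | PAspect | PBaseSize | PResizeInc;
    h->min_width = 200;  h->min_height = 120;
    h->max_width = 640;  h->max_height = 480;
    h->min_aspect.x = 4; h->min_aspect.y = 3; /* clamp aspect to 4:3 */
    h->max_aspect.x = 4; h->max_aspect.y = 3;
    h->base_width = 8;   h->base_height = 16;
    h->width_inc = 8;    h->height_inc = 16;  /* grow in cell-sized steps */
    XSetWMNormalHints(dpy, win, h);
    XFree(h);
}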
Post by Drew DeVault
you COULD deal with it as i described - pad out the area or
scale retaining aspect ratio - allow user to configure the response. if i
had a small calculator on the left and something that can size up on the
+---+------------+
| |............|
|:::|............|
|:::|............|
|:::|............|
| |............|
+---+------------+
Eh, this might be fine for a small number of windows, and maybe even is
the right answer for Sway. I'm worried about it happening for most
windows and I don't want to encourage people to make their applications
locked into one aspect ratio and unfriendly to tiling users.
MOST windows will have a minimum size, SOME will have a maximum size. that's
reality of things normally. often non-resizable dialog windows will have min
and max set to the same. i wouldn't worry about this as it is out of your
control - clients will decide. most will be resizable up and down to make you
happy. some will not. if you can deal nicely with "some" then your problems
will be solved.

and floating is another matter entirely. :)
Post by Drew DeVault
they can patch their compositors if they want. if you are forcing users to
write scripts you are already forcing them to "learn to code" in a simple
way. would it not be best to try and make things work without needing
scripts/custom code per user and have features/modes/logic that "just
work" ?
There's a huge difference between the skillset necessary to patch a
Wayland compositor to support scriptable output configuration and to
write a bash script that uses a tool the compositor shipped for this
purpose.
sure but 99% of users can't even manage a script. the 1% left can do scripting.
yes indeed 0.001% could patch the code. but the 99% are still out of luck
unless the compositor itself does things "nicely" and provides nice little
"checkboxes and sliders" in a gui to set it up (even that is scary for 90% of
people). be aware when i am saying people - i mean general population, not linux
geeks/nerds.
Post by Drew DeVault
*I* do not want ad hoc panels/taskbars/tools written by separate projects
within my DE because they cause more problems than they solve. been there.
done that. not going back. i learned my lesson on that years ago. for them
to be fully functional you have to have pagers and taskbars in them, and
unless you ALSO bind all this metadata for the pagers, virtual
desktops and their content to a protocol that is also universal, then it's
rather pointless. this then ties your desktop to a specific design of how
desktops are (eg NxM grids and only ONE of those in an entire environment),
when with enlightenment each screen has an independent NxM grid PER SCREEN
that can be switched separately.
Again, the scope of this is not increasing ad nauseam. I never brought
virtual desktops and pagers into the mix. There is a small number of
things that are clearly the compositor's responsibility and that small
list is the only things I want to manipulate with a protocol. Handling
screen capture hardly has room for innovation - there are pixels on
screen, they need to be given to ffmpeg et al. This isn't locking you
into some particular user-facing design choice in your DE.
the point of these dmenus/panels is to contain such controls - it happens that
dmenu does not do this but most instances do. the intent of these is to act as
non-integrated parts of a desktop. they function as a desktop component - eg are
always there from login.
Post by Drew DeVault
Post by Drew DeVault
I'm not suggesting anything radical to try and cover all of these use
cases at once. Sway has a protocol that lets a surface indicate it wants
to be docked somewhere, which allows for custom taskbars and things like
dmenu and so on to exist pretty easily, and this protocol is how swaybar
happens to be implemented. This doesn't seem very radical to me, it
doesn't enforce anything on how each of the DEs choose to implement
their this and that.
then keep your protocol. :) i know i have no interest in supporting it - as
above. :)
Well, so be it.
Post by Drew DeVault
We've both used this same argument from each side multiple times, it's
There aren't necessarily enough people to work on the features I'm
proposing right now. I don't think anyone needs to implement this _right
now_. There also aren't ever enough people to give every little feature
of their DE the attention that leads to software that is as high quality
as a similar project with a single focus on that one feature.
that is true. :)
Interesting that this immediately follows the last paragraph. If you
acknowledge that your implementation of desktop feature #27 can't
possibly be as flexible/configurable/usable/good as some project that's
entirely focused on just making that one feature great, then why would
you refuse to implement the required extensibility for your users to
bring the best tools available into your environment?
because i have implemented extensibility many times over in the past 20 years.
i've come to the conclusion that they create a poor user experience with
loosely integrated components that either look ugly, don't work like the rest of
the de or do horrible hacks that then create trouble. what does work well is
tight integration. the manpower we have i'd RATHER devote to making
things better out of the box and having features than just saying "bah - we
give up and hope someone else will do it". every time i have done this, it has
led to sub-optimal or poor results. you give up solving a problem and instead
then rely on 3rd party tools that don't look right, or function well, or
integrate or then don't support things YOU want to do later on (eg like the
per-screen profiles in screen output config).

maybe YOU want to do it that way - fine. that's your choice, but most other
DE's are integrated. They work on/provide their own tools and code and logic. :)
Post by Drew DeVault
i disagree. i can't take linux and just use some bsd device driver with it
- oh dear. that's against the spirit of free software! i have to port it and
integrate it (as a kernel module). wayland is about making the things that
HAVE to be shared protocol just that. the things that don't absolutely have
to be, we don't. you are able to patch, modify and extend your de/wm, all
you like - most de's provide some way to do this. gnome today uses js. e
uses loadable modules. i am unsure about kde. :)
Sure, but you can use firefox and vim and urxvt while your friend
prefers termite and emacs and chromium, and your other friend uses gedit
and gnome-terminal and surf.
big difference - "apps" vs "desktop". of course this line is a grey area. i
consider the line at shelves/panels/filemanager/settings for desktop and
system/desktop bg/wallpaper/config tools/virtual keyboards/wm+compositor those
are on the desktop side. browser, terminals, editors are firmly in "apps" land.
it may be that your de of choice provides apps that work with the
look/feel/philosophy/toolkit of your de - but they are separate. that is where
i draw the line.
Post by Drew DeVault
what happens when you need to restart sway after some development? where do
all your terminals/editors/ide's, browsers/irc clients go? they vanish and
you have to re-run them?
Most of my users aren't developers working on sway all the time. Sway
has an X backend like Weston, I use that to run nested sways for
development so I'm not restarting Sway all the time. The compositor
crashing without losing all of the clients is a pipe dream imo, I'm not
going to look into it for now.
then you are relying on x to do development - you can never get rid of x11,
ever, then...

i don't see it as a pipe dream. all you need is the ability to recognize a
client and its surfaces from a previous connection and have clients reconnect
and provide whatever information is necessary to restore that state (eg an id
of some sort).
Post by Drew DeVault
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
aaah ok. so compositor adapts. then likely i would express this as a
"minimize your decorations" protocol from compositor to client, client
to compositor then responds similarly like "minimize your decorations"
and compositor MAY choose to not draw a shadow/titlebar etc. (or client
responds with "ok" and then compositor can draw all it likes around the
app).
I think Jonas is on the right track here. This sort of information could
go into xdg_*. It might not need an entire protocol to itself.
i'd lean on a revision of xdg :)
I might lean the other way now that I've seen that KDE has developed a
protocol for this. I think that would be a better starting point since
it's proven and already in use. Thoughts?
if you plan on it becoming universal - plan for xdg. if you want to keep it
private or experiment locally- make it a separate protocol.
Post by Drew DeVault
... you might be surprised. 4k ones are already out there. ok. not 1.3ghz -
2ghz - but no way you can capture even 4k with the highest end arms unless
you avoid conversion. you keep things in yuv space and drop your bandwidth
requirements hugely. in fact you never leave yuv space and make use of the
hw layers and the video decoder decodes directly into scanout buffers. you
MAY be able to stuff the yuv buffers back into an encoder and re-encode
again ... just. but it'd be better not to decode AND encode but take the
mp4/whatever stream directly and shuffle it down the network pipe. :)
believe it or not TODAY tablets with 4k screens ship. you can buy them. they
are required to support things like miracast (mp4/h264 stream over wifi).
it's reality today. products shipping in the 100,000's and millions. :)
Eh, alright. So they'll exist soon. I feel like both strategies can
coexist, in that case. If you want to livestream your tablet, you'll
have a performance hit and it might just be unavoidable. If you just
want to record video, use the compositor's built in thingy. I'm okay
with unavoidable performance concerns in niche situations - most people
aren't going to be livestreaming from their tablet pretty much ever.
Most people aren't even going to be screen capturing on their tablet to
be honest. It goes back to crippling the common case for the sake of the
niche case.
it's a performance hit for EVERYONE if you do unneeded transforms (scaling,
colorspace conversion etc.). ;)
--
------------- Codito, ergo sum - "I code, therefore I am" --------------
The Rasterman (Carsten Haitzler) ***@rasterman.com
Simon McVittie
2016-03-31 11:20:27 UTC
Permalink
Post by Drew DeVault
I see what you're getting at now. We can get the pid of a wayland
client, though, and from that we can look at /proc/<pid>/cmdline, from which
we can get the binary path.
This line of thinking is a trap: rummaging in /proc/$pid is not suitable
for use as a security mechanism. If a client can queue up a malicious
privileged action (stuff it into the socket's send-buffer), and then
race with the compositor to exec() something that would legitimately be
allowed to take that action before the request is processed, then you lose.

See <https://bugs.freedesktop.org/show_bug.cgi?id=83499> for details of
the equivalent in D-Bus. Mainline dbus on Unix was never vulnerable to
this (because we use credentials-passing to get the uid), but there used
to be an out-of-tree LSM integration patch set (for Maemo) that was.
(That bug was about documenting the attack so that we never accidentally
introduce it.)

If you want to map processes to executable-based privilege domains in a
way that cannot be faked, you will have to use their LSM labels
(SELinux, Smack or other xattr-based labelling, or AppArmor or other
path-based labelling) which are specifically designed to do this. A
Wayland equivalent of D-Bus' GetConnectionCredentials() would probably
be useful.
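
(For what it's worth, libwayland-server already exposes the kernel's
SO_PEERCRED triple per client - sketch below. What it lacks relative to
GetConnectionCredentials() is exactly the LSM label, which would need
something like SO_PEERSEC.)

#include <stdio.h>
#include <wayland-server.h>

/* What exists today: per-client credentials from SO_PEERCRED.
 * pid/uid/gid only - no LSM label, hence the gap described above. */
static void log_credentials(struct wl_client *client)
{
    pid_t pid;
    uid_t uid;
    gid_t gid;

    wl_client_get_credentials(client, &pid, &uid, &gid);
    fprintf(stderr, "client pid=%d uid=%d gid=%d\n",
            (int)pid, (int)uid, (int)gid);
}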

S
--
Simon McVittie
Collabora Ltd. <http://www.collabora.com/>
Daniel Stone
2016-03-29 10:45:08 UTC
Permalink
Hi,
Post by Drew DeVault
You don't provide any justification for this, you just say it like it's
gospel, and it's not. I will again remind you that not everyone wants to
buy into a desktop environment wholesale. They may want to piece it
together however they see fit and it's their god damn right to. Anything
else is against the spirit of free software.
I only have a couple of things to add, since this thread is so long,
so diverse, and so shouty that it's long past the point of
usefulness.

Firstly, https://www.redhat.com/archives/fedora-devel-list/2008-January/msg00861.html
is a cliché, but the spirit of free software is empowering people to
make the change they want to see, rather than requiring the entire
world be perfectly isolated and abstracted along inter-module
boundaries, freely mix-and-matchable.

Secondly, you talk about introducing all these concepts and protocols
as avoiding complexity. Nothing could be further from the case. That
X11 emulates this model means that it has Xinerama, XRandR,
XF86VidMode, the ICCCM, and NetWM/EWMH, as well as all the various
core protocols. You're not avoiding complexity, but simultaneously
shifting and adding to it. You're not avoiding policy to create
mechanism; the structure and design of the mechanism is a policy in
itself.

Thirdly, it's important to take a step back. 'Wayland doesn't support
middle-button primary selections' is a feature gap compared to X11;
'Wayland doesn't have XRandR' is not. Sometimes it seems like you miss
the forest of user-visible behaviour for the trees of creating
protocol.

Fourthly, I think you misunderstand the role of what we do. If you
want to design and deploy a modular framework for Legoing your own
environment together, by all means, please do that. Give it a go, see
what falls out, see if people creating arbitrary external panels and
so find it useful, and then see if you can convince the others to
adopt it. But this isn't really the place for top-down design where we
dictate how all environments based on Wayland shall behave.

I don't really hold out hope for this thread, but would be happy to
pick up separate threads on various topics, e.g. screen
capture/streaming to external apps.

Cheers,
Daniel
Drew DeVault
2016-03-29 12:24:03 UTC
Permalink
Post by Daniel Stone
Firstly, https://www.redhat.com/archives/fedora-devel-list/2008-January/msg00861.html
is a cliché, but the spirit of free software is empowering people to
make the change they want to see, rather than requiring the entire
world be perfectly isolated and abstracted along inter-module
boundaries, freely mix-and-matchable.
I should rephrase: it's against the spirit of Unix. Simple, composable
tools that Do One Thing And Do It Well are the Unix way. Our desktop
environments needn't and shouldn't be much different.
Post by Daniel Stone
Secondly, you talk about introducing all these concepts and protocols
as avoiding complexity. Nothing could be further from the case. That
X11 emulates this model means that it has Xinerama, XRandR,
XF86VidMode, the ICCCM, and NetWM/EWMH, as well as all the various
core protocols. You're not avoiding complexity, but simultaneously
shifting and adding to it. You're not avoiding policy to create
mechanism; the structure and design of the mechanism is a policy in
itself.
I disagree. I think this is just a fundamental difference of opinion.
Post by Daniel Stone
Thirdly, it's important to take a step back. 'Wayland doesn't support
middle-button primary selections' is a feature gap compared to X11;
'Wayland doesn't have XRandR' is not. Sometimes it seems like you miss
the forest of user-visible behaviour for the trees of creating
protocol.
I think you're missing what users are actually using. You'd be surprised
at how many power users are comfortable working with tools like xrandr
and scripting their environments. This is about more than just
xrandr-like support, too. There's definitely a forest of people using
screen capture for live streaming, for instance.
Post by Daniel Stone
Fourthly, I think you misunderstand the role of what we do. If you
want to design and deploy a modular framework for Legoing your own
environment together, by all means, please do that. Give it a go, see
what falls out, see if people creating arbitrary external panels and
so find it useful, and then see if you can convince the others to
adopt it. But this isn't really the place for top-down design where we
dictate how all environments based on Wayland shall behave.
I've already seen this. It's been around for a long time. I don't know
if you live in a "desktop environment bubble", but there's a LOT of this
already in practice in the lightweight WM world. Many, many users, are
using software like i3 and xmonad and herbstluftwm and openbox and so on
with composable desktop tools like dmenu and i3bar and lemonbar and so
on _today_. This isn't some radical experiment in making a composable
desktop. It's already a well proven idea, and it works great. I would
guess that the sum of people who are using a desktop like this
perhaps outnumbers the total users of, say, enlightenment. I'm just
bringing the needs of this group forward.

Some of your email is just griping about the long life of this thread,
and you're right. I think I've got most of what I wanted from this
thread, I'm going to start proposing some protocols in new threads next.

--
Drew DeVault
Daniel Stone
2016-03-29 13:22:34 UTC
Permalink
Hi,
Post by Drew DeVault
Post by Daniel Stone
Firstly, https://www.redhat.com/archives/fedora-devel-list/2008-January/msg00861.html
is a cliché, but the spirit of free software is empowering people to
make the change they want to see, rather than requiring the entire
world be perfectly isolated and abstracted along inter-module
boundaries, freely mix-and-matchable.
I should rephrase: it's against the spirit of Unix. Simple, composable
tools that Do One Thing And Do It Well are the Unix way. Our desktop
environments needn't and shouldn't be much different.
And yet the existence and dominant popularity of large integrated
environments (historically beginning with Emacs) suggests that the
pithy summary is either wrong, or no longer applicable. Ditto the
relative successes of Plan 9 and microkernels compared to other OSes.
Post by Drew DeVault
Post by Daniel Stone
Secondly, you talk about introducing all these concepts and protocols
as avoiding complexity. Nothing could be further from the case. That
X11 emulates this model means that it has Xinerama, XRandR,
XF86VidMode, the ICCCM, and NetWM/EWMH, as well as all the various
core protocols. You're not avoiding complexity, but simultaneously
shifting and adding to it. You're not avoiding policy to create
mechanism; the structure and design of the mechanism is a policy in
itself.
I disagree. I think this is just a fundamental difference of opinion.
I really do not see how you can look at ICCCM/EWMH and declare it to be
a victory for simplicity, and ease of implementation.
Post by Drew DeVault
Post by Daniel Stone
Thirdly, it's important to take a step back. 'Wayland doesn't support
middle-button primary selections' is a feature gap compared to X11;
'Wayland doesn't have XRandR' is not. Sometimes it seems like you miss
the forest of user-visible behaviour for the trees of creating
protocol.
I think you're missing what users are actually using. You'd be surprised
at how many power users are comfortable working with tools like xrandr
and scripting their environments. This is about more than just
xrandr-like support, too. There's definitely a forest of people using
screen capture for live streaming, for instance.
Yes, screen capture is vital to have.

Providing some of the functionality (application fullscreening,
including to potentially different sizes/modes than are currently set;
user display control) that RandR does is also vital. Providing an
exact clone of XRandR ('let's provide one protocol that allows any
arbitrary application to do what it likes'), much less so.

I also posit that anyone suggesting that providing the full XRandR
suite to arbitrary users makes implementation more simple has never
been on the sharp end of that implementation.
Post by Drew DeVault
Post by Daniel Stone
Fourthly, I think you misunderstand the role of what we do. If you
want to design and deploy a modular framework for Legoing your own
environment together, by all means, please do that. Give it a go, see
what falls out, see if people creating arbitrary external panels and
so find it useful, and then see if you can convince the others to
adopt it. But this isn't really the place for top-down design where we
dictate how all environments based on Wayland shall behave.
I've already seen this. It's been around for a long time. I don't know
if you live in a "desktop environment bubble", but there's a LOT of this
already in practice in the lightweight WM world. Many, many users are
using software like i3 and xmonad and herbstluftwm and openbox and so on
with composable desktop tools like dmenu and i3bar and lemonbar and so
on _today_.
Yes I know, as a former long-term Awesome/OpenBox/etc etc etc etc etc user.
Post by Drew DeVault
This isn't some radical experiment in making a composable
desktop. It's already a well proven idea, and it works great.
Again, I don't know in what parallel universe ICCCM+EWMH are 'great', but OK.
Post by Drew DeVault
I would
guess that the sum of people who are using a desktop like this
perhaps outnumbers the total users of, say, enlightenment. I'm just
bringing the needs of this group forward.
I would suggest the total number of users of these 'power'
environments allowing full flexibility and arbitrary external control
(but still via entirely standardised protocols) is several orders of
magnitude smaller than the combined total of Unity, GNOME and KDE, but I
don't think this thread really needs any more value judgements.

My point is that there is no solution for this existing _on Wayland_
today, something which I would've thought to be pretty inarguable,
since that's what this entire thread is ostensibly about. I know full
well that this exists on X11, and that there are users of the same,
but again, you are talking about creating the same functionality as a
generic Wayland protocol, so it's pretty obvious that it doesn't exist
today.

What I was trying to get at, before this devolved into angrily trying
to create division based on preference, was - well, look at the EWMH
author list here:
https://specifications.freedesktop.org/wm-spec/wm-spec-latest.html#idm140200472428352

How many of those people are core X11 developers?

The EWMH evolved from a group of desktop developers who banded
together around common needs, and in large part standardised the
support they already had for composed environments, and also built on
the existing standard of the ICCCM. In this case, there is no relevant
ICCCM to build on, and you're attempting to reverse the EWMH process:
to build a top-down protocol and enforce it as 'this is how Wayland
works and everyone will love it', rather than building something up
which works for multiple implementations and attempting to share that
a bit more widely. For bonus points, this entire thread has already
been more pointlessly adversarial than the entire EWMH process.

Trying to do this under the general Wayland umbrella won't really fly.
xdg_shell was essentially developed as a separate project, by the
people who were very much involved in desktop development, and you'd
need to do the same for the various ideas of yours which aren't
strictly core Wayland, and build upwards from there.
Post by Drew DeVault
Some of your email is just griping about the long life of this thread,
and you're right. I think I've got most of what I wanted from this
thread; I'm going to start proposing some protocols in new threads next.
\o/

Cheers,
Daniel
Jasper St. Pierre
2016-03-30 07:06:26 UTC
Permalink
Post by Drew DeVault
Post by Daniel Stone
Thirdly, it's important to take a step back. 'Wayland doesn't support
middle-button primary selections' is a feature gap compared to X11;
'Wayland doesn't have XRandR' is not. Sometimes it seems like you miss
the forest of user-visible behaviour for the trees of creating
protocol.
I think you're missing what users are actually using. You'd be surprised
at how many power users are comfortable working with tools like xrandr
and scripting their environments.
I've removed myself from the protocol talk so far, but I have to call
this one out. XRandR might be one of the most unfortunate APIs I have
ever dealt with, on both sides of the equation.

* It deals with "outputs" by "index", implying that outputs are static
and ordered. This is not the case in today's equipment with laptop
lids and docks and tons of ports.

* There's *no* way to specify whether something is a temporary display
configuration or should be saved. I plug and unplug external monitors
on my laptop every day, but I don't want a second output to always
behave the same way. Projectors should be in mirror mode. So already
you have multiple configurations, keyed by EDIDs (see the sketch after
this list).

* The authors wanted to make hotplug work even when nothing was poking
XRandR, but this just meant that desktops that do store complex
configuration had to wait until XRandR auto-reconfigured before saying
"no, bad computer" and overwriting it with the configuration they
wanted. Two mode-sets for the price of one.

* The command-line tool made it easy for users to poke the X server
directly, bypassing the DE entirely, leading to cases where the
Settings panel showed bizarre, inconsistent results because the
intended configuration wasn't updated for the manual changes the user
made.

* In some cases, like touchscreens, you *need* input to be mapped to
screen rotation and orientation. Input mapping was half-bolted onto
XInput and XRandR as an after-thought.

* Games which wanted to change the resolution often did it through
XRandR. These rarely worked if users had a complex configuration that
used rotated outputs, etc., or even just had more than one monitor,
leaving users with broken configurations. If the game crashed, users
were stuck with a permanently small screen.

* Similarly to the above, applications which want to react to
resolution changes (e.g. a window manager which wants to resize
windows, or a desktop that wants to reorder desktop icons) are unaware
whether such a change is temporary or permanent. The result is that all
your desktop icons got put in a 640x480 box after you launched a game.

* Not to mention that the only event you get out of XRandR is an
all-encompassing "quick! something changed!!" event, which doesn't
even tell you whether it was simply acknowledging that the
configuration you just made went through successfully, whether it was
an auto-configure from a hotplug, or whether it was some other program
poking the API.

* A partial repeat of the above: XRandR was intended as a low-level
"mechanism, not policy" API, but quickly got policy bolted on
after-the-fact because users weren't running tools which actually
supplied the policy. I am very skeptical of users who try to
lego-brick their way through DEs because "it's all bloat, I don't
really need a window manager, I can just skirt along with raw X11"
(because we committed ourselves to making it half-work) and I don't
want to encourage this behavior in Wayland. Let's do it right and
mandate real policy.

(This also doesn't even touch the incredibly unfortunate hacks [0] we
have had to do at Endless to support HDMI TVs that need underscan
support, which work by changing the mode-list at runtime based on a
configurable border... which is specified in the mode's XSkew field,
because we didn't have any better place to put it)
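
As a toy sketch in C of the "configurations keyed by EDIDs" point in
the list above; every name here is illustrative, not an existing API:

#include <string.h>

#define MAX_OUTPUTS 8

/* One saved layout for one particular set of connected monitors,
 * identified by a hash over the sorted, concatenated EDID blobs. */
struct saved_layout {
    char edid_set_hash[65];
    struct { int x, y, width, height, refresh; } outputs[MAX_OUTPUTS];
    int n_outputs;
};

/* On hotplug, check whether this exact combination of monitors has
 * been seen before; if not, the DE applies its default policy (e.g.
 * mirror mode for a projector) instead of whatever was set last. */
static const struct saved_layout *
find_layout(const struct saved_layout *saved, int n_saved,
            const char *edid_set_hash)
{
    for (int i = 0; i < n_saved; i++)
        if (strcmp(saved[i].edid_set_hash, edid_set_hash) == 0)
            return &saved[i];
    return NULL;
}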

We can talk about independent protocols and APIs for each of these use
cases (with no guarantee that Wayland is the IPC mechanism at hand),
but let's not bolt on a "wl_randr" that doesn't even begin to solve
the basic problems at hand just because users run xrandr today and we
have to support that use case.

[0] https://github.com/endlessm/xf86-video-intel/commit/391771f1652477863ece6da90b81dddb3ecb148a
--
Jasper
Drew DeVault
2016-03-30 23:16:17 UTC
Permalink
Simply because xrandr was/is a poorly implemented mess doesn't mean that
we are going to end up making a poorly implemented mess. We have the
benefit of hindsight. After all, xorg is a poorly implemented mess but
we still made Wayland, didn't we? (Though some could argue that we've
just ended up with a well implemented mess...)

--
Drew DeVault
Daniel Stone
2016-03-31 09:20:17 UTC
Permalink
Hi,
Post by Drew DeVault
Simply because xrandr was/is a poorly implemented mess doesn't mean that
we are going to end up making a poorly implemented mess. We have the
benefit of hindsight. After all, xorg is a poorly implemented mess but
we still made Wayland, didn't we? (Though some could argue that we've
just ended up with a well implemented mess...)
X and Wayland protocols have very different design principles guiding
them. X (often by necessity) exposes as much as possible of its
internal workings to clients, and allows total external manipulation.
That's not the case for Wayland, so what you're proposing is a
significant departure.

Cheers,
Daniel
Pekka Paalanen
2016-03-29 11:24:21 UTC
Permalink
On Tue, 29 Mar 2016 00:01:00 -0400
Post by Drew DeVault
Post by Carsten Haitzler (The Rasterman)
my take on it is that it's premature and not needed at this point. in fact i
wouldn't implement a protocol at all. *IF* i were to allow special access, i'd
simply require to fork the process directly from compositor and provide a
socketpair fd to this process and THAT fd could have extra capabilities
attached to the wl protocol. i would do nothing else because as a compositor i
cannot be sure what i am executing. i'd hand over the choice of being able to
execute this tool to the user to say ok to and not just blindly execute
anything i like.
I don't really understand why forking from the compositor and bringing
along the fds really gives you much of a gain in terms of security. Can
you elaborate on how this changes things? I should also mention that I
don't really see the sort of security goals Wayland has in mind as
attainable until we start doing things like containerizing applications,
in which case we can eliminate entire classes of problems from this
design.
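
As a minimal sketch of the mechanism Carsten describes, assuming
libwayland-server (wl_client_create() and the WAYLAND_SOCKET
convention are standard; the helper path and what "extra capabilities"
means are left to the compositor):

#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>
#include <sys/socket.h>
#include <wayland-server.h>

/* The compositor forks a helper it chose to trust, hands it one end
 * of a socketpair, and creates the wl_client from the other end. The
 * client library picks the fd up from WAYLAND_SOCKET instead of
 * connecting to the named socket. */
static struct wl_client *
launch_trusted_client(struct wl_display *display, const char *helper)
{
    int sv[2];

    if (socketpair(AF_UNIX, SOCK_STREAM, 0, sv) < 0)
        return NULL;

    switch (fork()) {
    case -1:
        close(sv[0]);
        close(sv[1]);
        return NULL;
    case 0: {
        char fd_str[16];
        close(sv[0]);
        snprintf(fd_str, sizeof fd_str, "%d", sv[1]);
        setenv("WAYLAND_SOCKET", fd_str, 1);
        execl(helper, helper, (char *)NULL);
        _exit(1);
    }
    default:
        close(sv[1]);
        /* The compositor now knows exactly which wl_client came from
         * the helper and can attach elevated capabilities to it, e.g.
         * permission to bind a screen-capture interface. */
        return wl_client_create(display, sv[0]);
    }
}

The gain Carsten seems to be pointing at is that identity is
established by construction (the compositor created the connection
itself) rather than by asking an untrusted peer who it is.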
I'm snipping out a lot of the output configuration related stuff from
this response. I'm not going to argue very hard for a common output
configuration protocol. I've been trying to change gears on the output
discussion towards a discussion around whether or not the
fullscreen-shell protocol supports our needs and whether or how it needs
to be updated wrt permissions.
I sense there is a misunderstanding here, that I want to correct.

The fullscreen-shell protocol is completely irrelevant here. It has
been designed to be mutually exclusive with a desktop protocol suite.

The original goal for the fullscreen-shell is to be able to use a
ready-made compositor, Weston in particular, as a hardware
abstraction layer for a single application. We of course have some demo
programs that use it so we can test it.

That single application would often be a DE compositor, perhaps a small
project which does not want to deal with all the KMS and other APIs but
would rather concentrate on making a good DE, at the expense of the
slight overhead that using a middle-man compositor brings.

Now that we have decided that libweston is a good idea, I would assume
this use case may disappear eventually.

There are also no permission issues wrt the fullscreen shell
protocol. The compositor exposing the fullscreen shell interface expects
only a single client ever, or works a bit like the VTs in that only a
single client can be active at a time. Ordinarily you set up the
application such that the parent compositor is launched as part of the
app launch, and nothing else can even connect to the parent compositor.

Fullscreening windows on a desktop has absolutely nothing to do with
the fullscreen shell. Fullscreen shell is not available on compositors
configured for desktop. This is how it was designed and meant to be.
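
For concreteness, the client side of that single-client arrangement is
tiny; a sketch assuming the header generated from
fullscreen-shell-unstable-v1.xml:

#include <wayland-client.h>
#include "fullscreen-shell-unstable-v1-client-protocol.h"

/* The nested DE compositor (the lone client) hands its composited
 * output to the parent compositor for scanout, one surface per
 * output; there is no window management and no second client. */
static void
present(struct zwp_fullscreen_shell_v1 *shell,
        struct wl_surface *surface, struct wl_output *output)
{
    zwp_fullscreen_shell_v1_present_surface(
        shell, surface,
        ZWP_FULLSCREEN_SHELL_V1_PRESENT_METHOD_DEFAULT,
        output);
}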


Thanks,
pq
Drew DeVault
2016-03-29 12:16:01 UTC
Permalink
This was a mistake on my part. I mixed up the two protocols; I don't
intend to make any changes to fullscreen-shell. Sorry for the confusion.
Giulio Camuffo
2016-03-28 06:08:55 UTC
Permalink
Post by Drew DeVault
I think that these are the core protocols necessary for
cross-compositor compatibility and to support most existing tools for
X11 like ffmpeg. Considering the security goals of Wayland, it will also
likely be necessary to implement some kind of protocol for requesting
and granting sensitive permissions to clients.
On this, see https://lists.freedesktop.org/archives/wayland-devel/2015-November/025734.html
I have not been able to continue on that, but if you want to, feel
free to grab that proposal.


Cheers,
Giulio
Drew DeVault
2016-03-28 13:04:48 UTC
Permalink
Post by Giulio Camuffo
On this, see https://lists.freedesktop.org/archives/wayland-devel/2015-November/025734.html
I have not been able to continue on that, but if you want to, feel
free to grab that proposal.
I looked through this protocol and it seems like it's a good start. We
should base our work on this.

--
Drew DeVault
Martin Peres
2016-03-28 20:04:04 UTC
Permalink
Post by Drew DeVault
Post by Giulio Camuffo
On this, see https://lists.freedesktop.org/archives/wayland-devel/2015-November/025734.html
I have not been able to continue on that, but if you want to, feel
free to grab that proposal.
I looked through this protocol and it seems like it's a good start. We
should base our work on this.
Being able to send accept/deny messages to the clients asynchronously
really will be needed for making good UIs.

We need to be able to revoke rights and add them on the fly.

Martin
Pekka Paalanen
2016-03-29 09:17:55 UTC
Permalink
On Mon, 28 Mar 2016 09:08:55 +0300
Post by Giulio Camuffo
Post by Drew DeVault
I think that these are the core protocols necessary for
cross-compositor compatibility and to support most existing tools for
X11 like ffmpeg. Considering the security goals of Wayland, it will also
likely be necessary to implement some kind of protocol for requesting
and granting sensitive permissions to clients.
On this, see https://lists.freedesktop.org/archives/wayland-devel/2015-November/025734.html
I have not been able to continue on that, but if you want to, feel
free to grab that proposal.
Hi,

I may have had negative opinions about some things in Giulio's
proposal, but I have changed my mind since then. I'd be happy to see it
developed further, understanding that it does not aim to solve the
question of authentication but only communicating the authorization,
for now.


Thanks,
pq
Peter Hutterer
2016-03-28 23:23:13 UTC
Permalink
Post by Drew DeVault
Broadly speaking, I am looking to create protocols for the following
- Screen capture
- Output configuration
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
- Input device configuration
a comment on the last point: input device configuration is either extremely
simple ("I want tapping enabled") or complex ("This device needs feature A
when condition B is met"). There is very little middle ground.

as a result, you either have some generic protocol that won't meet the niche
cases or you have a complex protocol that covers all the niche cases but
ends up being just a shim between the underlying implementation and the
compositor. Such a layer provides very little benefit but restricts what the
compositor can add in the future. It's not a good idea, imo.

Cheers,
Peter
Martin Graesslin
2016-03-29 08:15:56 UTC
Permalink
Post by Peter Hutterer
Post by Drew DeVault
- Screen capture
- Output configuration
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
- Input device configuration
a comment on the last point: input device configuration is either extremely
simple ("I want tapping enabled") or complex ("This device needs feature A
when condition B is met"). There is very little middle ground.
as a result, you either have some generic protocol that won't meet the niche
cases or you have a complex protocol that covers all the niche cases but
ends up being just a shim between the underlying implementation and the
compositor. Such a layer provides very little benefit but restricts what
the compositor can add in the future. It's not a good idea, imo.
I agree. I think that's something best left to the respective
compositor-specific configuration modules.

Cheers
Martin
Jonas Ådahl
2016-03-29 02:30:15 UTC
Permalink
Post by Drew DeVault
I want to start by establishing the requirements for these protocols.
Broadly speaking, I am looking to create protocols for the following
I'm just going to put down my own personal thoughts on these. I mostly
agree with Carsten on all of this. In general, my opinion is that it is
completely pointless to add Wayland protocols for things that have
nothing to do with Wayland whatsoever; we have other display protocol
agnostic methods for that which fit much better.

As a rule of thumb for whether a feature needs a Wayland protocol or
not, one can consider whether a client needs to reference a client-side
object (such as a surface) on the server. If it does, we should add
a Wayland protocol; otherwise not. Another way of seeing it would be
"if this could be shared between Wayland/X11/Mir/..., then don't do it
in any of those".
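
Jonas's rule of thumb is easy to illustrate in code; a sketch assuming
the usual wayland-client and (2016-era, unstable) xdg-shell generated
headers, with the shell and surface bound elsewhere:

#include <wayland-client.h>
#include "xdg-shell-client-protocol.h" /* generated by wayland-scanner */

/* Assigning a role references a client-side object (the wl_surface),
 * so it can only be expressed as a Wayland request: */
static struct xdg_surface *
make_toplevel(struct xdg_shell *shell, struct wl_surface *surface)
{
    return xdg_shell_get_xdg_surface(shell, surface);
}

/* By contrast, "set output HDMI-1 to 1920x1080" names only server-side
 * state; no client object is involved, so any IPC could carry it. */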
Post by Drew DeVault
- Screen capture
Why would this ever be a Wayland protocol? If a client needs to capture
its own content it doesn't need to ask the compositor; otherwise it's
the job of the compositor. If there needs to be a complex pipeline setup
that adds subtitles, muxing, sound effects and what not, we should make
use of existing projects that intend to create inter-process video
pipelines (pinos[0] for example).

FWIW, I believe remote desktop/screen sharing support partly falls under
this category as well, with the exception that it may need input event
injection as well (which of course shouldn't be a Wayland protocol).

As a side note, for GNOME, I have been working on an org.gnome-prefixed
D-Bus protocol for remote desktop that enables the actual remote desktop
things to be implemented in a separate process by providing pinos
streams, and I believe that at some point it would be good to have an
org.freedesktop.* (or equivalent) protocol doing that in a more desktop
agnostic way. Such a protocol could just as well be read-only, and
passed to something like ffmpeg (maybe even piped from gst-launch
directly to ffmpeg if you so wish) in order to do screen recording.
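
The D-Bus direction Jonas describes might look something like the
following from the consumer side; the bus name, object path, interface
and method are hypothetical placeholders, only the GDBus calls are
real API:

#include <gio/gio.h>

/* Ask the compositor to start a capture stream; it answers with a
 * stream identifier (e.g. a pinos stream) that a consumer such as
 * ffmpeg or gstreamer can then attach to. */
static char *
start_screencast(GError **error)
{
    GDBusConnection *bus = g_bus_get_sync(G_BUS_TYPE_SESSION, NULL, error);
    if (!bus)
        return NULL;

    GVariant *ret = g_dbus_connection_call_sync(
        bus,
        "org.freedesktop.ScreenCast",  /* hypothetical */
        "/org/freedesktop/ScreenCast", /* hypothetical */
        "org.freedesktop.ScreenCast",  /* hypothetical */
        "Start",
        NULL, G_VARIANT_TYPE("(s)"),
        G_DBUS_CALL_FLAGS_NONE, -1, NULL, error);

    char *stream = NULL;
    if (ret) {
        g_variant_get(ret, "(s)", &stream);
        g_variant_unref(ret);
    }
    g_object_unref(bus);
    return stream;
}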
Post by Drew DeVault
- Output configuration
This has nothing to do with Wayland either. If there is any need for
various compositors to support third party output configuration, at
least make it display protocol agnostic (D-Bus) so that there doesn't
have to be one implementation in each layer for each display protocol
when there is actually no point in doing so.
Post by Drew DeVault
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
Sounds like a job for an xdg_* protocol. However, I think we need to
first settle on a bare minimum protocol, in order to be able to
stabilize anything. This bare minimum protocol needs to allow for
extensibility, making it possible to add things like negotiating how
decorations are drawn etc. The idea is that xdg_shell v6 will allow
this.

Of course we can add xdg_shell extensions already though (i.e.
stand-alone protocol extensions that extend xdg_shell).
Post by Drew DeVault
- Input device configuration
Same as output configuration. There simply is no valid reason for adding
a Wayland protocol for it.
Post by Drew DeVault
I think that these are the core protocols necessary for
cross-compositor compatibility and to support most existing tools for
X11 like ffmpeg. Considering the security goals of Wayland, it will also
likely be necessary to implement some kind of protocol for requesting
and granting sensitive permissions to clients.
How does this list look? What sorts of concerns do you guys have with
respect to what features each protocol needs to support? Have I missed
any major protocols that we'll have to work on? Once we have a good list
of requirements I'll start writing some XML.
I don't think we should start writing Wayland protocols for things
that have nothing to do with Wayland only because the program they are
going to be implemented in may already be doing Wayland things.
There simply is no reason for it.

We should simply use the IPC system that we already have and already
use for things like this (for example color management, inter-process
video pipelines, geolocation, notifications, music player control,
audio device discovery, accessibility, etc.).


Jonas
Drew DeVault
2016-03-29 03:33:15 UTC
Permalink
Post by Jonas Ådahl
I'm just going to put down my own personal thoughts on these. I mostly
agree with Carsten on all of this. In general, my opinion is that it is
completely pointless to add Wayland protocols for things that have
nothing to do with Wayland whatsoever; we have other display protocol
agnostic methods for that which fit much better.
I think these features have a lot to do with Wayland, and I still
maintain that protocol extensions make sense as a way of doing it. I
don't want to commit my users to dbus or something similar and I'd
prefer if I didn't have to make something unique to sway. It's probably
going to be protocol extensions for some of this stuff and I think it'd
be very useful for the same flexibility to be offered by other
compositors.
Post by Jonas Ådahl
As a rule of thumb for whether a feature needs a Wayland protocol or
not, one can consider whether a client needs to reference a client-side
object (such as a surface) on the server. If it does, we should add
a Wayland protocol; otherwise not. Another way of seeing it would be
"if this could be shared between Wayland/X11/Mir/..., then don't do it
in any of those".
I prefer to think of it as "who has logical ownership over this resource
that they're providing". The compositor has ownership of your output and
input devices and so on, and it should be responsible for making them
available.
Post by Jonas Ådahl
Post by Drew DeVault
- Screen capture
Why would this ever be a Wayland protocol? If a client needs to capture
its own content it doesn't need to ask the compositor; otherwise it's
the job of the compositor. If there needs to be a complex pipeline setup
that adds subtitles, muxing, sound effects and what not, we should make
use of existing projects that intend to create inter-process video
pipelines (pinos[0] for example).
FWIW, I believe remote desktop/screen sharing support partly falls under
this category as well, with the exception that it may need input event
injection as well (which of course shouldn't be a Wayland protocol).
As a side note, for GNOME, I have been working on an org.gnome-prefixed
D-Bus protocol for remote desktop that enables the actual remote desktop
things to be implemented in a separate process by providing pinos
streams, and I believe that at some point it would be good to have an
org.freedesktop.* (or equivalent) protocol doing that in a more desktop
agnostic way. Such a protocol could just as well be read-only, and
passed to something like ffmpeg (maybe even piped from gst-launch
directly to ffmpeg if you so wish) in order to do screen recording.
I know that Gnome folks really love their DBus, but I don't think that
it makes sense to use it for this. Not all of the DEs/WMs use dbus and
it would be great if the tools didn't have to know how to talk to it,
but instead had some common way of getting pixels from the compositor.

I haven't heard of Pinos before, but brief searches online make it look
pretty useful for this purpose. I think it can be involved here.
Post by Jonas Ådahl
Post by Drew DeVault
- Output configuration
This has nothing to do with Wayland either. If there is any need for
various compositors to support third party output configuration, at
least make it display protocol agnostic (D-Bus) so that there doesn't
have to be one implementation in each layer for each display protocol
when there is actually no point in doing so.
I'm dropping the output configuration protocols that I initially wanted
to make; I've come around. I think we just need to rethink fullscreen
requests to work with the permission model we come up with.
Post by Jonas Ådahl
Post by Drew DeVault
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
Sounds like a job for an xdg_* protocol. However, I think we need to
first settle on a bare minimum protocol, in order to be able to
stabilize anything. This bare minimum protocol needs to allow for
extensibility, making it possible to add things like negotiating how
decorations are drawn etc. The idea is that xdg_shell v6 will allow
this.
Of course we can add xdg_shell extensions already though (i.e.
stand-alone protocol extensions that extend xdg_shell).
This sounds reasonable.
Post by Jonas Ådahl
Post by Drew DeVault
- Input device configuration
Same as output configuration. There simply is no valid reason for adding
a Wayland protocol for it.
Same as output configuration, I've come around and we should probably
drop this, though again with the constraint that we should tweak things
like pointer-constraints to work with the permissions model.
Post by Jonas Ådahl
I don't think we should start writing Wayland protocols for things
that have nothing to do with Wayland only because the program they are
going to be implemented in may already be doing Wayland things.
There simply is no reason for it.
We should simply use the IPC system that we already have and already
use for things like this (for example color management, inter-process
video pipelines, geolocation, notifications, music player control,
audio device discovery, accessibility, etc.).
Most of what you mentioned (geolocation, notifications, music control,
audio device discovery) has nothing to do with Wayland. Why would they
have to use the same communication system? Things like how output/input
devices are handled, screen capture, and so on are very clearly Wayland
related and I think a Wayland solution for them is entirely acceptable.

--
Drew DeVault
Jonas Ådahl
2016-03-29 04:01:52 UTC
Permalink
Post by Drew DeVault
I think these features have a lot to do with Wayland, and I still
maintain that protocol extensions make sense as a way of doing it. I
don't want to commit my users to dbus or something similar and I'd
prefer if I didn't have to make something unique to sway. It's probably
going to be protocol extensions for some of this stuff and I think it'd
be very useful for the same flexibility to be offered by other
compositors.
I prefer to think of it as "who has logical ownership over this resource
that they're providing". The compositor has ownership of your output and
input devices and so on, and it should be responsible for making them
available.
I didn't say the display server shouldn't be the one exposing such an
API, I just think it is a bad idea to duplicate every display server
agnostic API for every possible display server protocol.
Post by Drew DeVault
I know that Gnome folks really love their DBus, but I don't think that
it makes sense to use it for this. Not all of the DEs/WMs use dbus and
it would be great if the tools didn't have to know how to talk to it,
but instead had some common way of getting pixels from the compositor.
So if you have a compositor or a client that wants to support three
display server architectures, it needs to implement all those three
APIs separately? Why can't we provide an API ffmpeg etc. can use no
matter if the display server happens to be the X server, sway or
Unity-on-Mir?

I don't see the point of not just using D-Bus just because you aren't
using it yet. It's already there, installed on your system, it's already
used by various other parts of the stack, and it will require a lot less
effort by clients and servers if they want to support more than
just Wayland.
Post by Drew DeVault
I haven't heard of Pinos before, but brief searches online make it look
pretty useful for this purpose. I think it can be involved here.
Pinos communicates via D-Bus, but pixels/frames are of course never
passed directly; they go via shared memory handles. What a screen
cast/remote desktop API would do is more or less to start/stop a pinos
stream and optionally inject events, and let the client know what stream
it should use.
Post by Drew DeVault
Most of what you mentioned (geolocation, notifications, music control,
audio device discovery) has nothing to do with Wayland. Why would they
have to use the same communication system? Things like how output/input
devices are handled, screen capture, and so on are very clearly Wayland
related and I think a Wayland solution for them is entirely acceptable.
Sorry, I don't see how you make the connection between "Wayland" and
"screen capture" other than that it may be implemented in the same
process. Wayland is meant to be used by clients to be able to pass
content to and receive input from the display server. It is not
intended to be a catch-all IPC replacing D-Bus.


Jonas
Pekka Paalanen
2016-03-29 11:09:30 UTC
Permalink
On Tue, 29 Mar 2016 12:01:52 +0800
Post by Jonas Ådahl
Sorry, I don't see how you make the connection between "Wayland" and
"screen capture" other than that it may be implemented in the same
process. Wayland is meant to be used by clients to be able to pass
content to and receive input from the display server. It is not
intended to be a catch-all IPC replacing D-Bus.
For the record, I totally agree with Jonas.

Let's not reinvent existing protocols just because you want to use
Wayland IPC, unless using Wayland IPC is actually a fundamental
requirement for the operation.

The fundamental requirement to use Wayland IPC is precisely the need to
reference Wayland protocol objects, e.g. wl_surface, or the need to
identify a Wayland client (a Wayland connection) without a trusted
third party like a container framework.
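
On the server side, that second case rests on a real libwayland API;
a sketch, with the policy itself stubbed out since it is necessarily
compositor-specific:

#include <stdbool.h>
#include <unistd.h>
#include <sys/types.h>
#include <wayland-server.h>

/* libwayland can report the credentials of the peer on the other end
 * of a client connection, so the compositor can identify who is
 * asking without any third party being involved. */
static bool
client_may_capture(struct wl_client *client)
{
    pid_t pid;
    uid_t uid;
    gid_t gid;

    wl_client_get_credentials(client, &pid, &uid, &gid);
    /* Stub: e.g. consult a Wayland Security Module or a whitelist the
     * user has confirmed. */
    return uid == getuid();
}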

Also, from what I bothered to read of the exhausting thread between
Drew and Carsten, I would agree with Carsten on practically every point.


Thanks,
pq
Drew DeVault
2016-03-29 11:41:10 UTC
Permalink
Post by Jonas Ådahl
I didn't say the display server shouldn't be the one exposing such an
API, I just think it is a bad idea to duplicate every display server
agnostic API for every possible display server protocol.
Do you foresee GNOME on Mir ever happening? We're trying to leave X
behind here. There won't be a Wayland replacement for a while. The
Wayland compositor has ownership over these resources and the Wayland
compositor is the one managing these resources - and it speaks the
Wayland protocol, which is extensible.
Post by Jonas Ådahl
So if you have a compositor or a client that wants to support three
display server architectures, it needs to implement all those three
APIs separately? Why can't we provide an API ffmpeg etc. can use no
matter if the display server happens to be the X server, sway or
Unity-on-Mir?
See above
Post by Jonas Ådahl
I don't see the point of not just using D-Bus just because you aren't
using it yet. It's already there, installed on your system, it's already
used by various other parts of the stack, and it will require a lot less
effort by clients and servers if they they want to support more than
just Wayland.
Not everyone has dbus on their system and it's not among my goals to
force it on people. I'm not taking a political stance on this and I
don't want it to devolve into a flamewar - I'm just not imposing either
side of the dbus/systemd argument on my users.
Post by Jonas Ådahl
Pinos communicates via D-Bus, but pixels/frames are of course never
passed directly; they go via shared memory handles. What a screen
cast/remote desktop API would do is more or less to start/stop a pinos
stream and optionally inject events, and let the client know what stream
it should use.
Hmm. Again going back to "I don't want to make the dbus decision for my
users", I would prefer to find a solution that's less dependent on it,
though I imagine taking inspiration from Pinos is quite reasonable.
Post by Jonas Ådahl
Sorry, I don't see how you make the connection between "Wayland" and
"screen capture" other than that it may be implemented in the same
process. Wayland is meant to be used by clients to be able to pass
content to and receive input from the display server. It is not
intended to be a catch-all IPC replacing D-Bus.
DBus is not related to Wayland. DBus is not _attached_ to Wayland. DBus
and Wayland are separate, unrelated protocols and solving Wayland
problems with DBus is silly.

--
Drew DeVault
Jonas Ådahl
2016-03-29 12:22:16 UTC
Permalink
Post by Drew DeVault
Post by Jonas Ådahl
I didn't say the display server shouldn't be the one exposing such an
API, I just think it is a bad idea to duplicate every display server
agnostic API for every possible display server protocol.
Do you foresee GNOME on Mir ever happening? We're trying to leave X
behind here. There won't be a Wayland replacement for a while. The
Wayland compositor has ownership over these resources and the Wayland
compositor is the one managing these resources - and it speaks the
Wayland protocol, which is extensible.
GNOME's mutter already works as a compositor for two separate
protocols: X11 and Wayland. Whenever possible, I by far prefer
deprecating the old way and replacing it with a display server protocol
agnostic solution over having a duplicate implementation for every such
thing.
Post by Drew DeVault
Post by Jonas Ådahl
So if you have a compositor or a client that wants to support three
display server architectures, it needs to implement all those three
APIs separately? Why can't we provide an API ffmpeg etc. can use no
matter if the display server happens to be the X server, sway or
Unity-on-Mir?
See above
Most if not all clients will for the foreseeable future most likely need
to support at least three protocols on Linux: X11, Wayland and Mir. I
don't see any of these going away any time soon, and I don't see any
reason to have three separate interfaces doing exactly the same thing.
Post by Drew DeVault
Post by Jonas Ådahl
I don't see the point of not just using D-Bus just because you aren't
using it yet. It's already there, installed on your system, it's already
used by various other parts of the stack, and it will require a lot less
effort by clients and servers if they want to support more than
just Wayland.
Not everyone has dbus on their system and it's not among my goals to
force it on people. I'm not taking a political stance on this and I
don't want it to devolve into a flamewar - I'm just not imposing either
side of the dbus/systemd argument on my users.
Post by Jonas Ådahl
Pinos communicates via D-Bus, but pixels/frames are of course never
passed directly; they go via shared memory handles. What a screen
cast/remote desktop API would do is more or less to start/stop a pinos
stream and optionally inject events, and let the client know what stream
it should use.
Hmm. Again going back to "I don't want to make the dbus decision for my
users", I would prefer to find a solution that's less dependent on it,
though I imagine taking inspiration from Pinos is quite reasonable.
We are not going to reimplement anything like Pinos via Wayland
protocols, so any client/compositor that wants to do anything related to
stream casting (anything that doesn't just make the content end up
directly on the filesystem) will either need to reimplement their own
private solution, or depend on something like Pinos which will itself
depend on D-Bus.
Post by Drew DeVault
Post by Jonas Ådahl
Sorry, I don't see how you make the connection between "Wayland" and
"screen capture" other than that it may be implemented in the same
process. Wayland is meant to be used by clients to be able to pass
content to and receive input from the display server. It is not
intended to be a catch-all IPC replacing D-Bus.
DBus is not related to Wayland. DBus is not _attached_ to Wayland. DBus
and Wayland are separate, unrelated protocols and solving Wayland
problems with DBus is silly.
So is screen casting/recording/sharing. It's a feature of a compositor,
not a feature of Wayland. Screen casting in the way you describe (pass
content to some client) will most likely have its frames passed via
D-Bus, so you'd still force your user to use D-Bus anyway.


Jonas
Pekka Paalanen
2016-03-29 13:36:52 UTC
Permalink
On Tue, 29 Mar 2016 07:41:10 -0400
Post by Drew DeVault
Not everyone has dbus on their system and it's not among my goals to
force it on people. I'm not taking a political stance on this and I
don't want it to devolve into a flamewar - I'm just not imposing either
side of the dbus/systemd argument on my users.
If you don't use what others use, then you use something different.

Just as you don't want to use what other people use, other people may
not want to use what you use, whatever the reasons on either side.

Wayland upstream/community/whatever can force neither you nor them to
act against their will.

So your only hope is to compete on technical excellence and
popularity.
Post by Drew DeVault
Post by Jonas Ådahl
Pinos communicates via D-Bus, but pixels/frames are of course never
passed directly; they go via shared memory handles. What a screen
cast/remote desktop API would do is more or less to start/stop a pinos
stream and optionally inject events, and let the client know what stream
it should use.
Hmm. Again going back to "I don't want to make the dbus decision for my
users", I would prefer to find a solution that's less dependent on it,
though I imagine taking inspiration from Pinos is quite reasonable.
Up to you, indeed, on what you force down your users' throats, but the
fact is, you will always force something on them. Your users don't have
the freedom of choice to use your compositor without Wayland either.
You chose Wayland, your users chose your software.
Post by Drew DeVault
Post by Jonas Ådahl
Sorry, I don't see how you make the connection between "Wayland" and
"screen capture" other than that it may be implemented in the same
process. Wayland is meant to be used by clients to be able to pass
content to and receive input from the display server. It is not
intended to be a catch-all IPC replacing D-Bus.
DBus is not related to Wayland. DBus is not _attached_ to Wayland. DBus
and Wayland are separate, unrelated protocols and solving Wayland
problems with DBus is silly.
Correct. Use each to its best effect; not all problems are nails.

If there already is a DBus based solution that just works, why would
someone write a new solution to replace that? There has to be a benefit
for replacing the old for the people using the old solution. It could
be a benefit for the end users of the old, or for the developers of the
old, but if the only benefit is for "outsiders", it gives no motivation.


Thanks,
pq
Martin Graesslin
2016-03-29 08:20:45 UTC
Permalink
Post by Drew DeVault
- Screen capture
- Output configuration
We have our kwin-kscreen specific protocol for this. You can find it at:
https://quickgit.kde.org/?p=kwayland.git&a=blob&h=9ebe342f7939b6dec45e2ebf3ad69e772ec66543&hb=818e320bd99867ea9c831edfb68c9671ef7dfc47&f=src%2Fclient%2Fprotocols%2Foutput-management.xml

and

https://quickgit.kde.org/?p=kwayland.git&a=blob&h=747fc264b7e6a40a65a0a04464c2c98036a84f0f&hb=818e320bd99867ea9c831edfb68c9671ef7dfc47&f=src%2Fclient%2Fprotocols%2Foutputdevice.xml

It's designed for our specific needs in Plasma. If it's useful for others, we
are happy to share and collaborate.

Cheers
Martin
Drew DeVault
2016-03-29 12:12:32 UTC
Permalink
Post by Martin Graesslin
Post by Drew DeVault
- Output configuration
https://quickgit.kde.org/?p=kwayland.git&a=blob&h=9ebe342f7939b6dec45e2ebf3ad69e772ec66543&hb=818e320bd99867ea9c831edfb68c9671ef7dfc47&f=src%2Fclient%2Fprotocols%2Foutput-management.xml
and
https://quickgit.kde.org/?p=kwayland.git&a=blob&h=747fc264b7e6a40a65a0a04464c2c98036a84f0f&hb=818e320bd99867ea9c831edfb68c9671ef7dfc47&f=src%2Fclient%2Fprotocols%2Foutputdevice.xml
It's designed for our specific needs in Plasma. If it's useful for others, we
are happy to share and collaborate.
It looks like something I could use in Sway. I like it. I'm going to see
how well it integrates with Sway and probably write a command line tool
to interface with it. I think that it would be useful to put this under
the permissions system, though, once that's put together.

--
Drew DeVault
Martin Graesslin
2016-03-30 06:13:17 UTC
Permalink
Post by Drew DeVault
Post by Martin Graesslin
Post by Drew DeVault
- Output configuration
https://quickgit.kde.org/?p=kwayland.git&a=blob&h=9ebe342f7939b6dec45e2ebf3ad69e772ec66543&hb=818e320bd99867ea9c831edfb68c9671ef7dfc47&f=src%2Fclient%2Fprotocols%2Foutput-management.xml
and
https://quickgit.kde.org/?p=kwayland.git&a=blob&h=747fc264b7e6a40a65a0a04464c2c98036a84f0f&hb=818e320bd99867ea9c831edfb68c9671ef7dfc47&f=src%2Fclient%2Fprotocols%2Foutputdevice.xml
It's designed for our specific needs in Plasma. If it's useful for others,
we are happy to share and collaborate.
It looks like something I could use in Sway. I like it. I'm going to see
how well it integrates with Sway and probably write a command line tool
to interface with it. I think that it would be useful to put this under
the permissions system, though, once that's put together.
We already have a command line tool for it :-)

It's kscreen-doctor which you can find in libkscreen. I've cc-ed sebas who
wrote this tool (and also the protocol).

And yes it's something for the permission system. We clearly do not want
everybody to be able to change the screen setup.

Cheers
Martin
Martin Graesslin
2016-03-29 08:25:10 UTC
Permalink
Post by Drew DeVault
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
Concerning drawing one's own decorations, we have implemented
https://quickgit.kde.org/?p=kwayland.git&a=blob&h=8bc106c7c42a40f71dad9a884824a7a9899e7b2f&hb=818e320bd99867ea9c831edfb68c9671ef7dfc47&f=src%2Fclient%2Fprotocols%2Fserver-decoration.xml

We would be very happy to share this one. It's already in use in Plasma 5.6
and so far we are quite satisfied with it. It's designed with convergence in
mind so that it's possible to easily switch the modes (e.g. decorated on
Desktop, not decorated on phone, no decorations for maximized windows, etc.).
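
From the client side, using it is small; a sketch assuming a header
generated from the server-decoration.xml linked above (interface names
as in KWayland; the exact header name depends on how you run
wayland-scanner):

#include <wayland-client.h>
#include "server-decoration-client-protocol.h"

/* Ask the compositor to draw the decoration for a surface. The
 * compositor answers with a "mode" event saying what it actually
 * chose, which is what makes the convergence switching above work. */
static void
prefer_server_side_deco(struct org_kde_kwin_server_decoration_manager *mgr,
                        struct wl_surface *surface)
{
    struct org_kde_kwin_server_decoration *deco =
        org_kde_kwin_server_decoration_manager_create(mgr, surface);

    org_kde_kwin_server_decoration_request_mode(
        deco, ORG_KDE_KWIN_SERVER_DECORATION_MODE_SERVER);
}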

I think that can be very helpful, especially for compositors like sway.
For Qt we implemented support in our QPT plugin for Plasma. So if sway
wants to use it, I can give you pointers on how to use it in your own
QPT plugin (and, if you don't have one yet, how to create one) and how
to use it to force QtWayland to not use the client-side decorations.

Cheers
Martin
Drew DeVault
2016-03-29 12:14:25 UTC
Permalink
Post by Martin Graesslin
Post by Drew DeVault
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
Concerning drawing one's own decorations, we have implemented
https://quickgit.kde.org/?p=kwayland.git&a=blob&h=8bc106c7c42a40f71dad9a884824a7a9899e7b2f&hb=818e320bd99867ea9c831edfb68c9671ef7dfc47&f=src%2Fclient%2Fprotocols%2Fserver-decoration.xml
Excellent. The protocol looks like it'll do just fine.
Post by Martin Graesslin
I think that can be very helpful, especially for compositors like sway.
For Qt we implemented support in our QPT plugin for Plasma. So if sway
wants to use it, I can give you pointers on how to use it in your own
QPT plugin (and, if you don't have one yet, how to create one) and how
to use it to force QtWayland to not use the client-side decorations.
I would love to see something like that. Can we work on a model that
would avoid making users install qt to install Sway? Honestly I'd like
to just set an environment variable to turn off CSD where possible, for
both Qt and GTK. I'm still trying to avoid forcing a toolkit on users.

--
Drew DeVault
Martin Graesslin
2016-03-30 06:26:04 UTC
Permalink
Post by Drew DeVault
Post by Martin Graesslin
Post by Drew DeVault
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
Concerning drawing one's own decorations, we have implemented
https://quickgit.kde.org/?p=kwayland.git&a=blob&h=8bc106c7c42a40f71dad9a884824a7a9899e7b2f&hb=818e320bd99867ea9c831edfb68c9671ef7dfc47&f=src%2Fclient%2Fprotocols%2Fserver-decoration.xml
Excellent. The protocol looks like it'll do just fine.
Post by Martin Graesslin
I think that can be very helpful, especially for compositors like sway.
For Qt we implemented support in our QPT plugin for Plasma. So if sway
wants to use it, I can give you pointers on how to use it in your own
QPT plugin (and, if you don't have one yet, how to create one) and how
to use it to force QtWayland to not use the client-side decorations.
I would love to see something like that. Can we work on a model that
would avoid making users install qt to install Sway?
Ah, I think there is a small misunderstanding about the QPT plugin. QPT is the
Qt Platform Theme, a plugin loaded into Qt applications to adapt them to the
platform. A standardized alternative to LD_PRELOAD, so to speak.

So you would not have to force a toolkit on your users. It's just that when a
Qt application is used, you can provide a plugin to make Qt apps behave better.
Post by Drew DeVault
Honestly I'd like
to just set an environment variable to turn off CSD where possible, for
both Qt and GTK. I'm still trying to avoid forcing a toolkit on users.
For Qt you can try:
export QT_WAYLAND_DISABLE_WINDOWDECORATION=1

It gets rid of Qt's client-side decorations and replaces them with nothing. We
have that set in our QPT plugin.
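
As a minimal sketch (mine, not something sway or KWin actually does), a
compositor could also export that variable for clients it spawns itself, so Qt
applications come up without CSD even if the user's environment doesn't set it:

    /* Hypothetical helper: spawn a client with QtWayland CSD disabled. */
    #include <stdlib.h>
    #include <unistd.h>

    static void spawn_client(const char *path, char *const argv[]) {
        if (fork() == 0) {
            /* Inherited by the child; tells QtWayland to skip its CSD. */
            setenv("QT_WAYLAND_DISABLE_WINDOWDECORATION", "1", 1);
            execvp(path, argv);
            _exit(127); /* exec failed */
        }
    }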

And yeah, in general it would be nice to have the toolkits implement this
protocol. We didn't go for that yet as we had a release schedule mismatch
between Plasma and Qt.

Cheers
Martin
Benoit Gschwind
2016-03-31 15:37:10 UTC
Permalink
Hello Drew,

After reading the thread, I think there are two separate questions mixed
together in your email, which is misleading, and most replies try to address
both at once. I think Daniel got the point (if I understood him well).

I read the following two questions:

[1] Since almost all compositors will need the following features:
- Screen capture
- Output configuration
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
- Input device configuration

it would be nice to sit around a table and define some shared protocols. For
those interested, I will start writing an XML spec; you are welcome to
contribute.

[2] These features are mandatory and should be in the core protocol.


If my reading is correct, I reply to [1]:

I'm in. I would like to avoid implementing tools that set up screen layout,
key mapping, and screen capture, so having a protocol to handle those cases is
welcome, speaking from my point of view as a WM developer (as opposed to a DE
developer).


For [2], I suggest that you start with not-yet-adopted protocol
specifications, and if they gather enough approval, you try to push them as
_optional_ standard protocols. By optional I mean that compositor developers
can choose whether to implement them; by standard I mean that if a developer
wants to implement those features, we strongly encourage them to use these
protocols instead of inventing new ones.

Best regards.

--
Benoit (blocage) Gschwind
Post by Drew DeVault
Greetings! I am the maintainer of the Sway Wayland compositor.
http://swaywm.org
It's almost the Year of Wayland on the Desktop(tm), and I have
reached out to each of the projects this message is addressed to (GNOME,
Kwin, and wayland-devel) to collaborate on some shared protocol
extensions for doing a handful of common tasks such as display
configuration and taking screenshots. Life will be much easier for
projects like ffmpeg and imagemagick if they don't have to implement
compositor-specific code for capturing the screen!
I want to start by establishing the requirements for these protocols.
Broadly speaking, I am looking to create protocols for the following use-cases:
- Screen capture
- Output configuration
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
- Input device configuration
I think that these are the core protocols necessary for
cross-compositor compatibility and to support most existing tools for
X11 like ffmpeg. Considering the security goals of Wayland, it will also
likely be necessary to implement some kind of protocol for requesting
and granting sensitive permissions to clients.
How does this list look? What sorts of concerns do you guys have with
respect to what features each protocol needs to support? Have I missed
any major protocols that we'll have to work on? Once we have a good list
of requirements I'll start writing some XML.
--
Drew DeVault
Peter Hutterer
2016-04-01 03:18:58 UTC
Permalink
Post by Benoit Gschwind
Hello Drew,
After reading the thread, I think there are two separate questions mixed
together in your email, which is misleading, and most replies try to address
both at once. I think Daniel got the point (if I understood him well).
- Screen capture
- Output configuration
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
- Input device configuration.
it would be nice to sit around a table and define some shared protocols. For
those interested, I will start writing an XML spec; you are welcome to
contribute.
I'm mostly talking about the input device configuration here, but an XML spec
is the wrong place to start, imo. As I said above, it won't add much, and you
still have to do the implementation everywhere. The only meaningful thing you
can do is write a library that compositors *want* to use, one that reads the
configuration items from some magic place and applies them to the libinput
device.
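
To make the idea concrete, here is a minimal sketch (my illustration, not an
existing library) of what the core of such a library could look like. The
config_get_* helpers and their "magic place" are hypothetical; the
libinput_device_config_* calls are the real libinput configuration API:

    /* Read stored settings and apply them to a libinput device. */
    #include <libinput.h>
    #include <stdbool.h>

    /* Hypothetical lookups into wherever the settings are stored. */
    bool config_get_bool(const char *device, const char *key, bool def);
    double config_get_double(const char *device, const char *key, double def);

    void apply_input_config(struct libinput_device *dev) {
        const char *name = libinput_device_get_name(dev);

        /* Tap-to-click, only where the hardware supports tapping. */
        if (libinput_device_config_tap_get_finger_count(dev) > 0) {
            bool tap = config_get_bool(name, "tap-to-click", true);
            libinput_device_config_tap_set_enabled(dev,
                tap ? LIBINPUT_CONFIG_TAP_ENABLED
                    : LIBINPUT_CONFIG_TAP_DISABLED);
        }

        /* Pointer acceleration, normalized to [-1.0, 1.0]. */
        if (libinput_device_config_accel_is_available(dev)) {
            double speed = config_get_double(name, "pointer-accel", 0.0);
            libinput_device_config_accel_set_speed(dev, speed);
        }
    }

The compositor would call apply_input_config() for every device it adds to its
libinput context, which is exactly the part that stays compositor-specific.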

And for those settings that must be handled in the compositor (key remappings,
for example) you'll essentially end up writing a libcompositorinput. But now
you're quite close to internal compositor semantics, so making this a generic
thing is not going to be trivial.

Cheers,
Peter
Post by Benoit Gschwind
[2] These features are mandatory and should be in the core protocol.
I'm in. I would like to avoid implementing tools that set up screen layout,
key mapping, and screen capture, so having a protocol to handle those cases is
welcome, speaking from my point of view as a WM developer (as opposed to a DE
developer).
For [2], I suggest that you start with not-yet-adopted protocol
specifications, and if they gather enough approval, you try to push them as
_optional_ standard protocols. By optional I mean that compositor developers
can choose whether to implement them; by standard I mean that if a developer
wants to implement those features, we strongly encourage them to use these
protocols instead of inventing new ones.
Best regards.
--
Benoit (blocage) Gschwind
Post by Drew DeVault
Greetings! I am the maintainer of the Sway Wayland compositor.
http://swaywm.org
It's almost the Year of Wayland on the Desktop(tm), and I have
reached out to each of the projects this message is addressed to (GNOME,
Kwin, and wayland-devel) to collaborate on some shared protocol
extensions for doing a handful of common tasks such as display
configuration and taking screenshots. Life will be much easier for
projects like ffmpeg and imagemagick if they don't have to implement
compositor-specific code for capturing the screen!
I want to start by establishing the requirements for these protocols.
Broadly speaking, I am looking to create protocols for the following use-cases:
- Screen capture
- Output configuration
- More detailed surface roles (should it be floating, is it a modal,
does it want to draw its own decorations, etc)
- Input device configuration
I think that these are the core protocols necessary for
cross-compositor compatibility and to support most existing tools for
X11 like ffmpeg. Considering the security goals of Wayland, it will also
likely be necessary to implement some kind of protocol for requesting
and granting sensitive permissions to clients.
How does this list look? What sorts of concerns do you guys have with
respect to what features each protocol needs to support? Have I missed
any major protocols that we'll have to work on? Once we have a good list
of requirements I'll start writing some XML.
--
Drew DeVault
Ladislav Igrec
2016-03-31 23:05:15 UTC
Permalink
Hello, I have a proposal for this. This is my first time using a mailing list,
so I hope I'm doing it right-ish.


Protocol for screenshots:

client -> server: "can I get a screenshot?"
server -> client: "sure, here it is" / "no"

The server would send the screenshot via memfd or write() or something, framed
as something like [type/format][length][data] (a bit more future-proof than
that, though).
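
To illustrate, here is a minimal sketch of that framing in C, with the
screenshot pixels passed as a memfd through SCM_RIGHTS. Every name here is
made up for the example, not part of any existing protocol:

    /* Illustrative wire framing for the proposed extension socket. */
    #include <stdint.h>
    #include <string.h>
    #include <sys/socket.h>
    #include <sys/uio.h>

    struct ext_msg_header {
        uint32_t type;   /* one of enum ext_msg_type */
        uint32_t length; /* bytes of payload following this header */
    };

    enum ext_msg_type {
        EXT_SCREENSHOT_REQUEST = 1,
        EXT_SCREENSHOT_REPLY   = 2, /* payload: width/height/format; fd attached */
        EXT_DENIED             = 3, /* the "no", e.g. NO_PRIVILEGE */
        EXT_UNKNOWN_REQUEST    = 4, /* "IDK what you are talking about" */
    };

    /* Send a header plus an out-of-band file descriptor (the memfd). */
    static int send_with_fd(int sock, const struct ext_msg_header *hdr, int fd) {
        struct iovec iov = { .iov_base = (void *)hdr, .iov_len = sizeof(*hdr) };
        char ctrl[CMSG_SPACE(sizeof(int))];
        struct msghdr msg = {
            .msg_iov = &iov, .msg_iovlen = 1,
            .msg_control = ctrl, .msg_controllen = sizeof(ctrl),
        };
        struct cmsghdr *cmsg = CMSG_FIRSTHDR(&msg);
        cmsg->cmsg_level = SOL_SOCKET;
        cmsg->cmsg_type = SCM_RIGHTS;
        cmsg->cmsg_len = CMSG_LEN(sizeof(int));
        memcpy(CMSG_DATA(cmsg), &fd, sizeof(int));
        return sendmsg(sock, &msg, 0);
    }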

For hotkeys:
client -> server: "can I get all Alt+F5 events?"
server -> client: "sure, I'll send them to you" / "no"

For streaming, I don't know exactly how it works under Wayland. If a
compositor can close the stream, then there is no problem.

I suggest using a Unix domain socket (UDS) as the transport layer, for three reasons:
1. if the socket is not in the predetermined place (for example /run/wayland-ext/$COMPOSITOR_PID), then the compositor obviously doesn't support the extensions
(while we're at it, this doesn't have to be Wayland-exclusive)
2. the compositor can check the PID of the client
3. the compositor can send fds


Rationales:
The client does not have to fetch the list of supported features up front,
since the compositor saying "no" means "no", for whatever reason. Of course, a
"what do you support?" request should be part of the protocol anyway.
(The "no" can also be "NO_PRIVILEGE" or "IDK_WHAT_YOU_ARE_TALKING_ABOUT".)

Since the transport is a UDS, the compositor can get the PID, UID, and GID of
the client, and can then verify the client by looking at its /proc/$PID/exe,
as sketched below.
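
A minimal sketch of that check; SO_PEERCRED and /proc/$PID/exe are real Linux
interfaces, while the policy decision itself is left to the compositor:

    #define _GNU_SOURCE /* struct ucred */
    #include <stdio.h>
    #include <sys/socket.h>
    #include <unistd.h>

    /* Resolve the peer's executable path into buf; returns 0 on success. */
    static int peer_exe(int client_sock, char *buf, size_t len) {
        struct ucred cred;
        socklen_t cred_len = sizeof(cred);
        if (getsockopt(client_sock, SOL_SOCKET, SO_PEERCRED,
                       &cred, &cred_len) < 0)
            return -1;

        char link[64];
        snprintf(link, sizeof(link), "/proc/%d/exe", cred.pid);
        ssize_t n = readlink(link, buf, len - 1); /* note: PIDs can be recycled */
        if (n < 0)
            return -1;
        buf[n] = '\0';
        return 0; /* cred.uid and cred.gid are also available to the policy */
    }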

This minimizes the whole privileges question in the protocol and lets
compositor writers choose how to implement it. Examples:
a compositor that always does/allows whatever it can
a compositor that asks the user, then (optionally) remembers the answer
a compositor that asks some privileges daemon "can this process do this?"

I would suggest that the compositor keep a list in a human-readable format, in
a file that only the "wayland_compositor" GID can read or write, something like:
$EXE_FROM_PROC $UID $GID PERMISSION1 PERMISSION2 etc.
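
For illustration, a couple of entries in such a file might look like this (the
paths and permission names are hypothetical):

    /usr/bin/screenshot-tool 1000 1000 SCREENSHOT
    /usr/bin/screen-recorder 1000 1000 SCREENSHOT STREAM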



Ladislav Igrec

"hope-ing he is making sense"