alloc_percpu: remove alignment on cpu_workqueue_struct.

Since we're improving alloc_percpu, there's no need to waste space by
cacheline-aligning struct cpu_workqueue_struct. On 32-bit, dropping
____cacheline_aligned shrinks the struct from 128 to 64 bytes.
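
The effect can be reproduced in userspace with a rough sketch like the one
below. It is illustrative only: struct demo_plain, struct demo_aligned,
DEMO_CACHELINE and demo_total() are hypothetical names, the fields are not
the real cpu_workqueue_struct layout, and the 128-byte line size is an
assumption chosen to match the "before" size quoted above. The GCC aligned
attribute stands in for ____cacheline_aligned.

```c
#include <assert.h>	/* static_assert (C11) */
#include <stdalign.h>	/* alignof (C11) */
#include <stddef.h>	/* size_t */

/* Assumed cache line size for this sketch; the kernel gets it from the arch. */
#define DEMO_CACHELINE 128

/* Hypothetical stand-in for a small per-cpu struct, naturally aligned. */
struct demo_plain {
	void *worklist;
	void *wq;
	void *thread;
	int run_depth;
};

/* Same fields, but with the alignment forced up to a full cache line. */
struct demo_aligned {
	void *worklist;
	void *wq;
	void *thread;
	int run_depth;
} __attribute__((__aligned__(DEMO_CACHELINE)));

/*
 * sizeof() is always a multiple of the type's alignment, so the aligned
 * variant is padded out to a whole cache line even though its fields
 * need far less.
 */
static_assert(alignof(struct demo_aligned) == DEMO_CACHELINE,
	      "aligned variant must be cache line aligned");
static_assert(sizeof(struct demo_aligned) % DEMO_CACHELINE == 0,
	      "aligned variant must be padded to whole cache lines");
static_assert(sizeof(struct demo_plain) < sizeof(struct demo_aligned),
	      "plain variant must be smaller than the padded one");

/* Bytes consumed by nr objects of the given size (the "Total" column). */
static inline size_t demo_total(size_t nr, size_t size)
{
	return nr * size;
}
```

The padding is paid once per object, so the saving grows with the number of
allocations, which is what the Total column in the table below measures.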

File and line			Number		Size		Total
Before:
  kernel/workqueue.c:819	 72		 128		 9216
After:
  kernel/workqueue.c:819	 72		  64		 4608

It's still the biggest dynamic percpu user though.

Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
---
 kernel/workqueue.c |    2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/kernel/workqueue.c b/kernel/workqueue.c
--- a/kernel/workqueue.c
+++ b/kernel/workqueue.c
@@ -50,7 +50,7 @@ struct cpu_workqueue_struct {
 	struct task_struct *thread;
 
 	int run_depth;		/* Detect run_workqueue() recursion depth */
-} ____cacheline_aligned;
+};
 
 /*
  * The externally visible workqueue abstraction is an array of
